YugabyteDB internals on range queries over hash-partitioned tables

Question

YugabyteDB internals on range queries over hash-partitioned tables

22 Views Asked by dh YB At 15 January 2024 at 07:33

Is there any good reading on how YB implements range queries over hash-partitioned data? I'm curious because this is a hard problem to solve (yes, even with a good solution, it's generally not advisable to do this query...)

Original Q&A

There are 1 best solutions below

**dh YB** · Answer 1 · 2024-01-15T07:33:39.290000

When you are using hash sharding, a range query on the shard key will result in a scatter/gather type query as there is no way to know what tablets to route the queries to. As a general rule, if you are planning on doing a lot or range queries on the data, consider range based sharding instead. Designing a proper range based shard key does require a lot of considerations in order to avoid data imbalances and hot spots. There is a blog article here on how we chose our sharding strategies: https://www.yugabyte.com/blog/four-data-sharding-strategies-we-analyzed-in-building-a-distributed-sql-database/#google-spanner-and-hbase-range-sharding

One solution could be to union all all possible hash codes (from 0 to 65535) because where yb_hash_code(id)=0 is pushed down to get to the right tablet and the range of hash. But that would do 65536 calls which will not be faster.

When you do hash sharding, the hash is at the start of the primary key (the key in rocksdb) https://docs.yugabyte.com/preview/architecture/docdb-sharding/sharding/#hash-sharding

YugabyteDB internals on range queries over hash-partitioned tables

There are 1 best solutions below

Related Questions in YUGABYTEDB

Trending Questions

Popular # Hahtags

Popular Questions