What is bucketBy equivalent in spark dataframe V2 API or Iceberg?

44 Views Asked by At

We have Spark dataframe V1 API with bucketBy option.

df0.write
   .bucketBy(50, "userid")
   .saveAsTable("myHiveTable")

I don't see similar option in DataFrameWriterV2 API.

What is the equivalent to bucketBy in V2 or Iceberg?

I couldn't find any details on clustredBy/bucketBy in Spark documentation.

0

There are 0 best solutions below