What is the equivalent of bucketBy in the Spark DataFrameWriterV2 API or Iceberg?

Asked by user2417458 on 16 January 2024 at 22:17

The Spark DataFrame V1 writer API has a bucketBy option:

df0.write
   .bucketBy(50, "userid")
   .saveAsTable("myHiveTable")

I don't see a similar option in the DataFrameWriterV2 API.

What is the equivalent of bucketBy in V2 or Iceberg?

I couldn't find any details on clusteredBy/bucketBy for the V2 API in the Spark documentation.
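The closest candidate seems to be the bucket partition transform, passed to partitionedBy on the V2 writer. Below is a minimal sketch, assuming Spark 3.x, a session already configured with a V2 catalog such as Iceberg, and a hypothetical table name my_catalog.db.my_table; the sample DataFrame is only a stand-in for df0. Whether this gives the same join and shuffle behavior as V1 bucketBy is still the open part of the question.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{bucket, col}

object BucketTransformSketch {
  def main(args: Array[String]): Unit = {
    // Assumption: the session is already configured with a V2 catalog
    // (for example an Iceberg catalog registered as "my_catalog").
    val spark = SparkSession.builder()
      .appName("bucket-transform-sketch")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical stand-in for df0 from the question.
    val df0 = Seq((1L, "a"), (2L, "b")).toDF("userid", "payload")

    // bucket(50, col("userid")) is a partition transform; it is only
    // meaningful as an argument to partitionedBy on DataFrameWriterV2.
    df0.writeTo("my_catalog.db.my_table") // hypothetical table identifier
      .partitionedBy(bucket(50, col("userid")))
      .create()

    spark.stop()
  }
}
```

In Iceberg this is a hidden partition transform rather than Hive-style bucketing, so the file layout and hash function are not expected to match what bucketBy writes for a Hive table.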
