I've started playing with Apache Parquet and was surprised to find two writer versions:
PARQUET_1_0 ("v1"),
PARQUET_2_0 ("v2");
I tried to get the metadata/dump using parquet-tools to determine the version, but it did not include this info.
Currently I have a parquet file.
How do I determine the parquet write version used to write this file?
You can use pyarrow.parquet to view the writer version of the Parquet file:
This would print something like:
And format_version is what you are looking for.
See https://arrow.apache.org/docs/python/parquet.html