I ran this SQL query in databricks to check the distinct values of a column in a parquet file:
SELECT distinct country
FROM parquet_table
This took 1.31 hours to run. Am I doing something wrong here that such a simple query is taking so long?
I ran this SQL query in databricks to check the distinct values of a column in a parquet file:
SELECT distinct country
FROM parquet_table
This took 1.31 hours to run. Am I doing something wrong here that such a simple query is taking so long?
Copyright © 2021 Jogjafile Inc.