Is there a way to specify the schema of a PySpark DataFrame returned by a query in df = spark.sql(...)? Specifically, I am looking for a way to specify that certain columns must have nullable = false.
This answer shows that you can change the schema by creating a new DataFrame with spark.createDataFrame(df.rdd, df.schema), but as a comment there points out, this is very costly.
Ideally, I would like to flip the flag in place, something like this (note the StructField attribute is nullable, not nullability, and I want it set to False):

df.schema["column name"].nullable = False

As far as I can tell, though, this only mutates the Python-side StructType object and has no effect on the DataFrame itself.
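For concreteness, here is a minimal sketch of the createDataFrame workaround I am trying to avoid. The table name, column names, and the required_cols set are all placeholders:

from pyspark.sql import SparkSession
from pyspark.sql.types import StructType, StructField

spark = SparkSession.builder.getOrCreate()

# Hypothetical query; "some_table" and the column names are placeholders.
df = spark.sql("SELECT id, name, comment FROM some_table")

# Columns that should end up with nullable = False (example names).
required_cols = {"id", "name"}

# Rebuild the schema, flipping the nullable flag on the chosen columns.
new_schema = StructType([
    StructField(f.name, f.dataType, nullable=(f.name not in required_cols))
    for f in df.schema.fields
])

# The workaround from the linked answer: round-trip through the RDD.
# It works, but serializing every row between the JVM and Python is
# exactly the cost I am hoping to avoid.
fixed_df = spark.createDataFrame(df.rdd, new_schema)
print(fixed_df.schema)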