my assignment is to store the following into an array type column:
def sample_udf(df:SparkDataFrame):
device_issues = []
if (df['altitude'] == 0):
return "alt"
elif (df['latitude'] <= -90
or df['latitude'] >=90):
return "gps_lat"
elif (df['longitude'] <= -180
or df['longitude'] >= 180):
return "gps_long"
elif (df['direction'] < 0
or df['direction'] > 359):
return "gps_direction"
else:
return device_issues
df_new = df.withColumn("deviceIssues", sample_udf(f.col("altitude"), f.col("latitude")))
when I run that cmd, I got this error:
TypeError: anomaly_detections() takes 1 positional argument but 2 were given
any help will be appreciated
I'm expecting that the column "deviceIssues" will be in arraytype column.
Try:
Output: