I am new to python and I am working on a requirement to list all unique value in a categorical column along with frequency of each value and the % frequency of each value in a column and using a for loop to perform it on the complete dataset. Also I am not sure if I have to use pd.Series to append data into a dataframe as per the screenshot attached because the length of the columns are different based on the unique values in a column.
Appreciate your help.
The below is the code I tried to work out but I am not able to workout on the other columns for unique value and % of frequency and create it as a data frame so that I can export it to CSV
Count_df = []
for item in df.columns:
Count_df_ = pd.DataFrame(df1[item].value_counts())
Count_df.append(Count_df_)
Count_dfdf = pd.DataFrame(Count_df)
Count_dfdf
Count_dfdf.to_csv(path_or_buf = Output + '_' + 'Count_.csv')
The input and Output expected is as below and the same is attached as an :
[Input data and expected Output][1]
Thanks in advance
No magic. Just append the output DataFrame column-by-column patiently.
Here I assume a 4-columned output in a single
.csv
file. Based on personal work experience, this format is more handy than separate files for spreadsheet softwares. However, separated output is also possible within in the loop.Code:
Output