I have found the outliers in my data using the box plot method.
enter image description here Box plot Before applying IQR Method
file1.shape
# (457, 11)
I have applied the IQR method to the data.
q1, q2, q3 = file1['Salary'].quantile([0.25, 0.5, 0.75])
IQR = q3 - q1
f_data = file1[(file1['Salary'] > lower_bound) & (file1['Salary'] < upper_bound)]
And I removed a few data points.
f_data.shape
# (420, 11)
However, after reviewing the filtered data using a box plot, I still found a few outliers in my data.
enter image description here Box plot after applying the IQR method.
What should i do now.
Do i have to perform the IQR method again on the filtered data.
The Salary data is right skewed data . It's skew value is around 1.5
Or should I decrease the skew value. Like using log, power methods.
I think you are wrongly using IQR,
then
should work.