I try to use ROSE library on R to rebalancing target variable in my dataset. Here is my information of my dataset.
- My original dataset have total 132056 records.
- There are total 279 cases (0.21%) of minor class in target variable.
- There are total 131777 cases (99.79%) of major class in target variable.
I would like to undersampling the dataset to make the percentage of minor class increase to 5%.
Here is my code :
df_Under <- ovun.sample(Target ~ ., data = df, method = "under", N =5580, seed = 1)
However, after run the code above, I got the following error message.
"Error in (function (formula, data, method, subset, na.action, N, p = 0.5, :Too few observations."
I tried play with other method of ROSE such as "over" and "both" but there are the same error occurs.
How can I fixed this problem ?
Kind regards,
I was facing the same problem. The problem was actually in the dataset which had columns (variables) with NA/Nan.
Please try running the code after NA removal.
Let me know if this helps.