I am working on a machine learning model(classification) where my dataset is imbalanced and i want to balance it by using oversample() function from 'imbalance' package in R.
Below are the codes used for oversampling where 'Final.Status' is my response variable and it's a factor data type.
training <- na.omit(training)
training.oversamp <- oversample(training,method = "SMOTE",classAttr = 'Final.Status')
But while doing it i am getting below error:
Error in dataset[, classAttr] == c :
comparison of these types is not implemented
In addition: Warning message:
In which(dataset[, classAttr] == c) :
Incompatible methods ("Ops.data.frame", "Ops.factor") for "=="
Also out of curiosity can anyone brief different methods used in oversample() function and which one is commonly used.
I had the exact same error and my solution was to transform the data set from
tibble
todata.frame
.In your case it could be as follows:
training.oversamp <- oversample(as.data.frame(training),method = "SMOTE",classAttr = 'Final.Status')