I tried to calculate the confusion matrix after I conduct the decision tree model
# tree model
tree <- rpart(LoanStatus_B ~.,data=train, method='class')
# confusion matrix
pdata <- predict(tree, newdata = test, type = "class")
confusionMatrix(data = pdata, reference = test$LoanStatus_B, positive = "1")
How can I set the threshold to my confusion matrix, say maybe I want probability above 0.2 as default, which is the binary outcome.
Several things to note here. Firstly, make sure you're getting class probabilities when you do your predictions. With prediction type
="class"
you were just getting discrete classes, so what you wanted would've been impossible. So you'll want to make it"p"
like mine below.Next note that .5 here is just an arbitrary value -- you can change it to whatever you want.
I don't see a reason to use the
confusionMatrix
function, when a confusion matrix can be created simply this way and allows you to acheive your goal of easily changing the cutoff.Having said that, if you do want to use the
confusionMatrix
function for your confusion matrix, then just create a discrete class prediction first based on your custom cutoff like this:Where, again, .5 is your custom chosen cutoff and can be anything you want it to be.