I have a data set with a binary 0/1 outcome (click and purchase vs. click and no purchase) and a vector of attributes for each observation. I used logistic regression to get the probability of purchase. How can I get the same kind of probabilities from a random forest? Should I use random forest regression, or random forest classification with type='prob' in R, which gives the probability of each class of a categorical variable?
Can we get probabilities from a random forest the same way we get them from logistic regression?
164 Views Asked by DataScience2020 At
It won't give you the same result, since the two methods are structured differently. Logistic regression is defined by an explicit linear specification, whereas a random forest (RF) is a collective vote from many independent, randomized trees. If the specification and the input features are properly tuned for both, they can produce comparable results. Here are the major differences between the two:
RF gives a fit that is more robust to noise, outliers, overfitting, and multicollinearity, which are common pitfalls in regression-type solutions. Basically, if you don't know, or don't want to know, much about what's going on with the input data, RF is a good start.
Logistic regression is a good choice if you know the data well and know how to specify the equation properly, or if you want to engineer how the fit and prediction work. The explicit form of the GLM specification allows you to do that.
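To make the structural difference concrete, here is a minimal sketch in plain Python (standard library only) rather than R. It is a toy, not the randomForest implementation: the "trees" are one-split stumps on a single feature, and the data (`time_on_page`, `purchased`), the helper names, and the logistic coefficients are all made up for illustration. What it shows is that a classification forest's probability is the fraction of trees voting for the class, while logistic regression computes a probability from an explicit formula. As far as I know, that vote fraction is also what `predict(..., type = "prob")` reports for a classification forest in R's randomForest package, so classification with `type = "prob"` is the closer match to what the question asks for.

```python
import math
import random


def logistic_proba(x, intercept, slope):
    """Logistic regression: probability from an explicit linear formula."""
    return 1.0 / (1.0 + math.exp(-(intercept + slope * x)))


def fit_stump(xs, ys):
    """Fit a one-split tree that predicts 1 when x > threshold, else 0.

    Returns the candidate threshold with the fewest training errors.
    """
    # min(xs) - 1 lets a stump predict 1 for every training point.
    candidates = [min(xs) - 1] + sorted(set(xs))
    return min(
        candidates,
        key=lambda t: sum((x > t) != (y == 1) for x, y in zip(xs, ys)),
    )


def fit_forest(xs, ys, n_trees=200, seed=0):
    """Train each stump on a bootstrap resample (rows drawn with replacement)."""
    rng = random.Random(seed)
    n = len(xs)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]
        forest.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))
    return forest


def forest_proba(forest, x):
    """Forest probability of class 1: the fraction of trees that vote 1."""
    votes = sum(x > threshold for threshold in forest)
    return votes / len(forest)


# Hypothetical toy data: one attribute and a noisy 0/1 purchase outcome.
time_on_page = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
purchased = [0, 0, 0, 1, 0, 1, 0, 1, 1, 1]

forest = fit_forest(time_on_page, purchased)

for x in (2.5, 5.5, 8.5):
    # The logistic coefficients here are illustrative, not fitted to the data.
    print(x, forest_proba(forest, x), round(logistic_proba(x, -5.5, 1.0), 3))
```

The forest's output moves in steps of 1/n_trees and depends on how the resampled trees happen to split, whereas the logistic curve is smooth and fully determined by its coefficients. That is why the two sets of probabilities can be comparable but are not the same.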