I have a data set with a binary 0/1 outcome (click and purchase vs. click and no purchase) and a vector of attributes for each observation. I used logistic regression to get the probability of a purchase. How can I use Random Forest to get the same kind of probabilities? Should I use Random Forest regression, or Random Forest classification with type='prob' in R, which gives the probability of each level of a categorical variable?
Can a random forest give probabilities in the same way that logistic regression does?
Asked by DataScience2020
The two methods won't give you identical results, because their structures are different. Logistic regression follows an explicit linear specification, whereas RF takes a collective vote from many independent, randomized trees. If the specification and the input features are properly tuned for both, they can produce comparable results. Here are the major differences between the two:
RF gives a more robust fit against noise, outliers, overfitting, and multicollinearity, which are common pitfalls in regression-type solutions. Basically, if you don't know, or don't want to know, much about what's going on with the input data, RF is a good start.
Logistic regression is a good choice if you know the data well and know how to specify the equation properly, or if you want to engineer how the fit and prediction work. The explicit form of the GLM specification allows you to do that.
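To make the structural difference concrete, here is a minimal sketch in plain Python (standard library only), not a production implementation. It assumes a toy data set and one-split trees ("stumps"); the names `ROWS`, `LABELS`, `fit_stump`, `fit_forest`, `forest_probability`, and `logistic_probability` are all hypothetical and chosen for illustration. The point is that a classification forest's "probability" is the fraction of trees voting for the class, while logistic regression passes a linear score through a sigmoid. In R's randomForest package, the analogous quantity is what `predict()` on a classification forest returns with `type = "prob"`.

```python
import math
import random

# Toy data (assumed for illustration): two numeric attributes per click,
# label 1 = purchase, 0 = no purchase.
ROWS = [(1, 10), (2, 12), (3, 11), (4, 13), (6, 30), (7, 28), (8, 31), (9, 29)]
LABELS = [0, 0, 0, 0, 1, 1, 1, 1]


def logistic_probability(weights, bias, x):
    """Logistic regression: sigmoid of an explicit linear score."""
    score = bias + sum(w * xi for w, xi in zip(weights, x))
    return 1.0 / (1.0 + math.exp(-score))


def majority(labels):
    """Majority class of a list of 0/1 labels (ties go to 1)."""
    return int(sum(labels) * 2 >= len(labels))


def fit_stump(rows, labels, rng):
    """Fit a one-split tree on one randomly chosen feature."""
    feature = rng.randrange(len(rows[0]))
    best = None
    for threshold in sorted({r[feature] for r in rows}):
        left = [y for r, y in zip(rows, labels) if r[feature] <= threshold]
        right = [y for r, y in zip(rows, labels) if r[feature] > threshold]
        if not left or not right:
            continue
        left_vote, right_vote = majority(left), majority(right)
        errors = (sum(y != left_vote for y in left)
                  + sum(y != right_vote for y in right))
        if best is None or errors < best[0]:
            best = (errors, threshold, left_vote, right_vote)
    if best is None:
        # The feature is constant in this sample: always predict the majority.
        vote = majority(labels)
        return (feature, float("inf"), vote, vote)
    return (feature, best[1], best[2], best[3])


def fit_forest(rows, labels, n_trees=200, seed=0):
    """Fit each stump on a bootstrap sample (rows drawn with replacement)."""
    rng = random.Random(seed)
    n = len(rows)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]
        sample_rows = [rows[i] for i in idx]
        sample_labels = [labels[i] for i in idx]
        forest.append(fit_stump(sample_rows, sample_labels, rng))
    return forest


def forest_probability(forest, x):
    """Estimated P(purchase | x): the fraction of trees that vote for class 1."""
    votes = [left if x[feature] <= threshold else right
             for feature, threshold, left, right in forest]
    return sum(votes) / len(votes)


if __name__ == "__main__":
    forest = fit_forest(ROWS, LABELS)
    print("forest, low attributes: ", forest_probability(forest, (1, 10)))
    print("forest, high attributes:", forest_probability(forest, (9, 31)))
    # Weights and bias here are made up, not fitted to the toy data.
    print("logistic, made-up model:", logistic_probability([0.8, 0.1], -6.0, (9, 31)))
```

Because the forest's estimate is a vote fraction, it can only take values that are multiples of 1/n_trees, and it need not match the smooth curve that a logistic model produces for the same inputs. For a 0/1 outcome, classification with vote fractions is the setting this sketch illustrates; whether regression on the 0/1 target gives comparable numbers in a given package is something to check against that package's documentation.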