Is it OK(legitimate approach) to use Generalized Linear Model with Gaussian family distribution to predict discrete quantities by for example rounding the output of Gaussian GLM to the nearest integer?
Using Gaussian family distribution to predict discrete quantities in GLM
797 Views Asked by user1533224 At
1
There are 1 best solutions below
Related Questions in STATISTICS
- How do I code a Mixed effects model for abalone growth in Aquaculture nutrition with nested individuals
- Normally Distributed Z score
- R sensitivity package (fast99)
- How to write conditional probability in Python?
- Skewness in SPSS - SPSS answers differently in multiple PC
- Memory-efficient Benjamini-Hochberg FDR correction using numpy/h5py
- Google Analytics Cohort data doesn't make sense
- Detect patterns in data set
- Elasticsearch date histogram aggregation includes more than one bucket per month
- Calculate variance in bash
- How to compute relative difference in a circular domain (weekday) in R
- Draw geom_smooth only for fits that are significant
- How to apply Chisq.test on levels of different categorical variables?
- awk - count number of occurences for a field in a line containing another specific field
- Can't use scipy stats function on nested list
Related Questions in MACHINE-LEARNING
- How to cluster a set of strings?
- Enforcing that inputs sum to 1 and are contained in the unit interval in scikit-learn
- scikit-learn preperation
- Spark MLLib How to ignore features when training a classifier
- Increasing the efficiency of equipment using Amazon Machine Learning
- How to interpret scikit's learn confusion matrix and classification report?
- Amazon Machine Learning for sentiment analysis
- What Machine Learning algorithm would be appropriate?
- LDA generated topics
- Spectral clustering with Similarity matrix constructed by jaccard coefficient
- Speeding up Viterbi execution
- Memory Error with Classifier fit and partial_fit
- How to find algo type(regression,classification) in Caret in R for all algos at once?
- Difference between weka tool's correlation coefficient and scikit learn's coefficient of determination score
- What are the approaches to the Big-Data problems?
Related Questions in GAUSSIAN
- Apply gaussian filter on text
- opencv gaussian edge color
- Finding the sigma of a Gaussian array without using a fit
- Calculate the gaussian kernel density in python
- Sample a random number following a distribution between two values
- Calculate how a value differs from the average of values using the Gaussian Kernel Density (Python)
- Python 2D Gaussian Fit with NaN Values in Data
- Matlab - FFT of Gaussian - Equivalency
- NaiveBayes classifier handling different data types in python
- Int64 in some Gaussian Randomness Code
- SciKit Gaussian Mixture Model ValueError: x and y must have same first dimension
- GMM - loglikelihood isn't monotonic
- Gaussian elimination for lower triangular matrix
- Issue with Box Muller Transform when implementing a Normal Distribution PRNG
- ImportError: No module named hmm
Related Questions in GLM
- Is it possible to update the formula of a model object while maintaining simplified notation?
- How can I fit a GLM using R's glm() evaluated at specific values (e.g. median)?
- predict with glmer where new data is a Raster Stack of fixed efefcts
- Using predict for more complicated predictions
- Variance with vlm probit
- Pseudo R² for a Poisson GLM with offset
- R bigglm() Error: cannot allocate vector of size
- R glm.predict cannot be found
- Factor/level error in mixed model
- How to extract aic from glm?
- Tukey (bisquare) Psi on GLM Poisson robust model
- GLM returning negative value for threshold (cut-off)(in R)
- How to load data only once for multiple glm calls with varying formulas?
- how to use glmrob with MT methods?
- GLM function for Logistic Regression: what is the default predicted outcome?
Related Questions in DISCRETE-SPACE
- Bezier curves in discrete domain
- How to do space discretization in Gekko?
- How to do 1D discrete collision detection as efficiently as possible?
- 100 Point discrete grid in R
- Creating discrete-time model
- How to create queue of events in order by date
- Trouble with visualizer.pl
- Outlier test in discrete survival analysis
- How to make Conditional Probability Tables (CPTs) for Bayesian networks with pymc
- Coordinate geometry operations in images/discrete space
- Sample from a discrete random distribution in Python
- Determining probability mass function of random variable
- Matlab's slice() function not working as desired
- Using Gaussian family distribution to predict discrete quantities in GLM
- discrete event simulators for C++
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
You can do this but it may not be the best thing to do. It really depends on the nature of the data that you are trying to model. It may well be that poission regression is better suited to your needs.
http://en.wikipedia.org/wiki/Poisson_regression
However, there is nothing whatsoever to stop you from actually fitting a linear model to integer valued data but you may have problems when making inference about your data using the model. If you are simply trying to provide a model from which to predict future observations, it may well work nicely even if not theoretically valid.
Clearly given the nature of the model, you may end up predicting utterly ridiculous results - for example, your reponse variable may only make sense over a limited range (say positive integers) but your model could allow prediction of arbitrarily large values (positive AND negative). Model checking steps like residual checking (normality and correlation) may not give the type of results you would normally see when modelling continuous normally distributed responses.
Overall, I would say that depending on your data, your approach COULD generate a useful predictive model but in general you should proceed with caution.
Read this question and some of the answers to it - it discusses similar themes https://stats.stackexchange.com/questions/3024/why-is-poisson-regression-used-for-count-data
To reach a wider audience you might consider posting this question at http://stats.stackexchange.com