I found the following R codes to fit a Tweedie Compound Poisson Gamma distribution. I have to fit it to my 399 claim amounts. I have seen the following R codes ptweedie.series(q, power, mu, phi)
and dtweedie.series(y, power, mu, phi)
. However I fail to understand the codes fully and after importing my data into R, how to proceed? Thanks in advance.
R codes for Tweedie Compound Poisson Gamma
7.8k Views Asked by user3309969 At
1
There are 1 best solutions below
Related Questions in R
- Passing arguments to main in C using Eclipse
- kernel module does not print packet info
- error C2016 (C requires that a struct or union has at least one member) and structs typedefs
- Drawing with ncurses, sockets and fork
- How to catch delay-import dll errors (missing dll or symbol) in MinGW(-w64)?
- Configured TTL for A record(s) backing CNAME records
- Allocating memory for pointers inside structures in functions
- Finding articulation point of undirected graph by DFS
- C first fgets() is being skipped while the second runs
- C std library don't appear to be linked in object file
Related Questions in TWEEDIE
- Passing arguments to main in C using Eclipse
- kernel module does not print packet info
- error C2016 (C requires that a struct or union has at least one member) and structs typedefs
- Drawing with ncurses, sockets and fork
- How to catch delay-import dll errors (missing dll or symbol) in MinGW(-w64)?
- Configured TTL for A record(s) backing CNAME records
- Allocating memory for pointers inside structures in functions
- Finding articulation point of undirected graph by DFS
- C first fgets() is being skipped while the second runs
- C std library don't appear to be linked in object file
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
First a note: importing your dataset from the comments above yielded 398 claims, not 399. One of these was 4 orders of magnitude larger than the median claim. So I suspect a typo. In the analysis that follows I excluded that sample, leaving 397.
A quick look at the Wikipedia entry for Tweedie Distributions reveals that this is actually a family of exponential distributions distinguished by the
power
parameter (xi
in the R documentation). Power=1 yields the Poisson distribution, power=2 yields the Gamma distribution, power=3 yields the inverse Gaussian distribution, and so on. The Tweedie distributions are also defined for non-integer power. The parameter mu is the mean, and phi is a dispersion parameter, related to variance.So the basic question, as I understand it, is which combination of power, mu, and phi yield a distribution which best fits your claims data?
One way to assess whether a distribution fits a sample is the Q-Q plot. This plots quantiles of your sample vs. quantiles of the test distribution. If the sample is distributed as the test distribution, the Q-Q plot should be a straight line. In R code (and with
X
as your vector of samples):Both the Gamma and the Inverse Gaussian distributions explain your data up to claims of ~40,000. The Gamma distribution underestimates the frequency of larger claims, while the Inverse Gaussian distribution overestimates their frequency. So let's try power=2.5.
So your claims data seems to follow a tweedie distribution with power=2.5. The next step is to estimate mu and phi, given power=2.5. This is a non-linear optimization problem in 2 dimensions, so we use package
nloptr
. It turns out that convergence depends on having starting parameters relatively close the the optimal values, so there is a fair amount of trial and error to getnlopt(...)
to converge.Finally, we confirm that the solution does indeed fit the data well.