I am confused about how statsmodels ARIMA computes fitted values. Consider a simple AR(1) process fitted to a randomly generated series
series = array([ 1.76405235, 0.40015721, 0.97873798, 2.2408932 , 1.86755799,
-0.97727788, 0.95008842, -0.15135721, -0.10321885, 0.4105985 ])
we can fit the model:
model = sm.tsa.ARIMA(series, order = (1,0,0)).fit()
get estimates of the parameters:
parameters = model.params
and also get fitted values:
fitted_values = model.fittedvalues
How are these fitted_values calculated (from the initial data and the parameters estimated by the model.)?
-- I tried model.params[0] + model.params[1]*series[i-1]
, but that didn't work.
-- I am also not sure why model.fittedvalues[0]
is the same as model.params[0]
.
I also find the autoregressive models on statsmodels confusing.
The question here is, what are these three parameters:
One way to find out is to print the summary of the model-fitting result:
In the above, you can see that the three parameters (in order) are 'const', 'ar.L1', and 'sigma2'
Another way, is to give your observation data as a Pandas series:
Then, the parameters are provided as a series with a helpful index:
To figure out what these actually are you have to go into the documentation where you can find this model definition:
While there is no explicit reference to the names 'const', 'ar.L1', and 'sigma2', that I can find, my best guess is:
As to how these are calculated I assume it is using Ordinary Least Squares (OLS). For more details refer to the documentation on the
fit
method