I am trying to add a new variable (residual) to my original data frame based on the residuals of a first-order autoregressive model using lm().
residuals(lm(Var1 ~lag(Var1), panel_data)
It is similar to this question R: Replacement has [x] rows, data has [y] - residuals from a linear model in new variable but with groups. I already tried the proposed code including a line with group_by. However it is producing wrong residuals for the first observation of every group. How can I adapt the following code?
library(dplyr)
library(broom)
panel_data %>%
group.by = group %>%
lm(Var1 ~ Var1, data = .) %>%
augment() %>%
select(.rownames, .std.resid) %>%
right_join(mutate(panel_data, row = as.character(row_number())),
by = c(".rownames" = "row"))
An example data set can be as follows:
# Number of groups
num_groups <- 20
# Number of months
num_months <- 100
panel_data <- data.table(
group = rep(1:num_groups, each = num_months), # Group IDs
time = rep(1:num_months, times = num_groups), # Time period
Var1 = rnorm(num_groups * num_months), # Variable 1
Var2 = rnorm(num_groups * num_months) # Variable 2
)
Please have a look at this: