I want to construct a momentum factor which is part of Fama and French Regression. We have a monthly data table starting from 1980-12-31 to 2021-12-31, and a time series of stock returns, given as a percentage monthly return. [The Image is showing a simple version of our data table, (where RI - Return Index)]
The definition of the momentum factor is the cumulated total stock return (calculated by the total return index, item RI) from month t-12 to month t-2, where t is the month of the forecasted return. Momentum factor is updated monthly.
I know the code of the cumulated total stock return to be:
DS.monthly[, cum_ret := cumprod(RET.USD + 1), by = c("Id", "Date")]
but this would not be the correct way to compute momentum, because of this statement "from month t-12 to month t-2 "
We came up with this new columns for our data set:
DS.monthly %>% setorder(Id, -Date)
DS.monthly[, t_1:=lead(RET.USD/100), by=Id]
DS.monthly[, t_2:=lead(t_1), by=Id]
DS.monthly[, t_3:=lead(t_2), by=Id]
DS.monthly[, t_4:=lead(t_3), by=Id]
DS.monthly[, t_5:=lead(t_4), by=Id]
DS.monthly[, t_6:=lead(t_5), by=Id]
DS.monthly[, t_7:=lead(t_6), by=Id]
DS.monthly[, t_8:=lead(t_7), by=Id]
DS.monthly[, t_9:=lead(t_8), by=Id]
DS.monthly[, t_10:=lead(t_9), by=Id]
DS.monthly[, t_11:=lead(t_10), by=Id]
DS.monthly[, t_12:=lead(t_11), by=Id]
DS.monthly[, cumulative:= ((t_2+1)(t_3+1)(t_4+1)(t_5+1)(t_6+1)(t_7+1)(t_8+1)(t_9+1)(t_10+1)(t_11+1)(t_12+1))]
where we basically calculate the lagged return for each month and compute the cum_return manually, but it is a very inefficient way.
We had already one code suggestion, but the explanation before from my part was not good so I am not sure if it's the correct way. The suggested code is as below:
as.data.table(DS.monthly |>
arrange(Id, Date) |>
group_by(Id) |>
mutate(Momentum = slide_dbl(lag(RET.USD), ~ last(cumprod(1 + .x)), .before = 10, .complete = TRUE)))
Can someone please suggest a solution?
I believe you can leverage
shift()
andReduce()
here..data.table::shift()
can take a vector of offsets, so that you can get all the offsets from -2 to -12 all at once..Output:
Input: