I have a data.frame that has the the following vectors: NAME, JUMP.NUMBER, POWER. These variables are obtained by having performed repeated jumps from one to 20 from which I obtain power.
I want to determine the best rolling means for POWER from 1 to 20 jumps by each subject and then create a new data.frame that contains these values.
While it takes me a long time, I can subset my data by 'NAME', calculate rolling means using the rollmean()
function in zoo, find the maximum values from each of these new data.frames, and then create a new data.set with these values. However, this is incredibly slow.
My code looks like this:
sample<-subset(JUMP.DATA, NAME=="Bob")
ROLLING1<-rollmean(sample,1)
ROLLING2<-rollmean(sample,2)
ROLLING3<-rollmean(sample,3)
ROLLING4<-rollmean(sample,4)
MAXROLLING4<- max(ROLLING4)
MAXROLLING1<- max(ROLLING1)
MAXROLLING2<- max(ROLLING2)
MAXROLLING3<- max(ROLLING3)
NUMBER=c(1, 2, 3, 4)
ROLLING.POWER=c(MAXROLLING1, MAXROLLING2, MAXROLLING3, MAXROLLING4)
BEST.ROLLING.MEAN <-cbind(NUMBER, ROLLING.POWER)
I'm sure there is a much more straightforward method to calculate a rolling mean ~ Group. Any help would be appreciated.
The original data.set would look like this:
NAME=c(Bob, Bob, Bob, Bob, John, John, John, John)
JUMP.NUMBER= c(1, 2, 3, 4, 1, 2, 3, 4)
POWER = c(3000, 2800, 2700, 2600, 3400, 3100, 2900, 2800)
JUMP.DATA= cbind(NAME, JUMP.NUMBER, POWER)
Here's a data.table solution.
Response to @Arun's comment
So here is a benchmark comparing the
runmean(...)
incaTools
torollmean(...)
inzoo
. The former is about 4 X faster. Note that the defaults are different though.