My original data has over 30, 000 rows and 100 column.This is an example: dataset
df <- data.frame(Outcomes= c(3, 4, 5, 6),
ADMDATE_3=c(10,7,9, 7),
ADMDATE_4=c(4,10,6,9),
ADMDATE_5=c(2, 11 ,8,8),
ADMDATE_6=c(4.5,7,9,12))
My expected results is that I want to use the value in the outcomes column, for example 4 to sum across ADMDATE column which has a value of 4 at the end and the subsequent columns (the remaining length of mydataset). Another example is if outcomes = 5 then I want the sum for ADMDate 5 to 6.
This my expected table
Outcomes ADMDATE_3 ADMDATE_4 ADMDATE_5 ADMDATE_6 sum_all
3 10 4 2 4.5 20.5
5 7 10 11 7.0 18.0
6 9 6 8 9.0 9.0
4 7 9 8 12.0 29.0
You can use
apply
andmatch
the column names andsum
:Or using
tidyverse
get the data in long format, keep all the values which are>=
Outcomes
andsum
for eachOutcomes
.