Creating two columns of cumulative sum based on the categories of one column

Question

Creating two columns of cumulative sum based on the categories of one column

102 Views Asked by user15219127 At 16 February 2021 at 09:36

I like to create two columns with cumulative frequency of "A" and "B" in the assignment columns.

df = data.frame(id = 1:10, assignment= c("B","A","B","B","B","A","B","B","A","B"))

            id assignment
        1   1          B
        2   2          A
        3   3          B
        4   4          B
        5   5          B
        6   6          A
        7   7          B
        8   8          B
        9   9          A
        10 10          B

The resulting table would have this format

            id  assignment  A   B
        1   1   B           0   1
        2   2   A           1   1
        3   3   B           1   2
        4   4   B           1   3
        5   5   B           1   4
        6   6   A           2   4
        7   7   B           2   5
        8   8   B           2   6
        9   9   A           3   6
       10   10  B           3   7

How to generalize the codes for more than 2 categories (say for "A","B",C")? Thanks

Original Q&A

There are 3 best solutions below

**Ronak Shah** · Answer 1 · 2021-02-16T09:40:48.610000

Use lapply over unique values in assignment to create new columns.

vals <- sort(unique(df$assignment))
df[vals] <- lapply(vals, function(x) cumsum(df$assignment == x))
df

#   id assignment A B
#1   1          B 0 1
#2   2          A 1 1
#3   3          B 1 2
#4   4          B 1 3
#5   5          B 1 4
#6   6          A 2 4
#7   7          B 2 5
#8   8          B 2 6
#9   9          A 3 6
#10 10          B 3 7

**akrun** · Answer 2 · 2021-02-16T23:07:37.960000

akrun On 16 February 2021 at 23:07

We can use model.matrix with colCumsums

library(matrixStats)
cbind(df, colCumsums(model.matrix(~ assignment - 1, df[-1])))

**ThomasIsCoding** · Answer 3 · 2021-02-16T23:19:40.987000

A base R option

transform(
  df,
  A = cumsum(assignment == "A"),
  B = cumsum(assignment == "B")
)

gives

   id assignment A B
1   1          B 0 1
2   2          A 1 1
3   3          B 1 2
4   4          B 1 3
5   5          B 1 4
6   6          A 2 4
7   7          B 2 5
8   8          B 2 6
9   9          A 3 6
10 10          B 3 7

Creating two columns of cumulative sum based on the categories of one column

There are 3 best solutions below

Related Questions in R

Related Questions in CUMULATIVE-FREQUENCY

Trending Questions

Popular # Hahtags

Popular Questions