I am trying to use ddply to my sample data (call Z) which look

Asked: June 14, 2026 (2026-06-14T05:03:24+00:00)

I am trying to use ddply on my sample data (call it z), which looks like this:

My goal is to find the sum of y for ids starting with 1 (i.e. 1001, 1200, ...), 2 (2100), 3 (3100, 3190), 4, ..., 10, 11, ..., 65. For example, for ids starting with 1 the sum is 10 + 11 + 12 = 33, and for ids starting with 2 it is 32.

I tried the following with split and lapply:

>s <- split(z,z$id)
>lapply(s, function(x) colSums(x[, c("y")]))

However, this gives me the sum for each unique id, not the sum per leading-digit group that I am looking for. Any suggestions would be highly appreciated.

1 Answer

Editorial Team · Answer 1 · 2026-06-14T05:03:25+00:00

thelatemail provides a valid approach, but I want to point out that the problem isn’t really with your understanding of lapply (your code was almost correct) but with how you think about grouping. thelatemail handles this in his solution, and that’s the key: group by the id with its last three digits dropped, not by the full id. I’ll show a fix using your approach, then how I would actually approach this, and then a version using ave, just because I never get to use it 🙂

Read in data

z <- read.table(textConnection("id y #stole this from the latemail
1001 10
1001 11
1200 12
2001 10
2030 12
2100 32
3100 10
3190 13
4100 45
5100 67
5670 56
10001 54
10345 45"),header=TRUE)

Your code adjusted

s <- split(z, substring(as.character(z$id), 1, nchar(as.character(z$id)) - 3))
lapply(s, function(x) sum(x[, "y"]))
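
If a named vector is easier to work with than the list that lapply returns, tapply can do the split and the sum in one step. This is only a sketch: it assumes the same sample data as above and that the group is whatever remains after dropping the last three digits of the id. The names grp and sums are illustrative, not from the question.

```r
# Same sample data as above, read via the text= argument
z <- read.table(text = "id y
1001 10
1001 11
1200 12
2001 10
2030 12
2100 32
3100 10
3190 13
4100 45
5100 67
5670 56
10001 54
10345 45", header = TRUE)

# Group key (assumed rule): the id with its last three digits dropped
grp <- substring(as.character(z$id), 1, nchar(as.character(z$id)) - 3)

# One sum of y per group, returned as a vector named by group
sums <- tapply(z$y, grp, sum)
sums
```

Individual groups can then be looked up by name, for example sums["1"] or sums["10"].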

The approach I would likely take: add a new grouping id variable

z$IDgroup <- substring(as.character(z$id), 1, nchar(as.character(z$id)) - 3)
aggregate(y ~ IDgroup, z, sum)
#similar approach but adds the solution back as a new column
z$group.sum <- ave(z$y, z$IDgroup, FUN=sum)
z
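
Because the ids are numeric, the group can probably also be computed arithmetically instead of with string handling. The sketch below swaps substring for integer division by 1000, which drops the last three digits. It assumes the ids are non-negative whole numbers and that the grouping rule really is "everything before the last three digits", as in the sample data. The name res is illustrative.

```r
# Same sample data as above, read via the text= argument
z <- read.table(text = "id y
1001 10
1001 11
1200 12
2001 10
2030 12
2100 32
3100 10
3190 13
4100 45
5100 67
5670 56
10001 54
10345 45", header = TRUE)

# Integer division by 1000 drops the last three digits: 1200 -> 1, 10345 -> 10
z$IDgroup <- z$id %/% 1000

# Sum y within each numeric group
res <- aggregate(y ~ IDgroup, data = z, FUN = sum)
res
```

One side effect of keeping the group numeric is that the result is ordered 1, 2, 3, 4, 5, 10 rather than alphabetically ("1", "10", "2", ...), which may be easier to read when the groups run up to 65.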
