I have a data.frame with several columns (17). Column 2 have several rows with

Question

0

Asked: June 17, 20262026-06-17T05:34:22+00:00 2026-06-17T05:34:22+00:00

I have a data.frame with several columns (17). Column 2 have several rows with

0

I have a data.frame with several columns (17).
Column 2 have several rows with the same value, I want to keep only one of those rows, specifically the one that has the maximum value in column 17.

For example:

A    B
'a'  1
'a'  2
'a'  3
'b'  5
'b'  200

Would return
A    B
'a'  3
'b'  200

(plus the rest of the columns)

So far I’ve been using the unique function, but I think it randomly keeps one or keeps just the first one that appears.

** UPDATE **
The real data has 376000 rows. I’ve tried the data.table and ddply suggestions but they take forever. Any idea which is the most efficient?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-17T05:34:24+00:00

Editorial Team

2026-06-17T05:34:24+00:00Added an answer on June 17, 2026 at 5:34 am

A solution using package data.table:

set.seed(42)
dat <- data.frame(A=c('a','a','a','b','b'),B=c(1,2,3,5,200),C=rnorm(5))
library(data.table)

dat <- as.data.table(dat)
dat[,.SD[which.max(B)],by=A]

   A   B         C
1: a   3 0.3631284
2: b 200 0.4042683

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have a data.frame with several columns (17). Column 2 have several rows with

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply