I have three data frames, the first (with column headers, but no row numbering)

Asked: June 1, 2026 (2026-06-01T22:04:12+00:00)

I have three data frames. The first (shown with column headers, but no row numbering) looks like

ID    1   2   3
 A   12  NA  NA
 B   NA   7  NA
 C   NA  NA  22

The second may look like

ID    1   2   3
 A   NA   6  NA
 B   NA  NA  29
 C   43  NA  NA

Lastly, the third looks like

ID    1   2   3
 A   NA  NA  32
 B    5  NA  NA
 C   NA   2  NA

The first column is an ID column and is the same in all three data frames. The remaining three columns represent the same variables (1, 2, and 3). Each record appears in only one of the data sets: the record for observation A, variable 1 is in one data set, and the record for observation A, variable 2 is in a different one.

How can I merge these data sets together to get something like

ID    1   2   3
 A   12   6  32
 B    5   7  29
 C   43   2  22

I apologize that I didn’t have a better way of describing this problem. If someone could share the terminology for it, that would be great.

1 Answer

Editorial Team · Answer 1 · 2026-06-01T22:04:13+00:00

Nice title! This is quite similar to the question "R – Vector/Array Addition".

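You can turn your data into a multi-dimensional array, then sum or take the mean across the “puzzle piece” dimension. Because each cell is non-NA in exactly one of the data frames, summing (or averaging) with the NAs removed returns that one value: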

df1 <- read.table(text="ID    1   2   3
A   12  NA  NA
B   NA   7  NA
C   NA  NA  22", header = TRUE)

df2 <- read.table(text="ID    1   2   3
A   NA   6  NA
B   NA  NA  29
C   43  NA  NA", header = TRUE)

df3 <- read.table(text="ID    1   2   3
A   NA  NA  32
B    5  NA  NA
C   NA   2  NA", header = TRUE)

# gather inputs and remove common ID column
lists  <- list(df1, df2, df3)
pieces <- lapply(lists, '[', , -1)

# turn data into a multi-dimensional array
a <- array(unlist(pieces), dim = c(nrow(df1),
                                   ncol(df1) - 1,
                                   length(pieces)))

# compute sums across pieces
rowSums(a, na.rm = TRUE, dims = 2)
#      [,1] [,2] [,3]
# [1,]   12    6   32
# [2,]    5    7   29
# [3,]   43    2   22

All that is left is to attach the ID column back, for example with cbind().
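A minimal sketch of that last step, assuming the rows appear in the same ID order in all three data frames (as they do in the question). The names `summed` and `result` are only illustrative, and `check.names = FALSE` is an addition here to keep the original headers `1`, `2`, `3` instead of R's default `X1`, `X2`, `X3`:

```r
# Rebuild the example inputs; check.names = FALSE keeps the headers "1", "2", "3"
df1 <- read.table(text = "ID 1 2 3
A 12 NA NA
B NA 7 NA
C NA NA 22", header = TRUE, check.names = FALSE)

df2 <- read.table(text = "ID 1 2 3
A NA 6 NA
B NA NA 29
C 43 NA NA", header = TRUE, check.names = FALSE)

df3 <- read.table(text = "ID 1 2 3
A NA NA 32
B 5 NA NA
C NA 2 NA", header = TRUE, check.names = FALSE)

# Drop the ID column and stack the three value matrices into a 3-D array
pieces <- lapply(list(df1, df2, df3), function(d) as.matrix(d[, -1]))
a <- array(unlist(pieces), dim = c(nrow(df1), ncol(df1) - 1, length(pieces)))

# Sum across the third ("piece") dimension, ignoring NAs
summed <- rowSums(a, na.rm = TRUE, dims = 2)
colnames(summed) <- names(df1)[-1]

# Attach the ID column again (assumes identical row order in every input)
result <- data.frame(ID = df1$ID, summed, check.names = FALSE)
print(result)
```

One caveat with this approach: with `na.rm = TRUE`, a cell that is NA in every piece comes out as 0 rather than NA, and a cell that is filled in more than one piece comes out as the sum of those values. If either case can occur in real data, it is worth checking for before relying on the result.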
