I would like to match two files based on one column and combine the

Question

0

Asked: June 4, 20262026-06-04T02:17:32+00:00 2026-06-04T02:17:32+00:00

I would like to match two files based on one column and combine the

0

I would like to match two files based on one column and combine the matching lines. But one of the files (file1.txt) has the same entry more than once. As an example:

file1.txt

chr:123 a
chr:123 b
chr:456 a

file2.txt

chr:123 aa
chr:456 bb

I would like to extract the indexes based on the first column.

The final output should look like:

chr:123 a aa
chr:123 b aa
chr:456 a bb

I tried intersect on R but couldn’t figure out how to combine matching lines when file1.txt has the same entry more than once.
I am using two for loops but the files are very big and it takes lots of time.

Is there a quicker way to do this in perl or R?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-04T02:17:33+00:00

Editorial Team

2026-06-04T02:17:33+00:00Added an answer on June 4, 2026 at 2:17 am

Try this:

one <- data.frame(
id=c("chr:123","chr:123","chr:456"),
value=c("a","b","a")
)

two <- data.frame(
id=c("chr:123","chr:456"),
value=c("aa","bb")
)

merge(one,two,by="id",all.x=TRUE)

#result
       id value.x value.y
1 chr:123       a      aa
2 chr:123       b      aa
3 chr:456       a      bb

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I would like to match two files based on one column and combine the

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply