I have two files A – nodes_to_delete and B – nodes_to_keep . Each file

Question

Editorial Team

Asked: May 14, 2026

I have two files, A (nodes_to_delete) and B (nodes_to_keep). Each file has many lines of numeric ids.

I want the list of numeric ids that are in nodes_to_delete but NOT in nodes_to_keep, i.e. the set difference A \ B.

Doing it within a PostgreSQL database is unreasonably slow. Any neat way to do it in bash using Linux CLI tools?

UPDATE: This would seem to be a job for Python, but the files are really, really large. I have solved some similar problems using uniq, sort and some set theory techniques, and that was about two or three orders of magnitude faster than the database equivalents.

1 Answer

Editorial Team · Answer 1 · 2026-05-14T02:25:28+00:00

Added an answer on May 14, 2026 at 2:25 am

The comm command does exactly that.
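As a minimal sketch of how that could look (the sample ids, the temporary directory and the *.sorted file names are made up for illustration; only nodes_to_delete and nodes_to_keep come from the question): comm expects both inputs to be sorted in the same collation order, so each file is sorted first, and -23 suppresses the lines unique to the second file and the lines common to both, leaving A \ B.

```shell
# Hypothetical sample data; in practice these are the two large id files.
workdir=$(mktemp -d)
printf '%s\n' 3 1 4 2 > "$workdir/nodes_to_delete"
printf '%s\n' 2 3 5 > "$workdir/nodes_to_keep"

# comm needs both inputs sorted in the same collation order.
# A plain (lexical) sort under LC_ALL=C is used on purpose, not sort -n,
# because comm compares lines as strings. -u also drops duplicate ids.
LC_ALL=C sort -u "$workdir/nodes_to_delete" > "$workdir/delete.sorted"
LC_ALL=C sort -u "$workdir/nodes_to_keep" > "$workdir/keep.sorted"

# -2 hides lines found only in the second file, -3 hides lines found in both,
# so only the lines unique to the first file remain (A \ B).
result=$(LC_ALL=C comm -23 "$workdir/delete.sorted" "$workdir/keep.sorted")
printf '%s\n' "$result"

rm -rf "$workdir"
```

In bash the intermediate files can be avoided with process substitution, e.g. comm -23 <(sort nodes_to_delete) <(sort nodes_to_keep). If the files are already sorted lexically, the sort steps can be skipped altogether.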
