Question

Asked: May 11, 2026 (2026-05-11T12:55:28+00:00)

Here’s the scenario: I have a local git repository that mirrors the contents of another source control system (a proprietary one). I’ve written a script that periodically syncs my git branch with that system’s latest copy of the same branch (called by another term in the other system but conceptually similar).

Now, suppose that in the other system, someone creates a branch from the branch I’m currently syncing and starts hacking on it. What I’d like to do is pull down the first version of that other branch, then find the commit in my git version of the main branch that is closest to the new branch. If I can do this, I’ll know which commit on the main branch to use as the parent of this new branch.

This sounds to me like a problem of computing ‘tree distances’. But since SHA-1 hashes only tell you whether two objects are identical, not how far apart they are, is there another way to do this besides the obvious manual deep search of every commit to find the one with the largest number of matching blobs?

UPDATE: See below, found a domain-specific way to do it.


1 Answer

score 0 · Answer 1 · 2026-05-11T12:55:29+00:00

It’s worse than that; in the general case you’ll have to count edit distance on the blobs to see how similar they are.
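For a single pair of blobs, the comparison could look something like the sketch below. It uses the classic dynamic-programming (Levenshtein) edit distance over lines, which is quadratic in the number of lines; `edit_distance` and `blob_distance` are names made up for this illustration, and treating each line as one unit is an assumption, not a requirement.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (strings or lists of lines)."""
    # prev[j] is the distance between the already-processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty sequence takes i deletions
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute, or keep a match
        prev = curr
    return prev[-1]


def blob_distance(old_text, new_text):
    """Line-level edit distance between the contents of two files."""
    return edit_distance(old_text.splitlines(), new_text.splitlines())
```

Summing `blob_distance` over the files that differ between two trees would give one possible (rough) tree distance; files that were renamed or split would need extra handling.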

Hoping this is a rare event, I would clone the git repository and start rolling back versions to locate the commit that is closest to the tree you wish to duplicate. It would be nice to think of using git bisect for this, but since there’s no total ordering and no absolute concept of good or bad, I don’t see how to avoid trying every commit.
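A cheaper first pass than full edit distance is the "count identical blobs" idea from the question, applied to every commit. The following is only a sketch under some assumptions: the helper names are hypothetical, the snapshot of the new branch is already available as a path-to-blob-hash mapping, and file names contain no characters that `git ls-tree` would quote.

```python
import subprocess


def git_lines(repo, *args):
    """Run a git command inside `repo` and return its output as a list of lines."""
    out = subprocess.run(["git", "-C", repo, *args], check=True,
                         capture_output=True, text=True).stdout
    return out.splitlines()


def tree_blobs(repo, commit):
    """Map every path in `commit` to its object hash, using `git ls-tree -r`."""
    blobs = {}
    for line in git_lines(repo, "ls-tree", "-r", commit):
        meta, path = line.split("\t", 1)  # "<mode> <type> <hash>\t<path>"
        blobs[path] = meta.split()[2]
    return blobs


def shared_blobs(a, b):
    """Count the paths whose hash is identical in both mappings."""
    return sum(1 for path, sha in a.items() if b.get(path) == sha)


def closest_commit(repo, branch, target_blobs):
    """Try every commit reachable from `branch`; return the best-scoring one."""
    commits = git_lines(repo, "rev-list", branch)
    return max(commits,
               key=lambda c: shared_blobs(tree_blobs(repo, c), target_blobs))
```

One way to get `target_blobs` would be to commit the imported snapshot on a scratch branch and call `tree_blobs` on it. This still visits every commit, so it is linear in the length of the history, and ties between equally good commits are broken arbitrarily by `max`.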

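Minimum edit distance between unordered trees is NP-hard in general as well (unlike edit distance between two strings, which is only quadratic), so you have a real pain in the ass here.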

If you are lucky, the other system lets you recover the date and time at which the new branch was created. Then maybe you can just grab the last commit before that timestamp?
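If the timestamp is available, the lookup might be as small as the sketch below, which relies on `git rev-list -n 1 --before=<date>`. The function names are hypothetical, and note that `--before` filters on the commit date recorded in git, so this only works if the sync script's commit dates track the other system's timestamps closely enough.

```python
import subprocess


def rev_list_before_args(branch, timestamp):
    """Build the git arguments selecting the newest commit older than `timestamp`."""
    return ["rev-list", "-n", "1", f"--before={timestamp}", branch]


def last_commit_before(repo, branch, timestamp):
    """Return the hash of the newest commit on `branch` older than `timestamp`.

    `timestamp` is any date string git understands, e.g. "2026-05-11 12:00:00".
    Returns None when no commit on the branch is old enough.
    """
    out = subprocess.run(
        ["git", "-C", repo, *rev_list_before_args(branch, timestamp)],
        check=True, capture_output=True, text=True).stdout.strip()
    return out or None
```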
