I can’t seem to grok the different solutions I’ve found and studied for tracking external code. Let alone understand how to apply them to my use case…
Would you guys be so kind to shed some light on this and help me with my specific use case? What would be the best solution for the following, concrete problem? (I’m not gonna attempt to generalize my problem, since I might make wrong assumptions about stuff, especially since I’m so new with all this…)
I’m building a website in Django (a web framework in Python). Now, there are a lot of 3rd party plugins available for use with Django (Django calls them ‘apps’), that you can drop in your project. Some of these apps might require a bit of modification to get working like I want them. But if you start making modifications to 3rd party code you introduce the problem of updating that code when newer versions appear AND at the same time keeping your local modifications.
So, the way I would do that in Subversion is by using vendor branches. My repository layout would look like this:
/trunk ... /apps /blog-app ... /tags ... /branches ... /vendor /django-apps /blog-app /1.2 /1.3 /current /other-app /3.2 /current
In this case /trunk/apps/blog-app would have been svn copy’d of one of the tags in /vendor/django-apps/blog-app. Say that it was v1.2. And that I now want to upgrade my version in trunk to v1.3. As you can see, I have already updated /vendor/django-apps/blog-app/current (using svn_load_dirs) and ‘tagged’ (svn copy) it as /vendor/django-apps/blog-app/1.3. Now I can update /trunk/apps/blog-app by svn merge’ing the changes between /vendor/django-apps/blog-app/1.2 and /vendor/django-apps/blog-app/1.3 on /trunk/apps/blog-app. This will keep my local changes. (for people unknown with this process, it is described in the Subversion handbook: http://svnbook.red-bean.com/en/1.5/svn.advanced.vendorbr.html)
Now I want to do this whole process in Git. How can I do this?
Let me re-iterate the requirements:
- I must be able to place the external code in an arbitrary position in the tree
- I must be able to modify the external code and keep (commit) these modifications in my Git repos
- I must be able to easily update the external code, should a new version be released, whilst keeping my changes
Extra (for bonus points 😉 ):
- Preferably I want to do this without something like svn_load_dirs. I think it should be possible to track the apps and their updates straight from their repository (most 3rd party Django apps are kept in Subversion). Giving me the added benefit of being able to view individual commit messages between releases. And fixing merge conflicts more easily since I can deal with a lot of small commits instead of the one artificial commit created by svn_load_dirs. I think one would do this with svn:externals in Subversion, but I have never worked with that before…
A solution where a combination of both methods could be used would be even more preferable, since there might be app developers who don’t use source control or don’t make their repos available publicly. (Meaning both svn_load_dirs-like behavior and tracking straight from a Subversion reposity (or another Git))
I think I would either have to use subtrees, submodules, rebase, branches, … or a combination of those, but smack down me if I know which one(s) or how do to it :S
I’m eagerly awaiting your responses! Please be as verbose as possible when replying, since I already had a hard time understanding other examples found online.
Thanks in advance
There are two separate problems here:
Problem 1 is pretty easy by itself. Just do something like:
You can then work on your forked repository normally. When you want to merge in upstream changes, run:
As for problem 2, one option is to use submodules. For this, cd into your main project, and run:
If I use git submodules, what do I need to know?
You may find git submodules to be a little bit tricky at times. Here are some things to keep in mind:
You can work around (4) to a certain extent by using an alias created by one of my coworkers:
…and then running:
If git submodules are so tricky, what are the advantages?
git submodules aren’t for me. What next?
If you don’t want to use git submodules, you might want to look into git merge’s subtree strategy. This keeps everything in one repository.
What if the upstream repository uses Subversion?
This is pretty easy if you know how to use git svn:
Then set up a local tracking branch in git.
Then, to merge from upstream, run:
(I haven’t tested this code, but it’s more-or-less how we maintain one submodule with an upstream SVN repository.)
Don’t use git svn rebase, because it will make it very difficult to use git submodule in the parent project without losing data. Just treat the Subversion branches as read-only mirrors of upstream, and merge from them explicitly.
If you need to access the upstream Subversion repository on another machine, try:
You should then be able to merge changes from upstream as before.