Multi- To Mono-repository
Merge multiple repositories into one big monorepository. Migrates every branch in every subrepo to the eponymous branch in the monorepo, with all files (including in the history) rewritten to live under a subdirectory.
Features:
- Preserve full history and commit hashes of all repositories.
- Don't Stop The World: keep working in your other repositories during the migration and pull the changes into the monorepo as you go.
- No conflicts: each original repository keeps its directory structure, so no merging is required. All files are moved into a subdirectory.
Requirements:
- git version 2.9+.
Usage
Prepare a list of repositories to merge in a file. The format is <repository_url><space><new_name>. A slash in <new_name> will make the script fail, because the script uses <new_name> as the name of a git remote. If you need a slash, e.g. for some folder depth, pass a third parameter. The format then becomes:
<repository_url><space><new_name><space><folder_name>
Here is an example repos.txt where the services are directly in the root of the repository and the libraries are in a /lib subfolder:
git@github.com:mycompany/service-one.git one
git@github.com:mycompany/service-two.git two
git@github.com:mycompany/library-three.git three lib/three
git@github.com:mycompany/library-four.git four lib/four
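Since a slash in <new_name> makes the script fail, it can help to check the list before piping it in. The following is a minimal sketch, not part of tomono: check_repos_list is a hypothetical helper, and skipping blank lines is a choice made by this helper, not a statement about how tomono.sh treats them.

```shell
# Hypothetical helper (not part of tomono): reads a repository list on stdin
# and reports lines that lack a <new_name> or whose <new_name> contains a slash.
check_repos_list() {
    lineno=0
    status=0
    # "|| [ -n "$url" ]" also handles a final line without a trailing newline
    while read -r url name folder || [ -n "$url" ]; do
        lineno=$((lineno + 1))
        # This helper skips blank lines
        if [ -z "$url" ]; then
            continue
        fi
        if [ -z "$name" ]; then
            echo "line $lineno: missing <new_name>" >&2
            status=1
        fi
        case "$name" in
            */*)
                echo "line $lineno: <new_name> '$name' contains a slash; use <folder_name> for that" >&2
                status=1
                ;;
        esac
    done
    return "$status"
}
```

For example, check_repos_list < repos.txt returns a non-zero status and prints the offending line numbers if the list needs fixing.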
Now pipe the file to the tomono.sh script. Assuming you've downloaded this program to your home directory, for example, you can do:
$ cat repos.txt | ~/tomono/tomono.sh
This will create a new repository called core in your current directory.
If you already have a repository called core and wish to import more into it, pass the --continue flag. Make sure you don't have any outstanding changes!
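One way to check for outstanding changes before continuing is to ask git for a machine-readable status. This is a sketch under the assumption that "outstanding changes" means uncommitted or untracked files; is_clean is a hypothetical helper name, not something tomono provides.

```shell
# Hypothetical helper: succeeds only if the repository at path $1 has no
# uncommitted or untracked changes (empty "git status --porcelain" output).
is_clean() {
    [ -z "$(git -C "$1" status --porcelain)" ]
}
```

It could be combined with the import along the lines of: is_clean core && cat repos.txt | ~/tomono/tomono.sh --continue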
To change the name of the monorepo directory, set an environment variable before any other operations:
$ export MONOREPO_NAME=my_directory
$ ...
Tags and namespacing
Note that all tags are namespaced by default: e.g. if your remote foo has tags v1 and v2, your new monorepo will have tags foo/v1 and foo/v2. If you'd rather not have this, and just risk the odd tag clash (not a big deal: worst case one tag overrides the other), you can do the following after running the full script:
$ ....tomono.sh # after this
$ cd core
$ rm -rf .git/refs/tags
$ git fetch --all
That will re-fetch all tags for you, verbatim.
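For comparison, namespaced tags such as foo/v1 can be produced by hand with a fetch refspec. This is only a sketch of the idea and not necessarily how tomono.sh does it; fetch_namespaced_tags is a hypothetical name, and the remote is assumed to exist already in the current repository.

```shell
# Hypothetical helper: fetch all tags of remote $1 into the namespace "$1/",
# so that the remote's tag v1 becomes the local tag $1/v1.
# --no-tags stops git from also fetching the tags under their original names.
fetch_namespaced_tags() {
    git fetch --no-tags "$1" "+refs/tags/*:refs/tags/$1/*"
}
```

After fetch_namespaced_tags foo, running git tag --list 'foo/*' should show the namespaced tags.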
Fluid migration: Don't Stop The World
New changes to the old repositories can be imported into the monorepo and merged in. For instance, in the example above, say repository one had a branch my_branch which continued to be developed after the migration. To pull those changes in:
# Fetch all changes to the old repositories
$ git fetch --all --no-tags
$ git checkout my_branch
$ git merge --strategy recursive --strategy-option subtree=one/ one/my_branch
This is a regular merge like you are used to (recursive was the default strategy up to Git 2.33; newer versions default to ort, which accepts the same option). The only special thing about it is the --strategy-option subtree=one/: this tells git that the files have all been moved to a subdirectory called one.
N.B.: new tags won't be fetched, because they would not be namespaced if fetched this way. If you don't mind having all your tags together in the same scope, follow the "no namespaced tags" instructions from above, and remove the --no-tags bit here.
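If the same branch kept moving in several old repositories, the fetch and merge steps above can be wrapped in a small loop. This is a sketch, not part of tomono: pull_branch is a hypothetical helper, and it assumes each remote's files live in a directory with the same name as the remote (which does not hold for entries that use a separate <folder_name>).

```shell
# Hypothetical helper: merge branch $1 from each of the named remotes into
# the monorepo branch of the same name. Run it from inside the monorepo.
# Assumes the subdirectory of each remote equals the remote's name.
pull_branch() {
    branch=$1
    shift
    # Fetch all changes to the old repositories, without their tags
    git fetch --all --no-tags || return 1
    git checkout "$branch" || return 1
    for remote in "$@"; do
        # subtree=<dir>/ tells git the remote's files live under <dir>/ here
        git merge --no-edit --strategy recursive \
            --strategy-option "subtree=$remote/" "$remote/$branch" || return 1
    done
}
```

For example, pull_branch my_branch one two would merge one/my_branch and two/my_branch into my_branch, stopping at the first merge that fails.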
GitHub branch protection
If:
- the changes have been made to master in the old repo, and
- your monorepo is stored on GitHub, and
- you have branch protection set up for master,
you could create a PR from the changes instead of directly merging into master:
$ git fetch --all --no-tags
# Checkout to master first to make sure we're basing this off the latest master
$ git checkout master
# Now the new "new_one_master" will be where our current master is
$ git checkout -b new_one_master
$ git merge --strategy recursive --strategy-option subtree=one/ one/master
$ git push -u origin new_one_master
# Go to GitHub and create a PR from branch 'new_one_master'
Explanation
The contents of each repository will be moved to a subdirectory. A new branch will be created for each branch in each of those repositories, and branches of equal name will be merged.
In the example above, if both repositories one and two had a branch called feature-XXX, your new repository (core) would have one branch called feature-XXX with two directories in it: one/ and two/.
Usually, every repository will have at least a branch called master, so your new monorepo will have a branch called master with a subdirectory for each original repository's master branch.
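To check which original repositories ended up on a given monorepo branch, you can list that branch's top-level directories. A minimal sketch; branch_dirs is a hypothetical helper name, and it only shows the first directory level (so libraries placed under lib/ appear as lib).

```shell
# Hypothetical helper: print the top-level directories present on branch $1.
# "git ls-tree -d" lists only tree (directory) entries, one name per line.
branch_dirs() {
    git ls-tree -d --name-only "$1"
}
```

Run inside the monorepo, branch_dirs feature-XXX would list one directory per repository that had a feature-XXX branch.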
A detailed explanation of this program can be found in the accompanying blog post:
https://syslog.ravelin.com/multi-to-mono-repository-c81d004df3ce
Further steps
Once your new repository is created, you'll need to update your CI environment. This means merging all .travis.yml, .circle.yml and similar files into a single file in the top level. The same holds for the Makefile, which can branch off into the separate subdirectories to do independent work there.
Additionally, you will need to make a decision about vendoring, if applicable: do you want to use one vendoring dir for all your code (e.g. a top-level vendor for Go, or node_modules for node), or do you want to keep independent vendoring directories for each project? Both solutions have their respective pros and cons; which is best depends on your situation.