In order to make it easier to understand and explain the input format for the raw git crawler, we want to build a sample repository with some representative sample actions in the log, and a corresponding set of dump files.
There is an RFC for the format which showcases the simpler cases. For a fuller description, run the scripts in this repo to generate the sample repository and its dump file.
NOTE: In order to not over-engineer things, this script keeps things simple. Not all features of the above RFC are exposed in this example!
The Python script depends on some Python packages, which can be installed with pip
:
pip install -r requirements.txt
(The requirements file also installs some developer tooling, like black
and flake8
.)
Couldn't be simpler:
./create-sample-repo.sh
./create-dump.py