Push and fetch commits to IPFS with a published version of the contents.
go get -v github.com/dhappy/git-remote-ipfs/cmd/git-remote-ipfs/...
cp --no-dereference "$GOPATH/src/github.com/dhappy/git-remote-ipfs/cmd/git-remote-ipfs/git-remote-ipns" "$GOBIN"
#2 assumes the environment variables $GOPATH
and $GOBIN
are defined and $GOBIN
is in your $PATH
.
Push:
git push ipfs:: master
Clone an example repository:
git clone ipns://git-remote-ipfs.dhappy.org git-remote-ipfs
Pull a commit:
git pull ipfs://Qma5iwyvJqxzHqCT9aqyc7dxZXXGoDeSUyPYFqkCWGJw92
Push with the .git/vfs/
directory:
GIT_IPFS_VFS=t git push ipfs::
Push to an IPNS remote:
ipfs key gen --type=rsa --size=2048 mysite
git remote add ipns ipns::key:mysite
git push ipns
git pull
/
: the contents of the branch that was pushed.git/blobs/
,.git/trees/
,.git/commits/
,.git/tags/
: various Git objects stored by their SHA1 hash as filename.git/refs/heads/*
: files containing the root hash of various Git branches.git/HEAD
: the name of the branch contained in this repo
The virtual filesystem makes all the trees associated with all the commits available, but takes about twice as long to generate:
.git/vfs/messages/
: all the trees linked by commit message and sorted by date.git/vfs/authors/#{name}/
: commits sorted by author.git/vfs/rev/messages/
,.git/vfs/rev/authors/#{name}/
: the commits as before, but prefaced with a count to reverse the order.git/vfs/commits/
: vfs commits named by commit SHA1.git/vfs/trees/
: content trees named by tree SHA1.git/vfs/HEAD
: root vfs commit
Git is at its core an object database. There are four types of objects and they are stored by the SHA1 hash of the serialized form in .git/objects
.
When a remote helper is asked to push, it receives the key of a Commit object. That commit has an associated Tree and zero or more Commit parents.
Trees are lists of umasks, names, and keys for either nested Trees or Blobs.
Tags are named links to specific commits. They are frequently used to mark versions.
The helper traverses the tree and parents of the root Commit and stores them all on the remote.
Integrating Git and IPFS has been on ongoing work with several solutions over the years. The predecessor to this one used ipfs block put
to store the Commit tree using SHA1. Fetching converts the SHA1 keys in the raw git blocks to IPFS content ids and retrieves them directly.
The SHA1 keys used by Git aren't exactly the hash of the object. Each serialized form is prefaced with a header of the format: "#{type} #{size}\x00
". So a Blob in Git is this header plus the file contents.
Because the IPLD remote stores the raw Git blocks, the file data is fully present, but unreadable because of the header.
There were also technical issues because the contents of a block put
aren't sharded and there are reliability problems with large blocks.
This project mitigates that issue by leaving off the header and storing the simple serialized form in a named directory. The filename is the SHA1 key for the file contents.
When fetching, a map is created between the SHA1 keys and their CID along with the type. A traditional headered block can then be generated.
-
fetch: manifest has unsupported version: x (we support y)
on any command- This usually means that cache tracker data format has changed
- Remove the cache with:
rm -rf .git/remote-ipfs
-
panic: runtime error: invalid memory address or nil pointer dereference
- This dramatic message likely means the IPFS server isn't running.
MIT