💬 Join the Automerge Slack community
Automerge is a library of data structures for building collaborative applications in JavaScript.
A common approach to building JavaScript apps involves keeping the state of your application in model objects, such as a JSON document. For example, imagine you are developing a task-tracking app in which each task is represented by a card. In vanilla JavaScript you might write the following:
var doc = { cards: [] }
// User adds a card
doc.cards.push({ title: 'Reticulate splines', done: false })
// User marks a task as done
doc.cards[0].done = true
Automerge is used in a similar way, but the big difference is that it supports automatic syncing and merging:
-
You can have a copy of the application state locally on several devices (which may belong to the same user, or to different users). Each user can independently update the application state on their local device, even while offline, and save the state to local disk.
(Similar to git, which allows you to edit files and commit changes offline.)
-
When a network connection is available, Automerge figures out which changes need to be synced from one device to another, and brings them into the same state.
(Similar to git, which lets you push your own changes, and pull changes from other developers, when you are online.)
-
If the state was changed concurrently on different devices, Automerge automatically merges the changes together cleanly, so that everybody ends up in the same state, and no changes are lost.
(Different from git: no merge conflicts to resolve!)
- Network-agnostic. Automerge is a pure data structure library that does not care about what kind of network you use: client/server, peer-to-peer, Bluetooth, USB drive in the mail, whatever, anything goes. Bindings to particular networking technologies are handled by separate libraries; see the section on Sending and receiving changes for examples.
- Immutable state. An Automerge object is an immutable snapshot of the application state at one point in time. Whenever you make a change, or merge in a change that came from the network, you get back a new state object reflecting that change. This fact makes Automerge compatible with the functional reactive programming style of React and Redux, for example.
- Automatic merging. Automerge is a Conflict-Free Replicated Data Type (CRDT), which allows concurrent changes on different devices to be merged automatically without requiring any central server. It is based on academic research on JSON CRDTs, but the details of the algorithm in Automerge are different from the JSON CRDT paper, and we are planning to publish more detail about it in the future.
- Fairly portable. We're not yet making an effort to support old platforms, but we have tested Automerge in Node.js, Chrome, Firefox, Safari, MS Edge, and Electron. For TypeScript users, Automerge comes with type definitions that allow you to use Automerge in a type-safe way.
If you're using npm, npm install automerge
. If you're using yarn, yarn add automerge
. Then you
can import it with require('automerge')
as in the example below (or
import * as Automerge from 'automerge'
if using ES2015 or TypeScript).
Otherwise, clone this repository, and then you can use the following commands:
yarn install
— installs dependencies.yarn test
— runs the test suite in Node.yarn run browsertest
— runs the test suite in web browsers.yarn build
— creates a bundled JS filedist/automerge.js
for web browsers. It includes the dependencies and is set up so that you can load through a script tag.
For examples of real-life applications built upon Automerge, check out:
- PushPin, a mature React-based personal archiving application
- Farm, a programmable, collaborative computing environment
- Capstone, a tablet-based note-taking and idea-development tool (blog post)
- Pixelpusher, a pixel art editor (blog post).
- Trellis, a project management tool in the style of Trello.
The following code sample gives a quick overview of how to use Automerge.
// This is how you load Automerge in Node. In a browser, simply including the
// script tag will set up the Automerge object.
const Automerge = require('automerge')
// Let's say doc1 is the application state on device 1.
// Further down we'll simulate a second device.
// We initialize the document to initially contain an empty list of cards.
let doc1 = Automerge.from({ cards: [] })
// The doc1 object is treated as immutable -- you must never change it
// directly. To change it, you need to call Automerge.change() with a callback
// in which you can mutate the state. You can also include a human-readable
// description of the change, like a commit message, which is stored in the
// change history (see below).
doc1 = Automerge.change(doc1, 'Add card', doc => {
doc.cards.push({ title: 'Rewrite everything in Clojure', done: false })
})
// Now the state of doc1 is:
// { cards: [ { title: 'Rewrite everything in Clojure', done: false } ] }
// Automerge also defines an insertAt() method for inserting a new element at
// a particular position in a list. Or you could use splice(), if you prefer.
doc1 = Automerge.change(doc1, 'Add another card', doc => {
doc.cards.insertAt(0, { title: 'Rewrite everything in Haskell', done: false })
})
// { cards:
// [ { title: 'Rewrite everything in Haskell', done: false },
// { title: 'Rewrite everything in Clojure', done: false } ] }
// Now let's simulate another device, whose application state is doc2. We
// initialise it separately, and merge doc1 into it. After merging, doc2 has
// a copy of all the cards in doc1.
let doc2 = Automerge.init()
doc2 = Automerge.merge(doc2, doc1)
// Now make a change on device 1:
doc1 = Automerge.change(doc1, 'Mark card as done', doc => {
doc.cards[0].done = true
})
// { cards:
// [ { title: 'Rewrite everything in Haskell', done: true },
// { title: 'Rewrite everything in Clojure', done: false } ] }
// And, unbeknownst to device 1, also make a change on device 2:
doc2 = Automerge.change(doc2, 'Delete card', doc => {
delete doc.cards[1]
})
// { cards: [ { title: 'Rewrite everything in Haskell', done: false } ] }
// Now comes the moment of truth. Let's merge the changes from device 2 back
// into device 1. You can also do the merge the other way round, and you'll get
// the same result. The merged result remembers that 'Rewrite everything in
// Haskell' was set to true, and that 'Rewrite everything in Clojure' was
// deleted:
let finalDoc = Automerge.merge(doc1, doc2)
// { cards: [ { title: 'Rewrite everything in Haskell', done: true } ] }
// As our final trick, we can inspect the change history. Automerge
// automatically keeps track of every change, along with the "commit message"
// that you passed to change(). When you query that history, it includes both
// changes you made locally, and also changes that came from other devices. You
// can also see a snapshot of the application state at any moment in time in the
// past. For example, we can count how many cards there were at each point:
Automerge.getHistory(finalDoc).map(state => [state.change.message, state.snapshot.cards.length])
// [ [ 'Initialization', 0 ],
// [ 'Add card', 1 ],
// [ 'Add another card', 2 ],
// [ 'Mark card as done', 2 ],
// [ 'Delete card', 1 ] ]
Automerge.init()
creates a new, empty Automerge document.
const doc = Automerge.init() // doc = {}
Automerge.from(initialState)
creates a new Automerge document and populates it with the contents
of the object initialState
.
const doc = Automerge.from({ cards: [] }) // doc = { cards: [] }
The value passed to Automerge.from
must always be an object.
An Automerge document must be treated as immutable. It is never changed directly, only with the
Automerge.change
function, described below.
At the moment, Automerge does not enforce this immutability due to the performance cost. If you want to make the document object strictly immutable you can pass an option:
Automerge.init({freeze: true})
orAutomerge.load(string, {freeze: true})
.
Automerge.change(doc, message, changeFn)
enables you to modify an Automerge document doc
,
returning an updated copy of the document.
The changeFn
function you pass to Automerge.change()
is called with a mutable version of doc
,
as shown below.
The optional message
argument allows you to attach an arbitrary string to the change, which is not
interpreted by Automerge, but saved as part of the change history. You can omit the message
argument and simply call Automerge.change(doc, callback)
.
Within the callback you can use standard JavaScript object manipulation operations to change the document:
newDoc = Automerge.change(currentDoc, doc => {
// NOTE: never modify `currentDoc` directly, only ever change `doc`!
doc.property = 'value' // assigns a string value to a property
doc['property'] = 'value' // equivalent to the previous line
delete doc['property'] // removes a property
// all JSON primitive datatypes are supported
doc.stringValue = 'value'
doc.numberValue = 1
doc.boolValue = true
doc.nullValue = null
doc.nestedObject = {} // creates a nested object
doc.nestedObject.property = 'value'
// you can also assign an object that already has some properties
doc.otherObject = { key: 'value', number: 42 }
// Arrays are fully supported
doc.list = [] // creates an empty list object
doc.list.push(2, 3) // push() adds elements to the end
doc.list.unshift(0, 1) // unshift() adds elements at the beginning
doc.list[3] = Math.PI // overwriting list element by index
// now doc.list is [0, 1, 2, 3.141592653589793]
// Looping over lists works as you'd expect:
for (let i = 0; i < doc.list.length; i++) doc.list[i] *= 2
// now doc.list is [0, 2, 4, 6.283185307179586]
doc.list.splice(2, 2, 'automerge')
// now doc.list is [0, 'hello', 'automerge', 4]
doc.list[4] = { key: 'value' } // objects can be nested inside lists as well
// Arrays in Automerge offer the convenience functions `insertAt` and `deleteAt`
doc.list.insertAt(1, 'hello', 'world') // inserts elements at given index
doc.list.deleteAt(5) // deletes element at given index
// now doc.list is [0, 'hello', 'world', 2, 4]
})
The newDoc
returned by Automerge.change()
is a regular JavaScript object containing all the
edits you made in the callback. Any parts of the document that you didn't change are carried over
unmodified. The only special things about it are:
- It is treated as immutable, so all changes must go through
Automerge.change()
. - Every object has a unique ID, which you can get by passing the object to the
Automerge.getObjectId()
function. This ID is used by Automerge to track which object is which. - Objects also have information about conflicts, which is used when several users make changes to
the same property concurrently (see below). You can get conflicts using
the
Automerge.getConflicts()
function.
Automerge.save(doc)
serializes the state of Automerge document doc
to a string, which you can
write to disk. The string contains an encoding of the full change history of the document (a bit
like a git repository).
Automerge.load(str)
unserializes an Automerge document from a string that was produced by
Automerge.save()
.
The Automerge
init
,from
, andload
functions take an optionalactorId
parameter:const actorId = '1234-abcd-56789-qrstuv' const doc1 = Automerge.init(actorId) const doc2 = Automerge.from({ foo: 1 }, actorId) const doc3 = Automerge.load(str, actorId)The
actorId
is a string that uniquely identifies the current node; if you omitactorId
, a random UUID is generated. If you pass in your ownactorId
, you must ensure that there can never be two different processes with the same actor ID. Even if you have two different processes running on the same machine, they must have distinct actor IDs.Unless you know what you are doing, you should stick with the default, and let
actorId
be auto-generated.
Automerge makes it easy to support an undo/redo feature in your application. Note that undo is a somewhat tricky concept in a collaborative application! Here, "undo" is taken as meaning "what the user expects to happen when they hit ctrl+Z/⌘ Z". In particular, the undo feature undoes the most recent change by the local user; it cannot currently be used to revert changes made by other users.
Moreover, undo is not the same as jumping back to a previous version of a document; see the next section on how to examine document history. Undo works by applying the inverse operation of the local user's most recent change, and redo works by applying the inverse of the inverse. Both undo and redo create new changes, so from other users' point of view, an undo or redo looks the same as any other kind of change.
To check whether undo is currently available, use the function Automerge.canUndo(doc)
. It returns
true if the local user has made any changes since the document was created or loaded. You can then
call Automerge.undo(doc)
to perform an undo. The functions canRedo()
and redo()
do the
inverse:
let doc = Automerge.change(Automerge.init(), doc => {
doc.birds = []
})
doc = Automerge.change(doc, doc => {
doc.birds.push('blackbird')
})
doc = Automerge.change(doc, doc => {
doc.birds.push('robin')
})
// now doc is {birds: ['blackbird', 'robin']}
Automerge.canUndo(doc) // returns true
doc = Automerge.undo(doc) // now doc is {birds: ['blackbird']}
doc = Automerge.undo(doc) // now doc is {birds: []}
doc = Automerge.redo(doc) // now doc is {birds: ['blackbird']}
doc = Automerge.redo(doc) // now doc is {birds: ['blackbird', 'robin']}
You can pass an optional message
as second argument to Automerge.undo(doc, message)
and
Automerge.redo(doc, message)
. This string is used as "commit message" that describes the undo/redo
change, and it appears in the change history.
The Automerge library itself is agnostic to the network layer — that is, you can use whatever communication mechanism you like to get changes from one node to another. There are currently a few options, with more under development:
- Use
Automerge.getChanges()
andAutomerge.applyChanges()
to manually capture changes on one node and apply them on another. Automerge.Connection
, is an implementation of a protocol that syncs up two nodes by determining missing changes and sending them to each other. The automerge-net repository contains an example that runs the Connection protocol over a simple TCP connection.- MPL runs the
Automerge.Connection
protocol over WebRTC. - Hypermerge is a peer-to-peer networking layer that combines Automerge with Hypercore, part of the Dat project.
The getChanges()/applyChanges()
API works as follows:
// On one node
newDoc = Automerge.change(currentDoc, doc => {
// make arbitrary change to the document
})
let changes = Automerge.getChanges(currentDoc, newDoc)
network.broadcast(JSON.stringify(changes))
// On another node
let changes = JSON.parse(network.receive())
newDoc = Automerge.applyChanges(currentDoc, changes)
Note that Automerge.getChanges(oldDoc, newDoc)
takes two documents as arguments: an old state and
a new state. It then returns a list of all the changes that were made in newDoc
since oldDoc
. If
you want a list of all the changes ever made in newDoc
, you can call
Automerge.getChanges(Automerge.init(), newDoc)
.
The counterpart, Automerge.applyChanges(oldDoc, changes)
applies the list of changes
to the
given document, and returns a new document with those changes applied. Automerge guarantees that
whenever any two documents have applied the same set of changes — even if the changes were applied
in a different order — then those two documents are equal. That property is called convergence,
and it is the essence of what Automerge is all about.
Automerge.merge(doc1, doc2)
is a related function that is useful for testing. It looks for any
changes that appear in doc2
but not in doc1
, and applies them to doc1
, returning an updated
version of doc1
. This function requires that doc1
and doc2
have different actor IDs (that is,
they originated from different calls to Automerge.init()
). See the Usage section above
for an example using Automerge.merge()
.
Automerge allows different nodes to independently make arbitrary changes to their respective copies of a document. In most cases, those changes can be combined without any trouble. For example, if users modify two different objects, or two different properties in the same object, then it is straightforward to combine those changes.
If users concurrently insert or delete items in a list (or characters in a text document), Automerge preserves all the insertions and deletions. If two users concurrently insert at the same position, Automerge will arbitrarily place one of the insertions first and the other second, while ensuring that the final order is the same on all nodes.
The only case Automerge cannot handle automatically, because there is no well-defined resolution, is when users concurrently update the same property in the same object (or, similarly, the same index in the same list). In this case, Automerge arbitrarily picks one of the concurrently written values as the "winner":
// Initialize documents with known actor IDs
let doc1 = Automerge.change(Automerge.init('actor-1'), doc => {
doc.x = 1
})
let doc2 = Automerge.change(Automerge.init('actor-2'), doc => {
doc.x = 2
})
doc1 = Automerge.merge(doc1, doc2)
doc2 = Automerge.merge(doc2, doc1)
// Now, doc1 might be either {x: 1} or {x: 2} -- the choice is random.
// However, doc2 will be the same, whichever value is chosen as winner.
assert.deepEqual(doc1, doc2)
Although only one of the concurrently written values shows up in the object, the other values are
not lost. They are merely relegated to a conflicts object. Suppose doc.x = 2
is chosen as the
"winning" value:
doc1 // {x: 2}
doc2 // {x: 2}
Automerge.getConflicts(doc1, 'x') // {'actor-1': 1}
Automerge.getConflicts(doc2, 'x') // {'actor-1': 1}
Here, we've recorded a conflict on property x
. The key actor-1
is the actor ID that "lost" the
conflict. The associated value is the value actor-1
assigned to the property x
. You might use
the information in the conflicts object to show the conflict in the user interface.
The next time you assign to a conflicting property, the conflict is automatically considered to be
resolved, and the conflict disappears from the object returned by Automerge.getConflicts()
.
An Automerge document internally saves a complete history of all the changes that were ever made to it. This enables a nice feature: looking at the document state at past points in time, a.k.a. "time travel"!
Automerge.getHistory(doc)
returns a list of all edits made to a document. Each edit is an object
with two properties: change
is the internal representation of the change (in the same form that
Automerge.getChanges()
returns), and snapshot
is the state of the document immediately after the
change was applied.
Automerge.getHistory(doc2)
// [ { change: { message: 'Set x to 1', ... }, snapshot: { x: 1 } },
// { change: { message: 'Set x to 2', ... }, snapshot: { x: 2 } } ]
Within the change object, the property message
is set to the free-form "commit message" that was
passed in as second argument to Automerge.change()
(if any). The rest of the change object is
specific to Automerge implementation details, and normally shouldn't need to be interpreted.
If you want to find out what actually changed in a particular edit, rather than inspecting the
change object, it is better to use Automerge.diff(oldDoc, newDoc)
. This function returns a list of
edits that were made in document newDoc
since its prior version oldDoc
. You can pass in
snapshots returned by Automerge.getHistory()
in order to determine differences between historic
versions.
The data returned by Automerge.diff()
has the following form:
let history = Automerge.getHistory(doc2)
Automerge.diff(history[2].snapshot, doc2) // get all changes since history[2]
// [ { action: 'set', type: 'map', obj: '...', key: 'x', value: 1 },
// { action: 'set', type: 'map', obj: '...', key: 'x', value: 2 } ]
In the objects returned by Automerge.diff()
, obj
indicates the object ID of the object being
edited (the same as returned by Automerge.getObjectId()
), and type
indicates whether that object
is a map
, list
, or text
.
The available values for action
depend on the type of object. For type: 'map'
, the possible
actions are:
action: 'set'
: Then the propertykey
is the name of the property being updated. If the value assigned to the property is a primitive (string, number, boolean, null), thenvalue
contains that value. If the assigned value is an object (map, list, or text), thenvalue
contains the ID of that object, and additionally the propertylink: true
is set. Moreover, if this assignment caused conflicts, then the conflicting values are additionally contained in aconflicts
property.action: 'remove'
: Then the propertykey
is the name of the property being removed.
For type: 'list'
and type: 'text'
, the possible actions are:
action: 'insert'
: Then the propertyindex
contains the list index at which a new element is being inserted, andvalue
contains the value inserted there. If the inserted value is an object, thevalue
property contains its ID, and the propertylink: true
is set.action: 'set'
: Then the propertyindex
contains the list index to which a new value is being assigned, andvalue
contains that value. If the assigned value is an object, thevalue
property contains its ID, and the propertylink: true
is set.action: 'remove'
: Then the propertyindex
contains the list index that is being removed from the list.
If you have a numeric value that is only ever changed by adding or subtracting (e.g. counting how
many times the user has done a particular thing), you should use the Automerge.Counter
datatype
instead of a plain number, because it deals with concurrent changes correctly.
Note: Using the
Automerge.Counter
datatype is safer than changing a number value yourself using the++
or+= 1
operators. For example, suppose the value is currently 3:
- If two users increment it concurrently, they will both register 4 as the new value, whereas the two increments should result in a value of 5.
- If one user increments twice and the other user increments three times before the documents are merged, we will now have conflicting changes (5 vs. 6), rather than the desired value of 8 (3 + 2 + 3).
To set up a Counter
:
state = Automerge.change(state, doc => {
// The counter is initialized to 0 by default. You can pass a number to the
// Automerge.Counter constructor if you want a different initial value.
doc.buttonClicks = new Automerge.Counter()
})
To get the current counter value, use doc.buttonClicks.value
. Whenever you want to increase or
decrease the counter value, you can use the .increment()
or .decrement()
method:
state = Automerge.change(state, doc => {
doc.buttonClicks.increment() // Add 1 to counter value
doc.buttonClicks.increment(4) // Add 4 to counter value
doc.buttonClicks.decrement(3) // Subtract 3 from counter value
})
Note: In relational databases it is common to use an auto-incrementing counter to generate primary keys for rows in a table, but this is not safe in Automerge, since several users may end up generating the same counter value! See the Table datatype below for implementing a relational-like table with a primary key.
Automerge.Text
provides support for collaborative text editing. Under the hood, text is
represented as a list of characters, which is edited by inserting or deleting individual characters.
Compared to using a regular JavaScript array, Automerge.Text
offers better performance.
Note: Technically, text should be represented as a list of Unicode grapheme clusters. What the user thinks of as a "character" may actually be a series of several Unicode code points, including accents, diacritics, and other combining marks. A grapheme cluster is the smallest editable unit of text: that is, the thing that gets deleted if you press the delete key once, or the thing that the cursor skips over if you press the right-arrow key once. Emoji make a good test case, since many emoji consist of a sequence of several Unicode code points (for example, the skintone modifier is a combining mark).
You can create a Text object inside a change callback. Then you can use insertAt()
and
deleteAt()
to insert and delete characters (same API as for list modifications, shown
above):
newDoc = Automerge.change(currentDoc, doc => {
doc.text = new Automerge.Text()
doc.text.insertAt(0, 'h', 'e', 'l', 'l', 'o')
doc.text.deleteAt(0)
doc.text.insertAt(0, 'H')
})
To inspect a text object and render it, you can use the following methods (outside of a change callback):
newDoc.text.length // returns 5, the number of characters
newDoc.text.get(0) // returns 'H', the 0th character in the text
newDoc.text.toString() // returns 'Hello', the concatenation of all characters
for (let char of newDoc.text) console.log(char) // iterates over all characters
Automerge.Table
provides a collection datatype that is similar to a table in a relational
database. It is intended for a set of objects (rows) that have the same properties (columns in a
relational table). Unlike a list, the objects have no order. You can scan over the objects in a
table, or look up individual objects by their primary key. An Automerge document can contain as many
tables as you want.
Each object is assigned a primary key (a unique ID) by Automerge. When you want to reference one object from another, it is important that you use this Automerge-generated ID; do not generate your own IDs.
You can create new tables and insert rows like this:
let database = Automerge.change(Automerge.init(), doc => {
// When creating a table, provide a list of column names
doc.authors = new Automerge.Table(['surname', 'forename'])
doc.publications = new Automerge.Table([
'type',
'authors',
'title',
'publisher',
'edition',
'year',
])
// Automerge.Table.add() inserts a new row into the database
// and returns the primary key (unique ID) of the new row
const martinID = doc.authors.add({ surname: 'Kleppmann', forename: 'Martin' })
// Adding a publication that references the above author ID
const ddia = doc.publications.add({
type: 'book',
authors: [martinID],
title: 'Designing Data-Intensive Applications',
publisher: "O'Reilly Media",
year: 2017,
})
})
You can read the contents of a table like this:
// Array of row objects
database.publications.rows
// Array of row IDs (primary keys)
database.publications.ids
// Looking up a row by primary key
database.publications.byId('29f6cd15-61ff-460d-b7fb-39a5594f32d5')
// Number of rows in the table
database.publications.count
// Like "SELECT * FROM publications WHERE title LIKE 'Designing%'"
database.publications.filter(pub => pub.title.startsWith('Designing'))
// Like "SELECT publisher FROM publications"
database.publications.map(pub => pub.publisher)
Note that currently the Automerge.Table
type does not enforce a schema; the list of columns is
given because it is useful metadata, but it doesn't actually change how rows are stored. It's
possible to have row objects that don't have values for all columns (e.g. in the example above, the
"edition" property is not set).
The project currently has a number of limitations that you should be aware of:
- No integrity checking: if a buggy (or malicious) device makes corrupted edits, it can cause the application state on other devices to become corrupted or go out of sync.
- No security: there is currently no encryption, authentication, or access control.
- Storage overhead: Automerge needs to store additional metadata besides the actual objects you create; for some datatypes, such as text, the overhead is substantial. We are working on improving this.
- ...and more, see the open issues.
Copyright 2017–2019, Ink & Switch LLC, and University of Cambridge. Released under the terms of the
MIT license (see LICENSE
).
Created by Martin Kleppmann and many great contributors.