@atom/memo
TypeScript icon, indicating that this package has built-in type declarations

0.21.0 • Public • Published

Memo JS

Memo allows multiple remote collaborators to share the state of a single Git working copy. The core of the library is written in Rust for efficiency and reusability in other contexts. This library exposes the capabilities of the Rust core via WebAssembly and wraps them in an idiomatic JavaScript API.

Creating a WorkTree

WorkTree is the fundamental abstraction provided by this library. The state of a WorkTree is expressed as a sequence of fine-grained operations applied on top of a a base commit. There are two possible cases when constructing a new WorkTree:

  • We are the first collaborator, and we want to build future operations on top of a given base commit.
  • We are joining an existing collaborative session, and we want to construct the WorkTree from a sequence of existing operations.

Both scenarios are automatically handled when you call WorkTree.create:

const replicaId = generateUUID();
const baseCommitOID = "8251a3c491b3884d7f828d2a1c5c565855171a2c";
const startOps = await fetchInitialOperations();
const [tree, ops] = await WorkTree.create(
  replicaId,
  baseCommitOID,
  startOps,
  gitProvider
);
broadcast(ops);

In the example above, WorkTree.create is called with a replica id (replicaId), a base commit (baseCommitOID) and an array of existing operations (startOps). If the existing operations array is empty, we assume this is the first collaborator and initialize the tree at the provided base commit. If operations are provided, the baseCommitOID argument is ignored and the current base commit is determined from the given operations.

You can ignore how fetchInitialOperations works for now. It is not included as part of this library, and a reference implementation will be covered later in the guide.

The third parameter to WorkTree.create is an object that implements the GitProvider interface:

export interface GitProvider {
  baseEntries(oid: Oid): AsyncIterable<BaseEntry>;
  baseText(oid: Oid, path: Path): Promise<string>;
}

This provider allows the WorkTree to retrieve information from the underlying Git repository. Here's a potential implementation that reads data from GitHub:

class GitHubProvider implements GitProvider {
  async *baseEntries(oid: Oid): AsyncIterable<BaseEntry> {
    const entries = await fetch(
      `/repos/rust-lang/rust/git/trees/${oid}?recursive=1"`
    );
    for (const entry of entries) {
      yield fromGitHubEntryToBaseEntry(entry);
    }
  }

  async baseText(oid: Oid, path: Path): Promise<string> {
    const file = await fetch(
      `/repos/rust-lang/rust/contents/${path}?ref=${oid}`
    );
    return fromBase64ToString(file.content);
  }
}

The baseEntries method must return a collection that can be asynchronously iterated over and that yields memo.BaseEntry elements, like the following:

{ depth: 1, name: "a", type: memo.FileType.Directory }
{ depth: 2, name: "b.txt", type: memo.FileType.Text }
{ depth: 1, name: "c.txt", type: memo.FileType.Text }

Listing the work tree's current entries

To list the work tree's current paths, call entries. This will return an array of entries arranged in a depth-first order, similar to the entries returned by GitProvider.prototype.baseEntries. For example, the base entries populated above could be retrieved as follows:

for (const entry of tree.entries()) {
  console.log(entry.depth, entry.name, entry.type);
}

// Prints:
// 1 a Directory
// 2 b.txt File
// 1 c.txt File

Each returned entry has the following fields:

  • depth: The length of the path leading to this entry.
  • name: The entry's name.
  • path: The entry's path.
  • basePath: The entry's original path at the beginning of the commit.
  • type: The type of this file ("File" or "Directory")
  • status: How this path has changed since the base commit ("New", "Renamed", "Removed", "Modified", "RenamedAndModified", or "Unchanged")
  • visible: Whether or not this file is currently visible (not deleted).

The entries method accepts two options as fields in an optional object passed to the method.

  • showDeleted: If true, returns entries for deleted files and directories, but marks them as visible: false.
  • descendInto: An optional array of paths. If provided, the traversal will skip descending into any directory not present in this whitelist. You can use this option to limit the number of entries you need to process if you are rendering a UI with collapsed directories.

Creating, renaming and removing files

WorkTree APIs all function in terms of paths and allow you to manipulate files exactly as you would expect from a typical file system:

const op1 = tree.createFile("foo", memo.FileType.Directory);
const op2 = tree.createFile("foo/bar", memo.FileType.Text);
const op3 = tree.createFile("foo/baz", memo.FileType.Text);
const op4 = tree.rename("foo/bar", "foo/qux");
const op5 = tree.remove("foo/baz");
broadcast([op1, op2, op3, op4, op5]);

Reading and manipulating text files

To manipulate text files you'll need to call openTextFile with the path you want to open. This method will return a Buffer object that you can interact with:

const buffer = await tree.openTextFile("foo/qux");
const editOp1 = buffer.edit(
  [{ start: { row: 0, column: 0 }, end: { row: 0, column: 0 } }],
  "Hello, world!"
);
const editOp2 = buffer.edit(
  [{ start: { row: 0, column: 10 }, end: { row: 0, column: 12 } }],
  "ms"
);
console.log(buffer.getText()); // ==> "Hello worms"

const [set, createSetOp] = buffer.addSelectionSet([
  { start: point(0, 0), end: point(0, 1) }
]);
const replaceSetOp = buffer.replaceSelectionSet(set, [
  { start: point(0, 2), end: point(0, 3) }
]);
console.log(buffer.getSelections()); /* => {
  local: {
    1: [{ start: { row: 0, column: 2 }, end: { row: 0, column: 3 } }]
  },
  remote: {
    "65242244-9706-4b42-9785-fa5cbe5d5709": [
      [{ start: { row: 0, column: 2 }, end: { row: 0, column: 3 } }]
    ]
  }
}*/

const removeSetOp = buffer.removeSelectionSet(set);
broadcast([editOp1, editOp2, createSetOp, replaceSetOp, removeSetOp]);

As you incorporate operations received from other peers, you may want to use Buffer.prototype.onChange to keep an external representation of the buffer up-to-date:

buffer.onChange(change => {
  for (const textChange of change.textChanges) {
    console.log(textChange); // => { start: { row: 0, column: 0 }, end: { row: 0, column: 5 }, text: "Goodbye" }
    externalBuffer.edit(textChange.start, textChange.end, textChange.text);
  }
  console.log(change.selectionRanges); /* => {
    local: {
      1: [{ start: { row: 0, column: 2 }, end: { row: 0, column: 3 } }]
    },
    remote: {
      "65242244-9706-4b42-9785-fa5cbe5d5709": [
        [{ start: { row: 0, column: 2 }, end: { row: 0, column: 3 } }]
      ]
    }
  }*/
});

Changing the active location

Optionally, you can also retrieve the location of other peers and transmit yours using the location API:

const operation = tree.setActiveLocation(buffer);
broadcast([operation]);

console.log(tree.getReplicaLocations()); /* => {
  "65242244-9706-4b42-9785-fa5cbe5d5709": "foo/qux"
}*/

Resetting to a different base commit

If you want to reset the work tree to a different (possibly null) base (e.g. after a commit or a git reset), you can use the reset method:

const commitOid = "70403cdf91c2e6fbf76167f725935e6b0993eeb1";
const resetOps = tree.reset(commitOid);
await broadcast(resetOps);
console.log(tree.head()); // => 70403cdf91c2e6fbf76167f725935e6b0993eeb1

This resets you and all the other peers to the new commit. Note that this is an asynchronous action, as the tree needs to perform I/O in order to retrieve the new base entries.

After switching to a new base all open buffers will still be valid and you can continue using them normally.

Working with operations

All methods that update the state of the tree return operations, the fundamental primitive this library uses to synchronize with other peers. Sometimes operations are returned synchronously, sometimes they are async iterators instead. Make sure you handle both cases, as illustrated in the broadcast function later in this section.

In either case, operations are wrapped in an OperationEnvelope. An operation envelope is defined as follows:

export interface OperationEnvelope {
  epochId(): Uint8Array;
  epochTimestamp(): number;
  epochReplicaId(): string;
  operation(): Operation;
}

Technically, to synchronize with other peers, you only need to transmit the operation that is stored inside of the envelope; so, why including those extra timestamp and replica id fields?

You may recall the fetchInitialOperations function that we called when creating a new WorkTree. It turns out that, in order to instantiate a new WorkTree, you only need operations associated with the latest epoch. By exposing the epoch timestamp and replica id, we allow you to store operations such that they can be efficiently queried later when instantiating new work trees:

// Here we simulate having a database that stores every operation that has been
// generated.

async function fetchInitialOperations(): Operation[] {
  // Note that this is very inefficient. In a production system, you should
  // perform the computation contained in this function on the database, using
  // an index on the (timestamp, replicaId) tuple.
  const allEnvelopes = await database.getAllOperationEnvelopes();

  // First we sort by timestamp, then by replica id.
  const sortedEnvelopes = database
    .getAllOperationEnvelopes()
    .sort(
      (a, b) =>
        a.epochTimestamp() - b.epochTimestamp() ||
        a.epochReplicaId() - b.epochReplicaId()
    );

  // Then, we only retrieve operations for the latest epoch.
  const lastEnvelope = sortedEnvelopes[sortedEnvelopes.length - 1];
  const latestEpochEnvelopes = sortedEnvelopes.filter(
    e =>
      e.epochTimestamp() == lastEnvelope.epochTimestamp() &&
      e.epochReplicaId() == lastEnvelope.epochReplicaId()
  );

  // Finally, we unwrap the envelopes and just return the operations inside.
  return latestEpochEnvelopes.map(envelope => envelope.operation());
}

async function broadcast(
  envelopes: OperationEnvelope[] | AsyncIterable<OperationEnvelope>
) {
  for await (const envelope of envelopes) {
    // Note how we store the full envelope in the database, but we only transmit
    // the operation inside of it to peers.
    database.store(envelope);
    network.broadcast(envelope.operation());
  }
}

So far we have covered storing and trasmitting operations sent by the local replica. To apply remote operations, you should use the applyOps method:

const remoteOps = await receiveOps();
const fixupOps = tree.applyOps(remoteOps);
broadcast(fixupOps);

Whenever you call applyOps, there is a chance that additional "fixup" operations could be generated to deal with cycles and name conflicts in the tree. Be sure to broadcast these operations to peers to ensure convergence.

/@atom/memo/

    Package Sidebar

    Install

    npm i @atom/memo

    Weekly Downloads

    0

    Version

    0.21.0

    License

    MIT

    Unpacked Size

    11.2 MB

    Total Files

    32

    Last publish

    Collaborators

    • kashef
    • daviwil
    • binarymuse
    • kuychaco
    • atom-team
    • smashwilson
    • nathansobo
    • maxbrunsfeld
    • simurai
    • iolsen
    • as-cii
    • jasonrudolph
    • lee-dohm
    • donokuda