@jayrbolton/suffix-tree
DefinitelyTyped icon, indicating that this package has TypeScript declarations provided by the separate @types/jayrbolton__suffix-tree package

0.0.1 • Public • Published

suffix-tree

Suffix trees are useful for efficient string searching of suffixes and substrings. They're often used in bioinformatics on genomes. One big advantage is that you can search for the same suffix across many strings in linear time. This is an optimized implementation using Ukkonnen's algorithm and requires O(n) time and O(n) space to construct a tree for a string of length n.

You can can check whether a string is a substring of another, whether a string is a suffix of another (starting from any point), the number of occurrences of a substring, or you can find the longest repeated substring. You can also do these operations on a set of multiple strings in linear time.

Usage

var STree = require('@jayrbolton/suffix-tree')
var tree = STree.create('banana')
var suffixIndex = STree.findSuffix('ana') // -> 3
var substringIndex = STree.findSubstring('ana') // -> 1
var occurrences = STree.occurrences('ana') // -> 2
var longestRepeating = STree.longestRepeating() // -> 'ana'

You can view all tested strings in test/index.js -- if you have a string that you want to include in the test suite, please open an issue or pr.

API

STree.create(initialStr)

Initiialize a suffix tree. Returns a suffix tree object.

Optionally pass in an initial string to add to the tree

STree.create('banana')

STree.add(string, tree)

Add another, separate string to the tree. This will result in a single, larger tree that combines all strings into one tree.

Mutates the given tree and returns it.

const t = STree.create('banana')
STree.add('plantain', t)

STree.format(tree)

Return a formatted string for a tree to see how it looks. This creates an left indentation-based tree.

console.log(STree.foramt(tree))

STree.findSuffix(string, tree)

Find a given suffix for any string in the tree. Will return an array of indexes of strings that have the suffix. Returns an empty array if the suffix is not found.

You can get the string from its index by using getStringByIndex

const ids = STree.findSuffix('ana', tree)

STree.getStringByIndex(id, tree)

Return the original string based on its index

const t = STree.create('banana')
STree.add('plantain', t)
STree.getStringByIndex(0) // -> 'banana'
STree.getStringByIndex(1) // -> 'plantain'

STree.allSuffixes(tree)

Return an array of arrays of ALL suffixes for the entire tree. Traverses every path of the tree

STree.longestCommonSubstring(tree)

Get the longest common substring among all strings in a tree. Returns the actual string.

const t = STree.create('plantain')
STree.add('entertain', t)
const s = STree.longestCommonSubstring(t) // -> 'tain'

STree.findSubstrings(string, tree)

Find all occurrences of a substring in any string in the tree. Returns an object where:

  • Each key in the object is an index to a string in the tree (use getStringByIndex to get the string)
  • Each value in the object is an array of starting indexes for the substring occurrences in the string
const t = STree.create('banana')
const occs = STree.findSubstrings('an', t) // -> {0: [1, 3]}
// We found occurences in string 0 ('banana'), at starting indexes 1 and 3 inside 'banana'

Un-implemented functions

Suffix trees have a lot of other efficient functions that can be written for them. See the Wikipedia page for a comprehensive list of these functions. They could either be added to this module or created in independent modules.

Install

With npm installed, run

$ npm install @jayrbolton/suffix-tree

See Also

License

MIT

Readme

Keywords

none

Package Sidebar

Install

npm i @jayrbolton/suffix-tree

Weekly Downloads

8

Version

0.0.1

License

MIT

Unpacked Size

18 kB

Total Files

5

Last publish

Collaborators

  • jayrbolton