Search results
146 packages found
Multilingual tokenizer that automatically tags each token with its type
WHATWG HTML5 specification-compliant, fast and ready for production HTML parsing/serialization toolset for Node.js
- html
- parser
- html5
- WHATWG
- specification
- fast
- html parser
- html5 parser
- htmlparser
- parse5
- serializer
- html serializer
- htmlserializer
- sax
- View more
Tokenizes a text using regex rules and returns the tokens with their positions in the text given.
Uses snapdragon to tokenize a single JavaScript block comment into an object, with description, tags, and code example sections that can be passed to any other comment parsers for further parsing.
General purpose generic matcher
Simple utility to get unique / repeated words from strings or phrases
Easily replace and transform :props in strings.
Straightforward HTML parser for Node.js and browser
- html
- parser
- html5
- html5 parser
- htmlparser
- html parser
- html tree-constructor
- html to JSON
- html to AST
- html tokenizer
- tokenize
- tokenizer
- stream parsing
- stream parser
- View more
Create a snapdragon token. Used by the snapdragon lexer, but can also be used by plugins.
get word counts / frequencies on a per-speaker or per-category basis, or as an aggregate
Easily scan a string with an object of regex patterns to produce an array of tokens. ~100 sloc.
Uses esprima to extract line and block comments from a string of JavaScript. Also optionally parses code context (the next line of code after a comment).
Fast (~300 MB/sec) and light (~1.3 kb) JSON/UTF-8 tokenizer for creating custom parsers
Get search tokens from an INSA/NASA station name.
Quick writer for transforming tokenized JSON back into JSON/UTF-8 output (works with qb-json-tok)