ⒾⓃⓉⒺⓇⓉⒺⓍⓉ
Table of Contents generated with DocToc
InterText: Services for Recurrent Text-related Tasks
InterText provides pre-packaged solutioons for a number of tasks in text formatting and typesetting that tend to show up frequently. I'm aiming at conducing comparative benchmarks and soundness checks for all solutions. Areas covered so far include:
-
InterText HYPH for hyphenating text in multiple languages (only en-US covered so far, but underlying software is multilingual and configurable).
-
InterText SLABS for segmenting and re-assembling text according to Unicode Standard Annex #14: Unicode Line Breaking Algorithm (UAX#14); this is useful to determine line breaking opportunities (LBOs) for running text.
See also the (rough) list of planned features.
Related Links
- regexpu-core/property-escapes.md at master · mathiasbynens/regexpu-core
- Unicode property escapes in JavaScript regular expressions · Mathias Bynens
- https://unicode.org/Public/UNIDATA/PropertyValueAliases.txt
- https://unicode.org/Public/UNIDATA/PropertyAliases.txt
- UAX #24: Unicode Script Property
- UTS #18: Unicode Regular Expressions
- UAX #44: Unicode Character Database
- Unicode Utilities: UnicodeSet
To Do
- [X] use
INTERTEXT.rpr()
for tabulation instead ofJSON.stringify()
- [ ] implement path manipulation, integrate
pathmap
- [ ] integrate color-related code from DataMill colorizer
- [ ] implement number formatting using
Intl.NumberFormat
, including percentages, rounding - [ ] CupOfHTML: make compatible with Paragate HTMLish parser
- [ ] CupOfHTML: consider using template strings as in
H`div#id.class` 'content'
- [ ] turn into monorepo
- [ ] integrate
jzr-old/timetunnel