tinymce-word-paste-filter
TypeScript icon, indicating that this package has built-in type declarations

0.8.0 • Public • Published

TinyMCE Word Paste Filter (standalone)

Standalone filter for pasted MS Word HTML content; extracted from TinyMCE

npm version npm downloads

What?

This project provides a standalone version of the filter function used by TinyMCE to clean up pasted MS Word content.

Why?

TinyMCE has one of the best implementations of Word cleanup around; they have tackled pernicious problems like restoring list items to actual lists when these arrive as <p> tags with a lot of odd styles and bullet characters. However, TinyMCE is no tiny library, and does not export that capability by itself.

This repo extracts just the Word cleanup capability from TinyMCE, export a sole filter function (and its internal dependencies) into a standalone library much smaller (contributes ~50k if you minify your build) than the full TinyMCE (contribrutes at minimum >400k to your project -- with the internal Word clean function not exposed), published on npm as tinymce-word-paste-filter for easy incorporation into other projects.

How?

import wordFilter from 'tinymce-word-paste-filter';

// if you have incoming pasted Word data in your program...
const awfulWordHTML = '...'; // content here

// clean it up in one step
const cleaned = wordFilter(awfulWordHTML); 

Typescript types are included.

Please note that this is intended to be bundled into browser applications; if running in a node environment for some reason, you'll need to have browser / dom globals (document, navigator, etc) available.

See the brief test under ./test/ and its fixtures.

Building from source, testing

TinyMCE's build process is absolutely monstrous... with countless internal dependencies and remapped paths at every level. In order to pull out the minimal functionality here without all that bloat, the build first uses rollup to shake out the main components needed, then transpiles from there with tsc.

Make sure to update the submodule first.

git submodule update
cd tinymce
yarn install
yarn run build

cd ..
yarn install
yarn run build
yarn run test

Dependents (4)

Package Sidebar

Install

npm i tinymce-word-paste-filter

Weekly Downloads

11,297

Version

0.8.0

License

none

Unpacked Size

144 kB

Total Files

11

Last publish

Collaborators

  • jasonphillips