textractor-js
TypeScript icon, indicating that this package has built-in type declarations

0.1.6ย โ€ขย Publicย โ€ขย Published

textractor-js ๐Ÿšœ๐Ÿ•ธ๏ธ

A simple text extractor for various files compiled to WebAssembly. This is intended to be used directly in the browser to quickly and easily extract text from various file formats.

If you are looking for the main textractor project, you can find it here.

Installation

You can use the textractor package in your own project by installing from npm:

npm install textractor-js

This is very much a work in progress. I am working to prioritize adding PPTX and XLSX support, as well as improving the text extraction for PDFs.

Supported formats

  • [X] Text (txt)
  • [X] PDF
  • [X] Word (docx)
  • [ ] Excel (xlsx)
  • [ ] PowerPoint (pptx)
  • [ ] Images (png, jpg, etc)

Readme

Keywords

none

Package Sidebar

Install

npm i textractor-js

Weekly Downloads

28

Version

0.1.6

License

none

Unpacked Size

2.4 MB

Total Files

6

Last publish

Collaborators

  • nleroy917