char-encoding-detector

0.0.9 • Public • Published


A port of node-chardet to pure JavaScript, with no Node.js-specific code. The module is based on the ICU project (http://site.icu-project.org/), which uses character occurrence analysis to determine the most probable encoding.

Installation

npm i char-encoding-detector
yarn add char-encoding-detector

Usage

To return the encoding with the highest confidence:

import { detectEncoding, detectFileEncoding } from 'char-encoding-detector';
const encoding = detectEncoding(uint8Array);
// or
detectFileEncoding(file).then((encoding) => {});

To return the full list of possible encodings:

import { detectEncoding, detectFileEncoding } from 'char-encoding-detector';
 
const matches = detectEncoding(uint8Array, { allMatches: true });
// or
detectFileEncoding(file, { allMatches: true }).then((matches) => {});

Working with large data sets

When the data set is very large, you can trade some accuracy for better performance by sampling only the first N bytes of the buffer.
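One way to sample, sketched here without relying on any package-specific option, is to pass a truncated view of the buffer to the detector. `sampleFirstBytes` is a hypothetical helper written for this illustration, not part of the package, and the 4096-byte sample size is an arbitrary assumption.

```typescript
// Hypothetical helper (not part of char-encoding-detector):
// returns a view over the first `maxBytes` bytes of `data`.
// `subarray` does not copy; the view shares the underlying buffer.
function sampleFirstBytes(data: Uint8Array, maxBytes: number): Uint8Array {
  return data.length > maxBytes ? data.subarray(0, maxBytes) : data;
}

// Intended usage with the package (assumed, shown as a comment only):
// const encoding = detectEncoding(sampleFirstBytes(uint8Array, 4096));
```

Note that a fixed cut-off can split a multi-byte sequence at the end of the sample, which is one reason a smaller sample may be less accurate.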

Supported Encodings:

  • UTF-8
  • UTF-16 LE
  • UTF-16 BE
  • UTF-32 LE
  • UTF-32 BE
  • ISO-2022-JP
  • ISO-2022-KR
  • ISO-2022-CN
  • Shift-JIS
  • Big5
  • EUC-JP
  • EUC-KR
  • GB18030
  • ISO-8859-1
  • ISO-8859-2
  • ISO-8859-5
  • ISO-8859-6
  • ISO-8859-7
  • ISO-8859-8
  • ISO-8859-9
  • windows-1250
  • windows-1251
  • windows-1252
  • windows-1253
  • windows-1254
  • windows-1255
  • windows-1256
  • KOI8-R

License

MIT