char-encoding-detector
TypeScript icon, indicating that this package has built-in type declarations

0.0.9 • Public • Published

travis static

=====

Port of node-chardet in pure JavaScript without NodeJS specific code. Module is based on ICU project http://site.icu-project.org/, which uses character occurrence analysis to determine the most probable encoding.

Installation

npm i char-encoding-detector
yarn add char-encoding-detector

Usage

To return the encoding with the highest confidence:

import { detectEncoding, detectFileEncoding } from 'char-encoding-detector';
const encoding = detectEncoding(uint8Array);
// or
detectFileEncoding(file).then((encoding) => {});

To return the full list of possible encodings:

import { detectEncoding, detectFileEncoding } from 'char-encoding-detector';
 
const matches = detectEncoding(uint8Array, { allMatches: true });
// or
detectFileEncoding(file, { allMatches: true }).then((matches) => {});

Working with large data sets

Sometimes, when data set is huge and you want to optimize performance (in tradeoff of less accuracy), you can sample only first N bytes of the buffer.

Supported Encodings:

  • UTF-8
  • UTF-16 LE
  • UTF-16 BE
  • UTF-32 LE
  • UTF-32 BE
  • ISO-2022-JP
  • ISO-2022-KR
  • ISO-2022-CN
  • Shift-JIS
  • Big5
  • EUC-JP
  • EUC-KR
  • GB18030
  • ISO-8859-1
  • ISO-8859-2
  • ISO-8859-5
  • ISO-8859-6
  • ISO-8859-7
  • ISO-8859-8
  • ISO-8859-9
  • windows-1250
  • windows-1251
  • windows-1252
  • windows-1253
  • windows-1254
  • windows-1255
  • windows-1256
  • KOI8-R

Dependencies (0)

    Dev Dependencies (23)

    Package Sidebar

    Install

    npm i char-encoding-detector

    Weekly Downloads

    334

    Version

    0.0.9

    License

    MIT

    Unpacked Size

    519 kB

    Total Files

    14

    Last publish

    Collaborators

    • mycoach-deploy