utf-8
Encode/decode UTF8.
Usage
npm install utf-8
Encoding
A char:
UTF8;// [0xC3, 0xA9]
A string:
UTF8;// [49, 46, 51, 36, 32, 126, 61, 32, 49, 226, 130, 172]
Decoding
A char:
String;// 'é'
A string:
UTF8;// '1.3$ ~= 1€'
TypedArrays are welcome
As inputs :
var bytes=0xC3 0xA9 49 46 51 36 32 126 61 32 49 226 130 172; // The first charString;// é // The following string at the offset 2UTF8;// '1.3$ ~= 1€'
As well as outputs :
var bytes=14; // First encoding a charUTF8; // Then encoding a stringUTF8;
UTF8 encoding detection
UTF8;// true | false
This function can prove the text contained by the given bytes is not UTF-8 (or badly encoded UTF-8 string). It's not reciprocally true, especially for short strings with which false positives are frequent.
Strict mode
If you try to encode an UTF8 string in an ArrayBuffer too short to contain the complete string, it will silently fail. To avoid this behavior, use the strict mode :
UTF8;
Thanks
- The Debian project for it's free (as freedom) russian/japanese man pages used for real world files tests !