npmpackage.info

Gathering detailed insights and metrics for chardet

Other packages similar to chardet

@types/chardet 1.0.0: Stub TypeScript definitions entry for chardet, which provides its own types definitions
jschardet 3.1.4: Character encoding auto-detection in JavaScript (port of python's chardet)
bemuse-chardet 0.0.8: Fork of chardet for use in bemuse
grunt-chardet-encoding 0.1.6: Check character encoding of files using chardet.

chardet

Character encoding detection tool for NodeJS

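Version: 2.0.0
GitHub stars: 283
License: MIT
Language: TypeScript
Minified + gzipped size: 9.87 kB
Total downloads: 4,844,773,356
OpenSSF Scorecard: 4.2/10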

Installations

npm install chardet

Developer Guide

TypeScript

No

Module System

CommonJS, UMD

Node Version

18.18.0

NPM Version

9.8.1

Score

Supply Chain: 99.7
Quality: 100
Maintenance: 76.2
Vulnerability: 100
License: 100

Pull Requests

Open: 0
Total: 82
Closed: 3
Merged: 79

Issues

Open: 1
Total: 23
Closed: 22

Releases

v2.0.0

Published on 28 Sept 2023

v1.6.1

Published on 28 Sept 2023

v1.6.0

Published on 16 Jun 2023

v1.5.1

Published on 05 Jan 2023

v1.5.0

Published on 09 Oct 2022

v1.4.0

Published on 19 Oct 2021

View all 20 releases

Contributors

View all 11 contributors

Languages

TypeScript (99.69%)

JavaScript (0.31%)

Developer

runk

Download Statistics

Total Downloads

4,844,773,356

Last Day

4,067,309

Last Week

18,384,237

Last Month

82,816,170

Last Year

1,051,450,913

GitHub Statistics

283 Stars

199 Commits

73 Forks

8 Watching

17 Branches

11 Contributors

Bundle Size

Minified: 35.90 kB
Minified + Gzipped: 9.87 kB

Bundlephobia

Package Meta Information

Latest Version

2.0.0

Package Id

chardet@2.0.0

Unpacked Size

155.25 kB

Size

22.05 kB

File Count

NPM Version

9.8.1

Node Version

18.18.0

Published On

28 Sept 2023

Total Downloads

Cumulative downloads: 4,844,773,356
Last day: 4,067,309 (-3.7% compared to previous day)
Last week: 18,384,237 (-15.3% compared to previous week)
Last month: 82,816,170 (3.7% compared to previous month)
Last year: 1,051,450,913 (11.7% compared to previous year)

Dev Dependencies

@types/jest @types/node jest prettier semantic-release ts-jest ts-node typescript

chardet

Chardet is a character encoding detection module written in pure TypeScript. The module uses occurrence analysis to determine the most probable encoding.

Packed size is only 22 KB
Works in all environments: Node / Browser / Native
Works on all platforms: Linux / Mac / Windows
No dependencies
No native code / bindings
100% written in TypeScript
Extensive code coverage

Installation

npm i chardet

Usage

To return the encoding with the highest confidence:

import chardet from 'chardet';

// Detect from an in-memory buffer
const fromBuffer = chardet.detect(Buffer.from('hello there!'));
// or asynchronously from a file
const fromFile = await chardet.detectFile('/path/to/file');
// or synchronously from a file
const fromFileSync = chardet.detectFileSync('/path/to/file');

To return the full list of possible encodings, use the analyse method:

import chardet from 'chardet';
chardet.analyse(Buffer.from('hello there!'));

The returned value is an array of objects sorted by confidence in descending order:

[
  { confidence: 90, name: 'UTF-8' },
  { confidence: 20, name: 'windows-1252', lang: 'fr' },
];
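
The relationship between the full result list and the single best encoding can be sketched as follows. This is only an illustration: the Match shape is inferred from the sample output above, and topEncoding is a hypothetical helper, not part of chardet's API. It runs on a hard-coded array, so it does not call the library at all.

```typescript
// Shape of one result entry, inferred from the sample output above.
// `lang` appears only on some entries, so it is optional here.
interface Match {
  confidence: number;
  name: string;
  lang?: string;
}

// Hypothetical helper: name of the highest-confidence match, or null if empty.
function topEncoding(matches: Match[]): string | null {
  if (matches.length === 0) return null;
  // Pick the maximum explicitly instead of relying on the input order.
  const best = matches.reduce((a, b) => (b.confidence > a.confidence ? b : a));
  return best.name;
}

const sample: Match[] = [
  { confidence: 90, name: 'UTF-8' },
  { confidence: 20, name: 'windows-1252', lang: 'fr' },
];
console.log(topEncoding(sample)); // UTF-8
```

Returning null for an empty list keeps the "nothing matched" case distinct from a low-confidence guess.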

In the browser, you can use a Uint8Array instead of a Buffer:

import chardet from 'chardet';
chardet.analyse(new Uint8Array([0x68, 0x65, 0x6c, 0x6c, 0x6f]));
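
The byte values in the browser example are the UTF-8 bytes of the string "hello". As a small sketch of how such an array can be produced without Buffer, the standard TextEncoder API (available in browsers and in recent Node versions) can be used; the variable names here are illustrative only.

```typescript
// TextEncoder always encodes to UTF-8 and returns a Uint8Array.
const helloBytes: Uint8Array = new TextEncoder().encode('hello');

// Render each byte as hex to compare with the literal array above.
const hexBytes = Array.from(helloBytes)
  .map((b) => '0x' + b.toString(16))
  .join(', ');
console.log(hexBytes); // 0x68, 0x65, 0x6c, 0x6c, 0x6f
```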

Working with large data sets

When the data set is large and you want to trade some accuracy for speed, you can sample only the first N bytes of the buffer:

chardet
  .detectFile('/path/to/file', { sampleSize: 32 })
  .then((encoding) => console.log(encoding));

You can also specify the offset at which to begin reading:

chardet
  .detectFile('/path/to/file', { sampleSize: 32, offset: 128 })
  .then((encoding) => console.log(encoding));
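
To make the effect of sampleSize and offset concrete, the following sketch shows which bytes such a window would cover on an in-memory array. sampleBytes is a hypothetical helper written for illustration and is not how chardet is implemented internally; it only assumes that offset marks the first byte and sampleSize the maximum number of bytes examined.

```typescript
// Hypothetical helper (not part of chardet): returns the window of bytes that
// starts at `offset` and spans at most `sampleSize` bytes.
function sampleBytes(data: Uint8Array, sampleSize?: number, offset = 0): Uint8Array {
  const end = sampleSize === undefined ? data.length : offset + sampleSize;
  // subarray clamps out-of-range indices and does not copy the underlying memory.
  return data.subarray(offset, end);
}

// 256 bytes whose value equals their index, so the window is easy to inspect.
const allBytes = new Uint8Array(256);
for (let i = 0; i < allBytes.length; i++) allBytes[i] = i;

const sampled = sampleBytes(allBytes, 32, 128);
console.log(sampled.length, sampled[0], sampled[31]); // 32 128 159
```

A smaller window means less data to analyse, which is where the speed gain and the accuracy loss both come from.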

Supported Encodings:

UTF-8
UTF-16 LE
UTF-16 BE
UTF-32 LE
UTF-32 BE
ISO-2022-JP
ISO-2022-KR
ISO-2022-CN
Shift_JIS
Big5
EUC-JP
EUC-KR
GB18030
ISO-8859-1
ISO-8859-2
ISO-8859-5
ISO-8859-6
ISO-8859-7
ISO-8859-8
ISO-8859-9
windows-1250
windows-1251
windows-1252
windows-1253
windows-1254
windows-1255
windows-1256
KOI8-R

Currently only these encodings are supported.
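
Once an encoding name is known, the bytes still need to be decoded. The sketch below shows one possible way to do that with the standard TextDecoder API. tryDecode is a hypothetical helper, not part of chardet, and it assumes the runtime recognizes the detected name as a TextDecoder label. That is not guaranteed for every name in the list above (UTF-32, for example, is not part of the WHATWG Encoding Standard), so the helper returns null instead of throwing.

```typescript
// Hypothetical helper (not part of chardet): decode bytes using a detected
// encoding name, or return null when the runtime does not know that label.
function tryDecode(bytes: Uint8Array, encoding: string): string | null {
  try {
    // The TextDecoder constructor throws a RangeError for unknown labels.
    return new TextDecoder(encoding).decode(bytes);
  } catch {
    return null;
  }
}

const hiBytes = new Uint8Array([0x68, 0x69]);
console.log(tryDecode(hiBytes, 'UTF-8')); // hi
console.log(tryDecode(hiBytes, 'UTF-32 LE')); // null
```

For encodings the runtime cannot decode, a dedicated decoding library would be needed; which one is a separate choice outside the scope of this package.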

TypeScript?

Yes. Type definitions are included.

References

ICU project http://site.icu-project.org/

Fuzzing

Determines if the project uses fuzzing.

0

SAST

Determines if the project uses static code analysis.

Score

4.2/10

Last Scanned on 2025-01-27

The Open Source Security Foundation is a cross-industry collaboration to improve the security of open source software (OSS). The Scorecard provides security health metrics for open source projects.