Skip to content

Reduce file sizes #806

@Balearica

Description

@Balearica

The amount of data loaded by Tesseract.js is quite large. For example, if default settings are used, a new user will end up downloading 15.34 MB of JavaScript and language data before recognition will be run (not taking into account compression). While this is largely mitigated by caching language data after it is first downloaded (and should not be an issue for Node users at all), this amount of data likely causes annoyance for first-time browser users.

File Size
tesseract.min.js 0.07 MB
worker.min.js 0.13 MB
tesseract-core-simd.wasm.js 4.74 MB
eng.traineddata.gz 10.4 MB
total 15.34 MB

We should investigate whether this can be reduced without significant tradeoffs (e.g. runtime increase, dropping support for file formats, etc.).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions