Evaluate other language identification methods.

This issue is a reminder for myself.

Possible options:
* Character frequencies
* 2-grams?
* The most frequent words (100 or 1000)?
* Smart/complex resolution between _LangA_ and _LangB_ by identifying traits that are present in one language and absent in the other. This could help when two languages have very similar statistical characteristics.
* [Řehůřek and Kolkus (2009)](https://radimrehurek.com/cicling09.pdf)
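To make a couple of these options concrete, here is a minimal Python sketch that combines character 2-gram profiles with a trait check. It is only an illustration under stated assumptions: the names `bigram_profile`, `cosine`, `detect`, `SAMPLES`, and `TRAITS` are hypothetical, the training sentences are toy data, and cosine similarity is just one possible way to compare profiles. It is not how this project currently detects languages.

```python
from collections import Counter
from math import sqrt


def bigram_profile(text: str) -> Counter:
    """Count overlapping character 2-grams in the lowercased text."""
    text = text.lower()
    return Counter(text[i:i + 2] for i in range(len(text) - 1))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


# Toy training samples (assumption: real profiles would need far more text).
SAMPLES = {
    "eng": "the quick brown fox jumps over the lazy dog and then the cat",
    "deu": "der schnelle braune fuchs springt über den faulen hund und die katze",
}
PROFILES = {lang: bigram_profile(text) for lang, text in SAMPLES.items()}

# Hypothetical trait table: characters present in one language and absent in the other.
TRAITS = {"deu": set("äöüß")}


def detect(text: str) -> str:
    """Return the language code whose profile best matches the text."""
    lowered = text.lower()
    # Simplification: a distinguishing trait decides immediately.
    for lang, chars in TRAITS.items():
        if any(ch in chars for ch in lowered):
            return lang
    # Otherwise fall back to the closest 2-gram profile.
    profile = bigram_profile(text)
    return max(PROFILES, key=lambda lang: cosine(profile, PROFILES[lang]))
```

In this sketch the trait check runs before the statistical comparison. A more careful version would probably apply it only when the two best-scoring profiles are too close to call.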

See:
* [Language Identification (wiki article)](https://en.wikipedia.org/wiki/Language_identification)

Evaluate other language identification methods. #117
