Tesseract homemade French spellcheck dictionary #9641
ghost
started this conversation in
Show and tell
Replies: 0 comments
Hi @niksedk,
Thank you for your software. As a hobby I'm writing a small spellcheck dictionary parser. My project will never be made public, so I thought I should share my dictionary here as a contribution, in case it is useful for everyone.
This is the result of more than a year of spellchecking French subtitles with Gaupol. The subtitles came from various sources: sup2srt and pgsTosrt, which are command-line OCR tools that use Tesseract 5.5, and also Opensubtitles, where the files were probably created using SE.
If you decide to merge it into your "fra_OCRFixReplaceList.xml", just reply here, or tell me whatever format suits you, and I will post future updates whenever there are significant changes.
Note: duplicates may exist.
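In case it helps with the duplicates before merging, here is a minimal Python sketch of one way to drop repeated entries. It assumes the replace list is an XML file whose root contains sections of entries carrying `from`/`to` attributes; the element names (`OCRFixReplaceList`, `WholeWords`, `Word`), the sample entries, and the helper `drop_duplicates` are assumptions made for illustration, not a verified copy of the real file.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: element names, attribute names, and entries are
# assumptions for illustration, not taken from fra_OCRFixReplaceList.xml.
SAMPLE = """<OCRFixReplaceList>
  <WholeWords>
    <Word from="ll" to="Il" />
    <Word from="ll" to="Il" />
    <Word from="I'homme" to="l'homme" />
  </WholeWords>
</OCRFixReplaceList>"""


def drop_duplicates(root):
    """Remove repeated entries within each section; return how many were removed."""
    removed = 0
    for section in root:
        seen = set()
        # Iterate over a copy, because the section is modified in the loop.
        for entry in list(section):
            # Two entries count as duplicates when tag and all attributes match.
            key = (entry.tag, tuple(sorted(entry.attrib.items())))
            if key in seen:
                section.remove(entry)
                removed += 1
            else:
                seen.add(key)
    return removed


root = ET.fromstring(SAMPLE)
removed_count = drop_duplicates(root)
remaining = [(e.get("from"), e.get("to")) for e in root.iter("Word")]
print(removed_count, remaining)
```

Duplicates are only compared within the same section, since the same pair could legitimately appear under different sections with different meanings. The first occurrence is kept so the original order is preserved.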
Best.