Getting pixel coordinates as output #593

MrSpaghettiCode · 2022-01-21T14:14:19Z

MrSpaghettiCode
Jan 21, 2022

Hello,

i am using your wrapper to ocr some documents and i was wondering if it is possible to get pixel coordinates of found words.
I am trying to read certain data by drawing rectangles around it and feeding them to tess afterwards, but unfortunately the datacoordinates vary from document to document.

so, what i want to do:

let tesseract search for buzzword.
get coordinates of found word.
draw a rectangle around a certain area
feed the rectangle to tesseract

Normaly i would just scan the whole thing and extract the data, but since my scans are super unrelyable and there is much unneeded data, i have to do this rectangle stuff.

thx in advance

charlesw · 2022-01-21T20:53:39Z

charlesw
Jan 21, 2022
Maintainer

The bounds are available on the iterators. See console demo here: https://github.com/charlesw/tesseract-samples So in your case: * OCR page * Iterator through results identifying words of interest * Get regions from iterator (TryGetBoundingBox) Hope that helps 🙂

…

On Sat, 22 Jan 2022, 01:14 MrSpaghettiCode, ***@***.***> wrote: Hello, i am using your wrapper to ocr some documents and i was wondering if it is possible to get pixel coordinates of found words. I am trying to read certain data by drawing rectangles around it and feeding them to tess afterwards, but unfortunately the datacoordinates vary from document to document. so, what i want to do: 1. let tesseract search for buzzword. 2. get coordinates of found word. 3. draw a rectangle around a certain area 4. feed the rectangle to tesseract Normaly i would just scan the whole thing and extract the data, but since my scans are super unrelyable and there is much unneeded data, i have to do this rectangle stuff. thx in advance — Reply to this email directly, view it on GitHub <#593>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAB7HSE542HVWET3JTK7IWLUXFS4LANCNFSM5MPUDNYQ> . Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>. You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

1 reply

MrSpaghettiCode Jan 24, 2022
Author

hey, thx for the reply, this helped me a lot.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Getting pixel coordinates as output #593

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Getting pixel coordinates as output #593

Uh oh!

MrSpaghettiCode Jan 21, 2022

Replies: 1 comment · 1 reply

Uh oh!

charlesw Jan 21, 2022 Maintainer

Uh oh!

MrSpaghettiCode Jan 24, 2022 Author

MrSpaghettiCode
Jan 21, 2022

Replies: 1 comment 1 reply

charlesw
Jan 21, 2022
Maintainer

MrSpaghettiCode Jan 24, 2022
Author