Still doesn't find 'print' #2
Comments
Thanks, I will look at this in the next few days. |
By the way, at least a year ago I fixed the Tesseract bug that makes the word PRINT disappear, but the Tesseract project lacks the testing capacity to approve my PR.
|
This PR is nearly two years old: tesseract-ocr/tesseract#3899 |
I've seen my patch work with the Tesseract 5.1.0-derived build that balearica made for Scribeocr, when run from within Scribeocr.
|
You are leagues ahead of me in troubleshooting this problem. I have tinkered with OpenCV's color-selection tools to try to leverage multi-coloured blocks, but tackling this in Tesseract proper makes a lot of sense. I will leave this issue open for a few days, but I don't really have anything to add. |
For reproduction steps, see the workaround for #1.
At the top right is the text 'print', which this script still doesn't find.