Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request / Question] Use different OCR engine #491

Open
artt opened this issue Mar 15, 2024 · 1 comment
Open

[Feature Request / Question] Use different OCR engine #491

artt opened this issue Mar 15, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@artt
Copy link

artt commented Mar 15, 2024

I find Apple's OCR (through ocrmac much more reliable than PDF Miner, especially with Thai script. It already outputs text and its rectangular boundaries. Wondering if it's possible to specify a custom OCR engine or how hard would it be to incorporate this feature. Thanks!


I mistakenly labeled this as bug but have no way to edit this. Sorry.

@artt artt added the bug Something isn't working label Mar 15, 2024
@bosd
Copy link

bosd commented Aug 11, 2024

Hey!

As #343, we try to build a maintained fork at pypdf_table_extraction.

The closest thing to an alternative ocr engine was here:
#209
But it has never been finished.

Please open an issue/pr in the new repo if you like to discuss this further.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants