[Feature Request / Question] Use different OCR engine #491

artt · 2024-03-15T17:27:37Z

I find Apple's OCR (through ocrmac much more reliable than PDF Miner, especially with Thai script. It already outputs text and its rectangular boundaries. Wondering if it's possible to specify a custom OCR engine or how hard would it be to incorporate this feature. Thanks!

I mistakenly labeled this as bug but have no way to edit this. Sorry.

The text was updated successfully, but these errors were encountered:

bosd · 2024-08-11T19:51:55Z

Hey!

As #343, we try to build a maintained fork at pypdf_table_extraction.

The closest thing to an alternative ocr engine was here:
#209
But it has never been finished.

Please open an issue/pr in the new repo if you like to discuss this further.

artt added the bug Something isn't working label Mar 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request / Question] Use different OCR engine #491

[Feature Request / Question] Use different OCR engine #491

artt commented Mar 15, 2024 •

edited

Loading

bosd commented Aug 11, 2024

[Feature Request / Question] Use different OCR engine #491

[Feature Request / Question] Use different OCR engine #491

Comments

artt commented Mar 15, 2024 • edited Loading

bosd commented Aug 11, 2024

artt commented Mar 15, 2024 •

edited

Loading