Add endpoint for text recognition (OCR) #447

DavidMStraub · 2023-11-03T16:45:50Z

Fixes #445, uses pytesseract.

New endpoint: POST to /media/<handle>/ocr

Query parameters:

lang (required) - a tesseract language code (e.g. eng)
format: the output format (default is string)

It's POST because the task queue is used when available. Although OCR should be pretty fast, this seems more robust and might allow OCR'ing multi-page PDFs in the future (currently, only images are supported).

DavidMStraub added 5 commits November 3, 2023 17:43

Implement OCR

7afbaca

Fix test

a7575e5

Install tesseract in CI

3a8b29e

Fix test

d3d30fa

Return string, not JSON

72da2f2

DavidMStraub mentioned this pull request Nov 4, 2023

UI for OCR gramps-project/gramps-web#302

Merged

DavidMStraub merged commit c98d3cf into gramps-project:master Nov 5, 2023
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add endpoint for text recognition (OCR) #447

Add endpoint for text recognition (OCR) #447

DavidMStraub commented Nov 3, 2023

Add endpoint for text recognition (OCR) #447

Add endpoint for text recognition (OCR) #447

Conversation

DavidMStraub commented Nov 3, 2023