Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bordered tables are not detected properly #223

Open
harundiri opened this issue Oct 13, 2024 · 0 comments
Open

bordered tables are not detected properly #223

harundiri opened this issue Oct 13, 2024 · 0 comments

Comments

@harundiri
Copy link

harundiri commented Oct 13, 2024

I use this to extract tables. I also extract text from non-table areas by creating a mask of the non-table areas using the bounding boxes of the tables.
for some images, Image.extract_tables does not seem to detect tables properly.
Here is an example image:
page_2

and this is the image after creating a mask of the non-table areas for regular text extraction
non_table_regions

it is not detecting the first rows of both tables and the last row of the first table in the image.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant