You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In some cases, there is an index error while using the Hybrid parser on a multipage pdf.
It is described and tested in #251
What is merged there is rather a workaround then a fix.
A it now fails gracefully.
Potential better fix would be to re-assemble the parts of the table detcted by the netwerk parser into the hybrid parser.
That part of the code also contained a TODO note from the original author.
Describe the bug
In some cases, there is an index error while using the Hybrid parser on a multipage pdf.
It is described and tested in #251
What is merged there is rather a workaround then a fix.
A it now fails gracefully.
Steps to reproduce the bug
See
pypdf_table_extraction/tests/test_network.py
Lines 145 to 157 in 35d8d20
Expected behavior
Potential better fix would be to re-assemble the parts of the table detcted by the netwerk parser into the hybrid parser.
That part of the code also contained a TODO note from the original author.
pypdf_table_extraction/camelot/parsers/network.py
Lines 935 to 957 in 35d8d20
PDF
tabula/schools.pdf
Screenshots
The text was updated successfully, but these errors were encountered: