Releases: xavctn/img2table
Releases · xavctn/img2table
img2table 1.4.0
What's Changed
- Migrate PDF library to pypdfium2 for licence compliance
Full Changelog: 1.3.1...1.4.0
img2table 1.3.1
img2table 1.3.0
Features
- Complete overhaul of the line detection algorithm to improve detection of lines defined by background color changes
- Improvement in detection of semi-bordered cells
- Update detection of rows in borderless tables
- Add support for Surya OCR
- Add detection of implicit columns via the
implicit_columns
parameter - Optimization of code performance via numba refactoring
- Update of examples notebooks
Bug fixes
- Fix bug with text position when extracting text from rotated PDFs
img2table 1.2.11
- Simpler and more consistent line detection
- Detection of discontinuous columns in borderless tables
1.2.10
1.2.9
img2table 1.2.8
- Fix division by zero bug introduced in previous release
img2table 1.2.7
- Fix bugs
- Improve computation of image metrics on noisy documents
- Modify row detection for borderless tables in order to account for merged cells
- Implement Adaptive Run Length Smoothing Algorithm in order to isolate text areas in images
img2table 1.2.6
- Fix bugs related to OCR / table content extraction
img2table 1.2.5
- Fix bug in line detection
- Fix bug in cell creation
- Optimization of algorithm performances