magic_pdf-0.9.1-released
myhloli
released this
06 Nov 04:07
·
106 commits
to master
since this release
What's Changed
- Feat/tune docs by @icecraft in #833
- fix(ocr_mkcontent): improve content handling for different languages and equation types by @myhloli in #839
- feat(list): improve list detection algorithm & fix(list): improve list identification accuracy by @myhloli in #843
- docs(tutorial): update magic-pdf command with output directory by @myhloli in #844
- feat(para_split_v3): improve list identification with block aspect ratio by @myhloli in #845
- fix(dict2md): improve text concatenation logic by @myhloli in #847
- Update pdf_extract_kit.py by @CiaranYoung in #853
- feat(table): upgrade StructEqTable model and integrate into PDF Extract Kit by @myhloli in #854
- feat(model): add HTML minification to StructTableModel by @myhloli in #855
- chore: add .gitattributes to configure file linguist attributes by @myhloli in #856
- fix(merge_text): add ligature replacement functionality #305 #241 by @myhloli in #857
- chore: add CSS and SCSS files to linguist-vendored- Update .gitattributes to mark CSS and SCSS files as vendored by @myhloli in #858
- docs(README): update Colab demo link by @myhloli in #860
- fix(table): improve table image processing by @myhloli in #866
- docs(faq): add troubleshooting for illegal instruction error on Linux servers by @myhloli in #867
- feat: mineru_demo接口文档替换为链接 by @LollipopsAndWine in #871
- test(table): improve HTML validation for table extraction by @myhloli in #874
- docs: update arXiv paper link in README files by @myhloli in #875
- docs(README): update changelog for v0.9.1 release by @myhloli in #877
New Contributors
- @CiaranYoung made their first contribution in #853
Full Changelog: magic_pdf-0.9.0-released...magic_pdf-0.9.1-released