mhjohnson/pypdf2xml

 
 


pypdf2xml

This project started as an alternative to poppler's pdftoxml, which didn't properly decode CID Type 2 (CIDFontType2) fonts in PDFs. The script requires pdfminer.
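As a rough illustration of the idea (not this project's actual code), the sketch below reads positioned text boxes with pdfminer and serializes them as XML. It assumes the pdfminer.six fork, which provides `pdfminer.high_level.extract_pages`. The function names `extract_boxes` and `boxes_to_xml`, and the `document`/`page`/`text` element and attribute names, are hypothetical and chosen only for this example.

```python
import xml.etree.ElementTree as ET


def extract_boxes(pdf_path):
    """Yield (page_number, [(x0, y0, x1, y1, text), ...]) for each page."""
    # Imported lazily so boxes_to_xml() works without pdfminer installed.
    # Assumption: the pdfminer.six fork, which ships pdfminer.high_level.
    from pdfminer.high_level import extract_pages
    from pdfminer.layout import LTTextContainer

    for number, layout in enumerate(extract_pages(pdf_path), start=1):
        boxes = [
            (*element.bbox, element.get_text().strip())
            for element in layout
            if isinstance(element, LTTextContainer)
        ]
        yield number, boxes


def boxes_to_xml(pages):
    """Build an XML string from (page_number, boxes) pairs.

    The element and attribute names here are hypothetical.
    """
    root = ET.Element("document")
    for number, boxes in pages:
        page = ET.SubElement(root, "page", number=str(number))
        for x0, y0, x1, y1, text in boxes:
            node = ET.SubElement(
                page,
                "text",
                left=f"{x0:.1f}",
                bottom=f"{y0:.1f}",
                width=f"{x1 - x0:.1f}",
                height=f"{y1 - y0:.1f}",
            )
            node.text = text
    return ET.tostring(root, encoding="unicode")
```

With pdfminer.six installed, something like `print(boxes_to_xml(extract_boxes("input.pdf")))` would print the XML for a file, where `input.pdf` is a placeholder path. Keeping extraction separate from serialization means the XML step can be exercised with hand-written boxes and no PDF at all.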

License

Public domain.

About

Convert text from PDF to XML.
