-
Notifications
You must be signed in to change notification settings - Fork 124
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reading contents of a PDF #195
Comments
What content do you want?
You can use the pdf crate in two version:
The pdf_render and pdf_text crates only work with the latest master. |
What are the pdf_render and pdf_text crates ? crates.io doesn't know anything about that. |
They are not on crates.io because they do not meet my stability requirements for publishing there. |
pdf-extract crate exists, but depends on I'd be curious to see a C/Rust comparison but with poppler against |
Any chance for an easy example that just converts a PDF file to a String? I need to search through valid utf8 text of a pdf and not panic if the pdf is formatted in any unexpected way.. Documentation found regarding this seems so scarce.. |
If pdf_text does not do what you need, then no, there is no easy example. |
Ahh thank you for clarifying that! Unrelated to this project i was working with lopdf on that task. Sadly lopdf just panics in every case and does not error instead. |
Oh sure. If everything is in standard encoding, it is easy. pdf_render is used in production with "random" PDFs. And it's not great for a server to crash from a user supplied PDF. |
@santiagomed @alexis779 |
Is there an example on how to simply read the contents of a PDF successfully? I tried looking into read.rs but it seems to be outdated so I can't run it. Any way to read a PDF?
The text was updated successfully, but these errors were encountered: