Support PDF file uploads #1471

Open
aalemayhu opened this issue Apr 28, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@aalemayhu
Contributor

People are uploading PDF files, which are not supported at all and result in no flashcards being generated. I am thinking of adding some basic content analysis and then generating flashcards from that. This is a better approach than the current failed uploads. Of course this adds complexity, but there are open source alternatives out there to do the heavy lifting.

Interesting projects

  • Apache Tika - a content analysis toolkit
  • Poppler - a PDF rendering library based on the xpdf-3.0 code base.
  • Apache PDFBox - open source Java tool for working with PDF documents.
  • pdfminer.six - community maintained fork of the original PDFMiner.
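
To make the "basic content analysis" idea concrete, here is a minimal sketch assuming pdfminer.six is the library picked from the list above. `extract_text` is pdfminer.six's high-level helper; `extract_pdf_text` and `text_to_cards` are hypothetical names, and the "Term: definition" rule is only an illustrative heuristic, not how the project generates flashcards.

```python
from typing import List, Tuple


def extract_pdf_text(path: str) -> str:
    """Extract plain text from a PDF via pdfminer.six (assumed to be installed)."""
    # Imported lazily so the card heuristic below works without pdfminer.six.
    from pdfminer.high_level import extract_text

    return extract_text(path)


def text_to_cards(text: str) -> List[Tuple[str, str]]:
    """Hypothetical heuristic: each 'Term: definition' line becomes one card."""
    cards = []
    for line in text.splitlines():
        # Split on the first colon only; later colons stay in the back side.
        front, sep, back = line.partition(":")
        if sep and front.strip() and back.strip():
            cards.append((front.strip(), back.strip()))
    return cards
```

Something like `text_to_cards(extract_pdf_text("notes.pdf"))` would then return (front, back) pairs, or an empty list when nothing matches, which could be reported to the user instead of a silent failed upload.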
@aalemayhu aalemayhu added the enhancement New feature or request label Apr 28, 2024
@aalemayhu
Contributor Author

Another issue is password-protected files: should that be handled as part of this or separately?
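
Whichever way that is decided, an upload could be classified before any parsing is attempted. The sketch below is a rough heuristic using only the standard library: it checks for the `%PDF-` header and for an `/Encrypt` entry, which encrypted PDFs carry in their trailer dictionary. The function names and return labels are hypothetical, and a raw byte search can produce false positives, so a real implementation would likely defer to the chosen PDF library.

```python
def looks_like_pdf(data: bytes) -> bool:
    """True if the '%PDF-' header appears near the start of the file."""
    # Some readers tolerate leading junk, so search the first 1024 bytes.
    return b"%PDF-" in data[:1024]


def looks_encrypted(data: bytes) -> bool:
    """Heuristic: encrypted PDFs reference an /Encrypt dictionary."""
    # Raw byte search; may match unrelated content that contains this token.
    return b"/Encrypt" in data


def classify_upload(data: bytes) -> str:
    """Return 'not_pdf', 'encrypted_pdf', or 'pdf' (hypothetical labels)."""
    if not looks_like_pdf(data):
        return "not_pdf"
    if looks_encrypted(data):
        return "encrypted_pdf"
    return "pdf"
```

With this in place, an `encrypted_pdf` result could trigger a clear "password-protected files are not supported yet" message while plain PDFs continue to the extraction step.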

@aalemayhu
Contributor Author

This is being worked on and has been moved to a private Jira because some PDF documents are sensitive.
