Skip to content

Intelligently convert PDF files into EPUB files with OCR and spellcheck

License

Notifications You must be signed in to change notification settings

jvadair/bookify

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Bookify

A program for intelligently converting PDF files into EPUB ones. Also contains an interactive spell-checker to fix errors made by OCR.

Features

  • Text reflow
  • Built in OCR for non-searchable PDFs
  • Page number removal
  • Interactive spell-check
  • Metadata (added via interactive prompt)
  • Interactive table of contents

Limitations

  • Only supports simple books where chapters are titled "Chapter 1", "chapter one", etc. (Not "1" or "One"!)
  • Only 1 author can be added
  • Doesn't support front/end matter* (end matter will be added to the last chapter)

Usage

Clone the repository and run main.py (with python). That's it! Remember where your files are relevant to the cloned folder/directory.

Contributing

Bookify is more of a proof-of-concept than a polished product. Contributions are definitely welcome! E-mail [email protected] if you'd like to help.

About

Intelligently convert PDF files into EPUB files with OCR and spellcheck

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages