PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Community

Join us on Discord here: #pymupdf

Installation

PyMuPDF requires Python 3.9 or later, install using pip with:

pip install PyMuPDF

There are no mandatory external dependencies. However, some optional features become available only if additional packages are installed.

You can also try without installing by visiting PyMuPDF.io.

Usage

Basic usage is as follows:

import pymupdf # imports the pymupdf library
doc = pymupdf.open("example.pdf") # open a document
for page in doc: # iterate the document pages
  text = page.get_text() # get plain text encoded as UTF-8

Documentation

Full documentation can be found on pymupdf.readthedocs.io.

Optional Features

fontTools for creating font subsets.
pymupdf-fonts contains some nice fonts for your text output.
Tesseract-OCR for optical character recognition in images and document pages.

About

PyMuPDF adds Python bindings and abstractions to MuPDF, a lightweight PDF, XPS, and eBook viewer, renderer, and toolkit. Both PyMuPDF and MuPDF are maintained and developed by Artifex Software, Inc.

PyMuPDF was originally written by Jorj X. McKie.

License and Copyright

PyMuPDF is available under open-source AGPL and commercial license agreements. If you determine you cannot meet the requirements of the AGPL, please contact Artifex for more information regarding a commercial license.

Name		Name	Last commit message	Last commit date
Latest commit History 2,672 Commits
.github		.github
.vs		.vs
docs		docs
scripts		scripts
src		src
src_classic		src_classic
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
.python-version		.python-version
.readthedocs.yaml		.readthedocs.yaml
COPYING		COPYING
README.md		README.md
READMEb.md		READMEb.md
READMEd.md		READMEd.md
changes.txt		changes.txt
pipcl.py		pipcl.py
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
setup.py		setup.py
valgrind.supp		valgrind.supp
wdev.py		wdev.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyMuPDF

Community

Installation

Usage

Documentation

Optional Features

About

License and Copyright

About

Releases 154

Packages

Used by 34.6k

Contributors 70

Languages

License

pymupdf/PyMuPDF

Folders and files

Latest commit

History

Repository files navigation

PyMuPDF

Community

Installation

Usage

Documentation

Optional Features

About

License and Copyright

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 154

Packages 0

Used by 34.6k

Contributors 70

Languages

Packages