We have several introductory tutorials to help get you started with using Fonduer.
In this tutorial, we build a Fonduer application to tackle the task of extracting maximum storage temperatures for specific transistor part numbers from their datasheets.
In this tutorial, we build a Fonduer application to tackle the task of
extracting images for specific transistors from their datasheets. This
demonstrates how Fonduer
can be used for image data in additional to text.
We use a few applications that you'll need to install and be sure are on your PATH.
For OS X using homebrew:
brew install poppler
brew install postgresql
brew install libpng freetype pkg-config
On Debian-based distros:
sudo apt install libxml2-dev libxslt-dev python3-dev build-dep python-matplotlib
sudo apt install poppler-utils
sudo apt install postgresql
For the Python dependencies, we recommend using a virtualenv. Once you have cloned the repository, change directories to the root of the repository and run
virtualenv -p python3 .venv
Once the virtual environment is created, activate it by running
source .venv/bin/activate
Any Python libraries installed will now be contained within this virtual
environment. To deactivate the environment, simply run deactivate
.
Then, install Fonduer and any other python dependencies by running:
pip install -r requirements.txt
After installing all the requirements, just run:
jupyter notebook