Install dependencies
sudo apt install unrar atool p7zip-full
Install current version:
pip install --user --upgrade git+https://github.com/anguelos/dagtasets
A collection of datasets suited for documents and text image analysis.
A python module for generating document datasets for training models.
General utilities related to language models and training
- Diamond Square algorithm used in DIBCO data augmentation taken from [https://github.com/buckinha/DiamondSquare]