Skip to content

Latest commit

 

History

History
29 lines (20 loc) · 1004 Bytes

README.md

File metadata and controls

29 lines (20 loc) · 1004 Bytes

deepOCR

This is a school project in deep learning I am currently working on.

It consists in building a handwritten text recognition system using a CNC-LSTM-CTC architecture.

I have planned to use a language model later to analyse recognized words and improve the accuracy.

This article was really helpful to understand the concept of Convolutional Recurrent Neural Network (CRNN).

Data

I am using the IAM Dataset which includes about 115,000 labelled images of English words from more than 1500 handwritten letters.

Use

You have to register to download the dataset. Once it is done, unzip it and place the 'words' directory and 'words.txt' file in the project repository as following :

deepOCR repository
├── data
│   ├── words
│   │   ├── a01
│   │   ├── a02
│   │   ├── ...
│   ├── words.txt
├── src
├── ...