Skip to content
Valerii Zuev edited this page Feb 11, 2021 · 11 revisions

Braille OCR

Problem

Angelina Dataset (see "links" section) consists of 240 labeled images with Braille text, divided into a train set (212 pictures) and validation set (28 pictures). Braille text is written in a 6-dot code, each letter is formed by 1 to 6 raized dots located in a 2*3 cell. An image may contain recto as well as verso dots; only recto dots should be detected.

Double-sided Braille book Single-sided Braille writing
Double-sided Braille book Single-sided Braille writing

The primary goal is to build a detector that recognizes Braille characters on validation images with high accuracy (in accordance with the IoU loss), presumably using the training dataset.

The next goal, which I will aim only if I succeed in the first one, is to add more images to the Angelina Dataset which contain Braille characters not only from books and Braille paper sheets, but also those from Braille plates and other devices such as Braille Contraction cards used by students.

Links

Papers

Braille OCR

Miscellaneous

Clone this wiki locally