Skip to content
This repository has been archived by the owner on Sep 25, 2024. It is now read-only.

rossumai/flying-rectangles

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

flying-rectangles

A repository with anonymized invoices. Total number of documents is 3741.

The format is a json with one big list. In the list, there are tuples, in the format:
(#document_id, (boundingbox cordinates left, top, right, bottom), page number of the document, field type, (anonymized text features - 17 floats OR None) )

Field type is either 'text', that is a word from the original pdf file, in which case the features are 17 floats. Or something else, which means it is an annotation and has None in place of text features.

The original invoices do look like the provided sample:
Sample invoice
The red underlined boxes are examples of some word's boundingboxes (not all boundingboxes are shown for simplicity reasons).

About

A repository with anonymized invoices

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published