39,993 Images – OCR Data of Internet Image. The collecting scenes of this dataset include subtitle, advertisement, cellphone screenshot, comic, emoticon, poster, magazine cover, etc. The language distribution is Chinese, English (a few). For annotation, line-level rectangular bounding box annotation and transcription for the texts were adopted for the internet images (column-level quadrilateral bounding box annotation and transcription for the texts were adopted for small amount of data). The dataset can be used for OCR tasks of internet images.
For more details, please refer to the link: https://www.nexdata.ai/datasets/ocr/171?source=Github
39,993 images, 227,910 bounding boxes
including subtitle, advertisement, cellphone screenshot, comic, emoticon, poster, magazine cover etc.
including multiple types of internet images
Chinese, English (a few)
the image data format is .jpg, the annotation file format is .json
line-level rectangular bounding box annotation and transcription for the texts (column-level quadrilateral bounding box annotation and transcription for the texts were adopted for small amount of data)
the error bound of each vertex of a rectangular bounding box is within 5 pixels, which is a qualified annotation, the accuracy of bounding boxes is not less than 97%; the texts transcription accuracy is not less than 97%
Commercial License