Camerash/mahjong-dataset

Mahjong Dataset

Computer Vision Dataset for Chinese Mahjong Tiles

Structure

  • Raw, unsliced images are located under ./raw-images
  • Unscaled tile images are located under ./raw-tiles
  • Scaled, ready-to-use images are located under ./tiles-resized
  • The CSV file containing the tagged image labels, and the CSV template file, are located under ./tiles-data
  • Raw, unsliced images for untagged tiles are located under ./untagged-images-raw
  • Untagged tile images are located under ./untagged-tiles
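
As a quick sanity check after cloning, the layout above can be verified with a few lines of Python. This is a minimal sketch: the directory names are taken from the list above, while the helper name `missing_dirs` is hypothetical and not part of this repo.

```python
from pathlib import Path

# Directory names as listed in this README.
EXPECTED_DIRS = [
    "raw-images",
    "raw-tiles",
    "tiles-resized",
    "tiles-data",
    "untagged-images-raw",
    "untagged-tiles",
]


def missing_dirs(repo_root):
    """Return the expected directories that are absent under repo_root.

    Hypothetical helper for illustration; it only checks that each
    directory exists, not what it contains.
    """
    root = Path(repo_root)
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]
```

Calling `missing_dirs(".")` from the root of a complete checkout should return an empty list.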

Data

  • Images are mostly scraped from Google image search, eBay and Alibaba
  • Scaled images are 240 (W) x 320 (H) in size and use the .jpg format

A ready-to-use data bundle, train.zip, is located at the root of the repo. It contains:

  • images folder for mahjong tile images
  • data.csv for image labels
  • Tagged labels are stored in the CSV file in the following format:
image-name label label-name
1.jpg 38 bonus-winter
... ... ...

label is the index of the image's class; see the classes below.
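
The label file can be read with Python's standard csv module. The sketch below assumes that data.csv is comma-separated and starts with a header row named exactly image-name,label,label-name; the README does not state the delimiter, so check the file before relying on this. The function name `parse_labels` is hypothetical.

```python
import csv
import io


def parse_labels(csv_text):
    """Map each image file name to a (label index, label name) pair.

    Assumes comma-separated text with the header row
    image-name,label,label-name (an assumption, not confirmed by the repo).
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    return {
        row["image-name"]: (int(row["label"]), row["label-name"])
        for row in reader
    }


# Sample text mirroring the example row shown above.
sample = "image-name,label,label-name\n1.jpg,38,bonus-winter\n"
labels = parse_labels(sample)
print(labels["1.jpg"])  # (38, 'bonus-winter')
```

To use it on the real file, pass in the file's contents, for example `parse_labels(open("data.csv", newline="").read())`.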

Classes

label-name label
dots-1 1
dots-2 2
dots-3 3
dots-4 4
dots-5 5
dots-6 6
dots-7 7
dots-8 8
dots-9 9
bamboo-1 10
bamboo-2 11
bamboo-3 12
bamboo-4 13
bamboo-5 14
bamboo-6 15
bamboo-7 16
bamboo-8 17
bamboo-9 18
characters-1 19
characters-2 20
characters-3 21
characters-4 22
characters-5 23
characters-6 24
characters-7 25
characters-8 26
characters-9 27
honors-east 28
honors-south 29
honors-west 30
honors-north 31
honors-red 32
honors-green 33
honors-white 34
bonus-spring 35
bonus-summer 36
bonus-autumn 37
bonus-winter 38
bonus-plum 39
bonus-orchid 40
bonus-chrysanthemum 41
bonus-bamboo 42
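
Because the class indices follow a regular order, the table above can be rebuilt in code instead of typed out by hand. The following is a minimal sketch; the names `CLASS_NAMES`, `label_name` and `label_index` are hypothetical helpers, not something shipped with the dataset.

```python
SUITS = ["dots", "bamboo", "characters"]
HONORS = ["east", "south", "west", "north", "red", "green", "white"]
BONUS = ["spring", "summer", "autumn", "winter",
         "plum", "orchid", "chrysanthemum", "bamboo"]

# Ordered so that list position + 1 equals the label in the table above.
CLASS_NAMES = (
    [f"{suit}-{n}" for suit in SUITS for n in range(1, 10)]
    + [f"honors-{h}" for h in HONORS]
    + [f"bonus-{b}" for b in BONUS]
)


def label_name(index):
    """Return the class name for a 1-based label index."""
    if not 1 <= index <= len(CLASS_NAMES):
        raise ValueError(f"label index out of range: {index}")
    return CLASS_NAMES[index - 1]


def label_index(name):
    """Return the 1-based label index for a class name."""
    return CLASS_NAMES.index(name) + 1


print(label_name(38))              # bonus-winter
print(label_index("honors-east"))  # 28
```

Note that the labels start at 1. Training code that expects 0-based class ids would need to subtract 1 from each label first.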

Refer to the classes of mahjong tiles here

Feel free to contribute to this repo with your own data.

License

Open-sourced under the MIT License