Skip to content

Commit

Permalink
Some ocr and readings fixes, restore license, add vocab files for res…
Browse files Browse the repository at this point in the history
…t of brandt
  • Loading branch information
justinsilvestre committed Oct 30, 2024
1 parent cf22de9 commit 53481d7
Show file tree
Hide file tree
Showing 14 changed files with 117 additions and 37 deletions.
12 changes: 6 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -27,17 +27,17 @@ The current focus is to transcribe + format the content of the 1927 textbook _In

The biggest challenge at the moment is transcribing the portions in mixed Chinese/Latin script. OCR tools can automate some of the process, but not all of it. If you have time, please consider helping out by transcribing the remaining "Vocabulary" and "Notes" chapters listed [here](./docs/brandt.md).

## texts   [![CC BY-NC-SA 4.0][cc-by-sa-shield]][cc-by-sa]
## texts   [![CC BY-NC-SA 4.0][cc-by-nc-sa-shield]][cc-by-nc-sa]

The following license information applies to the texts in the [texts](./texts) folder.

[Creative Commons Attribution-ShareAlike 4.0 International License][cc-by-sa].
[Creative Commons Non-Commercial Attribution-ShareAlike 4.0 International License][cc-by-nc-sa].

[![CC BY-NC-SA 4.0][cc-by-sa-image]][cc-by-sa]
[![CC BY-NC-SA 4.0][cc-by-nc-sa-image]][cc-by-nc-sa]

[cc-by-sa]: http://creativecommons.org/licenses/by-sa/4.0/
[cc-by-sa-image]: https://licensebuttons.net/l/by-sa/4.0/88x31.png
[cc-by-sa-shield]: https://img.shields.io/badge/License-CC%20BY--NC--SA%204.0-lightgrey.svg
[cc-by-nc-sa]: http://creativecommons.org/licenses/by-nc-sa/4.0/
[cc-by-nc-sa-image]: https://licensebuttons.net/l/by-nc-sa/4.0/88x31.png
[cc-by-nc-sa-shield]: https://img.shields.io/badge/License-CC%20BY--NC--SA%204.0-lightgrey.svg

## development

Expand Down
50 changes: 25 additions & 25 deletions docs/brandt.md
Original file line number Diff line number Diff line change
Expand Up @@ -1881,7 +1881,7 @@ total progress: 535 / 1384 tasks complete (~38%)
- [ ] proofread
- format content
- [x] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [ ] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- [x] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- add content
- [ ] check/proofread/fill in missing Mandarin pinyin readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Vietnamese readings in `.vocab.tsv`
Expand All @@ -1890,99 +1890,99 @@ total progress: 535 / 1384 tasks complete (~38%)
- [ ] write Chinese -> English gloss
- Lesson 37, Text 2
- transcribe content
- [ ] use OCR on text
- [x] use OCR on text
- [ ] proofread OCR results
- [ ] transcribe English definitions in Vocabulary section
- [ ] transcribe Notes section
- [ ] proofread
- format content
- [ ] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [ ] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- [x] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [x] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- add content
- [ ] check/proofread/fill in missing Mandarin pinyin readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Vietnamese readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Cantonese readings in `.vocab.tsv`
- [ ] align transcribed Chinese + English sentences
- [x] align transcribed Chinese + English sentences
- [ ] write Chinese -> English gloss
- Lesson 38, Text 2
- transcribe content
- [ ] use OCR on text
- [x] use OCR on text
- [ ] proofread OCR results
- [ ] transcribe English definitions in Vocabulary section
- [ ] transcribe Notes section
- [ ] proofread
- format content
- [ ] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [ ] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- [x] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [x] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- add content
- [ ] check/proofread/fill in missing Mandarin pinyin readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Vietnamese readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Cantonese readings in `.vocab.tsv`
- [ ] align transcribed Chinese + English sentences
- [x] align transcribed Chinese + English sentences
- [ ] write Chinese -> English gloss
- Lesson 39, Text 2
- transcribe content
- [ ] use OCR on text
- [x] use OCR on text
- [ ] proofread OCR results
- [ ] transcribe English definitions in Vocabulary section
- [ ] transcribe Notes section
- [ ] proofread
- format content
- [ ] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [ ] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- [x] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [x] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- add content
- [ ] check/proofread/fill in missing Mandarin pinyin readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Vietnamese readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Cantonese readings in `.vocab.tsv`
- [ ] align transcribed Chinese + English sentences
- [x] align transcribed Chinese + English sentences
- [ ] write Chinese -> English gloss
- Lesson 40, Text 1
- transcribe content
- [ ] use OCR on text
- [x] use OCR on text
- [ ] proofread OCR results
- [ ] transcribe English definitions in Vocabulary section
- [ ] transcribe Notes section
- [ ] proofread
- format content
- [ ] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [ ] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- [x] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [x] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- add content
- [ ] check/proofread/fill in missing Mandarin pinyin readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Vietnamese readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Cantonese readings in `.vocab.tsv`
- [ ] align transcribed Chinese + English sentences
- [x] align transcribed Chinese + English sentences
- [ ] write Chinese -> English gloss
- Lesson 40, Text 2
- transcribe content
- [ ] use OCR on text
- [x] use OCR on text
- [ ] proofread OCR results
- [ ] transcribe English definitions in Vocabulary section
- [ ] transcribe Notes section
- [ ] proofread
- format content
- [ ] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [ ] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- [x] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [x] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- add content
- [ ] check/proofread/fill in missing Mandarin pinyin readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Vietnamese readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Cantonese readings in `.vocab.tsv`
- [ ] align transcribed Chinese + English sentences
- [x] align transcribed Chinese + English sentences
- [ ] write Chinese -> English gloss
- Lesson 40, Text 3
- transcribe content
- [ ] use OCR on text
- [x] use OCR on text
- [ ] proofread OCR results
- [ ] transcribe English definitions in Vocabulary section
- [ ] transcribe Notes section
- [ ] proofread
- format content
- [ ] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [ ] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- [x] create `.passage.md` according to established format ([example](../texts/brandt-ch01-1.passage.md))
- [x] create `.vocab.tsv` according to established format ([example](../texts/brandt-ch01-1.vocab.tsv))
- add content
- [ ] check/proofread/fill in missing Mandarin pinyin readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Vietnamese readings in `.vocab.tsv`
- [ ] check/proofread/fill in missing Cantonese readings in `.vocab.tsv`
- [ ] align transcribed Chinese + English sentences
- [x] align transcribed Chinese + English sentences
- [ ] write Chinese -> English gloss

<!-- ([①-⑳㉑-㊷]) -->
2 changes: 1 addition & 1 deletion texts/LICENSE
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
The following license applies to the files in this directory.

https://creativecommons.org/licenses/by-sa/4.0/
https://creativecommons.org/licenses/by-nc-sa/4.0/
2 changes: 1 addition & 1 deletion texts/brandt-ch04-3.vocab.tsv
Original file line number Diff line number Diff line change
Expand Up @@ -29,7 +29,7 @@ Traditional Qieyun Hanyu Pinyin Jyutping Korean Vietnamese English
明三B宵平 miáo miu⁴ meo sprouts; shoots
知三尤去 zhòu zau³ daylight; daytime
端開一咍上?, 端開一登上? děng dang² đẳng a class; a sort; equal; equally; a sign of the plural
澄開三陽平?, 知開三陽上?, 澄開三陽去? zhǎng coeng⁴ trường long
澄開三陽平?, 知開三陽上?, 澄開三陽去? cháng coeng⁴ trường long
並四青上 bìng bing⁶ two together; united; all; equally; also; really
澄三鍾平?, 澄三鍾上?, 澄三鍾去? zhòng cung⁵ trọng heavy; important; severe
影開一寒去 àn on³ duyên?, án? a table. A case at law
Expand Down
2 changes: 1 addition & 1 deletion texts/brandt-ch37-2.passage.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@

探問友人疾病函

某某仁兄大人閣下運啟者。吾兄貴體違和殊深。惦念伏思尊軀素健。今偶失檢點。乃爲二豎所侵。惟期安心靜養。定占勿藥之喜。達人自玉。皇閣下勿稍介意。未悉請何醫士診治。弟稍暇卽冨趨府看望。特此致候。順頌痊安。
某某仁兄大人閣下運啟者。吾兄貴體違和殊深。惦念伏思尊軀素健。今偶失檢點。乃爲二豎所侵。惟期安心靜養。定占勿藥之喜。達人自玉。皇閣下勿稍介意。未悉請何醫士診治。弟稍暇卽當趨府看望。特此致候。順頌痊安。

弟某某鞠躬月日

Expand Down
7 changes: 7 additions & 0 deletions texts/brandt-ch37-2.vocab.tsv
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
Traditional Qieyun Hanyu Pinyin Jyutping Korean Vietnamese English
diàn dim³ điếm
溪三虞平 keoi¹ xo
羣開三元去 jiàn gin⁶ kiện
常三虞上 shù syu⁶
以開三陽入 yào joek⁶ dược
見開二皆去 jiè gaai³ giới
5 changes: 5 additions & 0 deletions texts/brandt-ch38-1.vocab.tsv
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
Traditional Qieyun Hanyu Pinyin Jyutping Korean Vietnamese English
章開三之去?, 章開三蒸入? zhī zik¹ chức
船合三眞入 shù seot⁶ thuật
並三A眞平 pín pan⁴
幫三陽上 fǎng fong² phỏng
9 changes: 9 additions & 0 deletions texts/brandt-ch38-2.vocab.tsv
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
Traditional Qieyun Hanyu Pinyin Jyutping Korean Vietnamese English
清開四青平 qīng cing¹ thanh
竿 見開一寒平 gān gon¹ cần
影開三蒸入 jik¹
並三元入 fat⁶
見開二山上 jiǎn gaan² giản
滂三鍾平 fēng fung¹
匣一東平?, 匣一東上? hóng hung⁴ hồng
日開三魚平?, 日開三魚上?, 日開三魚去? jyu⁴ nhà
4 changes: 2 additions & 2 deletions texts/brandt-ch39-1.passage.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@ President Ts'ao-Ku'n's Telegram of Resignation
公鑒。
To the Peking Cabinet of the 10th month, 13th year of the Republic, to the Senate and the House of Representatives, to high military and civil authorities of all provinces and special administrative areas, to all provincial assemblies, to all legal organizations and all newspapers for information of all citizens:

錕泰膺重托。德薄能鲜。致令部曲橋貳紀綱失墜。
錕泰膺重托。德薄能鮮。致令部曲橋貳紀綱失墜。
I, K'un, was entrusted with the heavy burden (of the presidency). My virtue and ability however were so poor that a conflict among my followers broke out and all laws became ineffective (lit. fell down).

十三年十月二十三日。馮玉祥倒戈。錕受閉錮。
Expand All @@ -36,7 +36,7 @@ I earnestly hope that all my former colleagues will do their utmost to bring abo
錕優遊林下。獲睹承平。欣幸曷極。
And in the quietness and freedom of my private life I will be able to witness peaceful times which will be for me the highest happiness.

特電佈達。願共察之。曹锟
特電佈達。願共察之。曹錕
I specially send forth this telegram for general information. Ts'ao-Ku'n.

---
20 changes: 20 additions & 0 deletions texts/brandt-ch39-1.vocab.tsv
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
Traditional Qieyun Hanyu Pinyin Jyutping Korean Vietnamese English
從開一豪平 cáo cou⁴ tào
見合一魂平?, 見合一魂上? kūn kwan¹
溪三虞平?, 影開一侯平? keoi¹ khu
tuō tok³ thác?, thách?, thốc?, thước?, thướt?
羣開三B宵平 qiáo kiu⁴ kiều
日開三脂去 èr ji⁶ nhị
見開三之上 gei² kỉ
見開一唐平 gāng gong¹ cương
幫四齊去?, 幫四先入? bai³ bế
見一模去 gu³
見三尤去 jiù gau³ nhíu
透開一豪上 tǎo tou² thảo
曉開一談平?, 匣開一談去? hān ham¹ hám
疑開三魚去 jyu⁶ ngựa
生開三支上?, 生開三支去? saai²
並一豪平?, 並一豪去? páo pou⁴ bào
邪合三眞平 xún ceon⁴
見合三B脂上 guǐ gwai² quẫy
端一模上 dou² đủ
12 changes: 12 additions & 0 deletions texts/brandt-ch39-2.vocab.tsv
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
Traditional Qieyun Hanyu Pinyin Jyutping Korean Vietnamese English
滂三虞去 fu⁶ phó
影開三B侵平 yīn jam¹ âm
來開四青平 líng ling⁴ 령?, 영? liếng
影一東平 wēng jung¹ ông
從合三仙平 quán cyun⁴ tuyền
常合三眞平?, 章合三眞上? chún seon⁴ 순?, 준? thuần
滂三虞上 fu² vỗ
來開四青平 líng ling⁴ 령?, 영? linh
端開四蕭去?, 端開四青入? diào diu³ điếu
疑開三B仙去 yàn jin⁶ ngon
初三虞平 chú co¹ so
14 changes: 14 additions & 0 deletions texts/brandt-ch40-1.vocab.tsv
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
Traditional Qieyun Hanyu Pinyin Jyutping Korean Vietnamese English
見開三蒸平 jīng ging¹ cạnh
定開一豪去 dǎo dou⁶ đạo
定開四先入 dié dit⁶ dật
幫一唐去 bàng pong³ báng
昌開三清入 chì cek³ xích
來開三眞去 lín leon⁴ lận
匣開一寒去 hàn hon⁶
心一模去 sou³
生合三脂去?, 生合三眞入? shuài seoi³ 솔?, 수? soái
知開三清平 zhēn zing¹ trinh
莊開二咸上 zhǎn zaam² trảm
清開三鹽平 qiān cim¹
幫三A眞平 bīn ban¹
2 changes: 1 addition & 1 deletion texts/brandt-ch40-2.passage.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@

慰友人喪母函

某某仁兄大人苫次頃奉訃聞。驚知老伯母大人於某月某日駕返瑤池。驚閱之下。悼働莫名。伏維 伯母大人。閫範永垂。母儀足式。今者星墜女嫌。對萱堂而雨泣。峯願天姥。感樹木之風悲雖歸真於天上。無遺憾於人間。尙望兄台勉釋軫懷。是爲至禱期屆駕輛。自應前往執紼。謹具奠儀。尙所代薦靈凡之右。此泐。順候孝履。
某某仁兄大人苫次頃奉訃聞。驚知老伯母大人於某月某日駕返瑤池。驚閱之下。悼慟莫名。伏維 伯母大人。閫範永垂。母儀足式。今者星墜女嫌。對萱堂而雨泣。峯願天姥。感樹木之風悲雖歸真於天上。無遺憾於人間。尙望兄台勉釋軫懷。是爲至禱期屆駕輀。自應前往執紼。謹具奠儀。尙所代薦靈凡之右。此泐。順候孝履。

弟某某鞠躬月日

Expand Down
13 changes: 13 additions & 0 deletions texts/brandt-ch40-2.vocab.tsv
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
Traditional Qieyun Hanyu Pinyin Jyutping Korean Vietnamese English
書開三鹽平?, 書開三鹽去? shān sim¹ chôm
定開一豪去 dào dou⁶ điệu
定一東去 tòng dung⁶ đỏng
溪合一魂上 kǔn kwan²
曉合三元平 xuān hyun¹
明一模上 lǎo mou⁵ mụ
匣開一覃去 hàn ham⁶ hám
章開三眞上 zhěn zan²
日開三之平 ér ji⁴
幫三文入 fat¹
定開四先去?, 端開四青去? diàn din⁶
精開四先去 jiàn zin³

0 comments on commit 53481d7

Please sign in to comment.