Skip to content

Commit

Permalink
Merge remote-tracking branch 'origin/develop'
Browse files Browse the repository at this point in the history
  • Loading branch information
mozillazg committed Sep 19, 2021
2 parents 1b24e4e + 5255d7e commit 5dd1c52
Show file tree
Hide file tree
Showing 13 changed files with 6,379 additions and 22 deletions.
1 change: 1 addition & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -60,3 +60,4 @@ target/

#Ipython Notebook
.ipynb_checkpoints
/tools/cc_cedict.py
6 changes: 6 additions & 0 deletions .gitmodules
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
[submodule "tools/phrase-pinyin-data"]
path = tools/phrase-pinyin-data
url = https://github.com/mozillazg/phrase-pinyin-data.git
[submodule "tools/python-pinyin"]
path = tools/python-pinyin
url = https://github.com/mozillazg/python-pinyin.git
7 changes: 7 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,12 @@
# ChangeLog


## [0.11.0] (2021-09-19)

* 新增 `cc_cedict.txt`: 根据 [cc-cedict.org](https://cc-cedict.org/) 的词语拼音数据生成的单个汉字拼音数据
* 更新 Unihan 数据版本为 14.0.0


## [0.10.2] (2021-03-13)

* 修改 `` 的最常用读音为 `zhēn`
Expand Down
7 changes: 7 additions & 0 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -15,3 +15,10 @@ pua:
.PHONY: check
check:
-rg 'ɡ|ɑ|í|è'

.PHONY: cc_cedict
cc_cedict:
cd tools && \
git submodule update && \
python python-pinyin/gen_phrases_dict.py phrase-pinyin-data/cc_cedict.txt cc_cedict.py && \
python gen_cc_cedict.py > ../cc_cedict.txt
13 changes: 8 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -16,11 +16,7 @@
# 注释
U+4E2D: zhōng,zhòng # 中


[Unihan Database][unihan] 数据版本:

> Date: 2020-02-18 18:27:33 GMT [JHJ]
> Unicode version: 13.0.0
拼音文件介绍:

* `kTGHZ2013.txt`: [Unihan Database][unihan][kTGHZ2013](http://www.unicode.org/reports/tr38/#kTGHZ2013) 部分的拼音数据(来源于《通用规范汉字字典》的拼音数据)
* `kHanyuPinyin.txt`: [Unihan Database][unihan][kHanyuPinyin](http://www.unicode.org/reports/tr38/#kHanyuPinyin) 部分的拼音数据(来源于《漢語大字典》的拼音数据)
Expand All @@ -35,6 +31,13 @@
* `overwrite.txt`: 手工纠正的拼音数据(**可以修改**
* `pinyin.txt`: 合并上述文件后的拼音数据
* `zdic.txt`: [汉典网](http://zdic.net) 的拼音数据(**可以修改**
* `cc_cedict.txt`: 根据 [cc-cedict.org](https://cc-cedict.org/) 的词语拼音数据生成的单个汉字拼音数据


[Unihan Database][unihan] 数据版本:

> Date: 2021-08-06 16:32:36 GMT [JHJ]
> Unicode version: 14.0.0

## 修改数据
Expand Down
Loading

0 comments on commit 5dd1c52

Please sign in to comment.