-
Notifications
You must be signed in to change notification settings - Fork 501
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
幫助更新 simp.txt 與 trad.txt #55
Comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
NLPchina/nlp-lang/src/main/resources/simp.txt 與 NLPchina/nlp-lang/src/main/resources/trad.txt 自 2015年7月之後未再更新。
我發現了一些問題. 例如,藴 (以下註為 T2) 是 蘊 (以下註為 T1) 的異體字。而 蕴 (以下註為 S) 是 蘊 (T1) 及 藴 (T2) 的簡體字. 請參見 http://www.cojak.org/index.php?function=code_lookup&term=8574
因此, 基於為了符合上述的異體與簡繁體關係
14197 蘊 (T1) 14197 蕴 (S)
14325 藴 (T2) 14325 藴 (T2)
14347 蘊 (T1) 14347 蕴 (S)
應修正為
14197 蘊 (T1) 14197 蕴 (S)
14325 藴 (T2) 14325 蕴 (S)
14347 蘊 (T1) 14347 藴 (T2)
此問題已在 Unihan 15.1.0 > Unihan_Variants.txt 中更正。 除此之外,我還發現 Unihan 15.0.0 > Unihan_Variants.txt 和 Unihan 15.1.0 > Unihan_Variants.txt 之間還有的其他修正需要合併到 simp.txt 和 trad.txt 裏。
您是否同意讓我根據 Unihan 15.1.0 > Unihan_Variants.txt 來更新 trad.txt 和 simp.txt 呢?
如同意,煩請告知 trad.txt 和 simp.txt 的編纂規則。 多謝。
Raymond Jou 周永瑞
FamilySearch International
[email protected]
(801)240-3871 office 辦公室
(408)568-8989 mobile & text 手機 及 簡訊
The text was updated successfully, but these errors were encountered: