Skip to content

Chinese text normalization. 中文文本规范化。

Notifications You must be signed in to change notification settings

Darius-H/chn_text_norm

 
 

Repository files navigation

Chn Text Norm

this is a repository for chinese text normalization (no longer maintained).

Quick Start

Git Clone Repo

git clone this repo to the root directory of your project which need to use it. pip install opencc-python-reimplemented cd /path/to/proj git clone https://github.com/Joee1995/chn-text-norm.git

after that, your doc tree should be:

proj                     # root of your project
|--- chn_text_norm       # this chn-text-norm tool
     |--- text.py
     |--- ...
|--- text_normalize.py   # your text normalization code
|--- ...

How to Use ?

# text_normalize.py
from chn_text_norm.text import *

raw_text = 'your raw text'
normalizer = Text()
text = normalizer(raw_text=raw_text)

About

Chinese text normalization. 中文文本规范化。

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%