Skip to content

Latest commit

 

History

History
62 lines (49 loc) · 2.25 KB

README.md

File metadata and controls

62 lines (49 loc) · 2.25 KB

Pykakasi library

travis-ci PyPI version

pykakasi is re-imprelemtation of kakasi library by Python.

How To Use pykakasi

    from pykakasi import kakasi,wakati

    kakasi = kakasi()
    kakasi.setMode("H","a") # default: Hiragana no conversion
    kakasi.setMode("K","a") # default: Katakana no conversion
    kakasi.setMode("J","a") # default: Japanese no conversion
    kakasi.setMode("r","Hepburn") # default: use Hepburn Roman table
    kakasi.setMode("C", true) # add space default: no Separator
    kakasi.setMode("c", false) # capitalize default: no Capitalize
    conv = kakasi.getConverter()
    result = conv.do(text)

    wakati = wakati()
    conv = wakati.getConverter()
    result = conv.do(text)

You can use output Mode values from "h", "k", "a" which is each means "Hiragana", "Katakana" and "Alphabet". For input, you can use "J" that means "Japanese" that is mixture of Kanji, Katakana and Hiragana. Also there is values of "H", "K" that means "Hiragana", and "Katakana". You can use "Hepburn", "Kunrei" or "Passport" as mode "r", Roman table switch. Also "s" used for separator switch, "C" for capitalize switch. "S" for separator string option.

wakati is an implementation of kakasi's wakati gaki option.

Options

These switch alphabets are derived from original Kakasi. Now it support following options:

Option Description Values Note
K Katakana convertion a,H,None roman or noconversion
H Hiragana convertion a,K,None roman or noconversion
J Kanji conversion a,H,K,None roman or Hiragana, Katakana
E Eigou convesion a,None roman or noconversion

Each character means character sets as follows:

  Character Sets:
       a: ascii  j: jisroman  g: graphic  k: kana 
       (j,k     defined in jisx0201)
       E: kigou  K: katakana  H: hiragana J: kanji
       (E,K,H,J defined in jisx0208)