GitHub - yichen0831/opencc-python: OpenCC made with Python (original) (raw)

開放中文轉換(Pure Python)

Open Chinese convert (OpenCC) in pure Python.

Introduction 簡介

opencc-python 是用純 Python 所寫,使用由 BYVoid(byvoid.kcp@gmail.com) 所開發的 OpenCC 中的字典檔案。 opencc-python 可以支援 Python2.7 及 Python3.x。

opencc-python is made by pure Python with the dictionary files of OpenCC which is developed by BYVoid(byvoid.kcp@gmail.com).

opencc-python can run with Python2.7 and Python3.x.

Installation 安裝

opencc 這個目錄複製到你正在開發的專案中即可,或是執行(需要管理者權限):

套件也可從 PyPI 安裝,使用指令:

pip install opencc-python-reimplemented

Copy the opencc folder to your project, or run (admin required)

The package can also be installed from PyPI by issuing:

pip install opencc-python-reimplemented

Usage 使用方式

Code

from opencc import OpenCC cc = OpenCC('s2t') # convert from Simplified Chinese to Traditional Chinese

can also set conversion by calling set_conversion

cc.set_conversion('s2tw')

to_convert = '开放中文转换' converted = cc.convert(to_convert)

Command Line

usage: python -m opencc [-h] [-i ] [-o ] [-c ] [--in-enc ] [--out-enc ]

optional arguments: -h, --help show this help message and exit -i , --input Read original text from . (default: None = STDIN) -o , --output Write converted text to . (default: None = STDOUT) -c , --config Conversion (default: None) --in-enc Encoding for input (default: UTF-8) --out-enc Encoding for output (default: UTF-8)

example with UTF-8 encoded file:

python -m opencc -c s2t -i my_simplified_input_file.txt -o my_traditional_output_file.txt

See https://docs.python.org/3/library/codecs.html#standard-encodings for list of encodings.

Conversions 轉換

Issues 問題

當轉換有兩個以上的字詞可能時,程式只會使用第一個。

When there is more than one conversion available, only the first one is taken.