#Readme 說明（內有安裝步驟）

Introduction

Auto convert locae Chinese vocabulary program.

中文詞彙自動轉檔程式

目前只做簡轉繁，其他需求請自行修改。所有的轉出檔一律採用UTF-8編碼，並加上BOM。

支援使用者自訂字典

使用說明: g2butf8 [檔名]

usage: g2butf8.py [-h] [-r] [-nb] [-nobom] [-x extension [extension ...]] [-t type] [-u userdic] [-nu] files [files ...]

positional arguments: files 會自動偵測編碼，再轉換成有BOM的UTF-8

optional arguments: -h, --help show this help message and exit -r, --recursive 包含子目錄(預設不包括) -nb, --nobackup 不要產生.bak備份檔 (預設有) -nobom, --nobom 不要產生BOM標題 (預設有) -x extension [extension ...] 副檔名, (預設為所有檔案) -t type, --type type 轉換方式: g2b 簡轉繁 g2bdic 簡轉繁再加上詞彙轉換 -u userdic, --userdic userdic 使用者字典檔名，預設使用 userdic.txt -nu, --nouserdic 不使用自訂字典檔 (預設有，使用userdic.txt) 若有特別的轉換需求，可以在要轉檔的目錄下放 userdic.txt ，範例內容:

頭發=頭髮
內存=記憶體

使用方法

Windows:

g2butf8 c:\城市獵人\*.srt

Mac、Linux:

python g2butf8.py ~/城市獵人/*.srt

Details　細節

原本打算用C語言，因為某些庫編譯太麻煩，相依性過高，要跨平台太麻煩；因此改用Python實作。早期因函式庫限制採用Pyhon 2，現在改用Python 3。 Jianfan沒有再更新，計畫改用 hanziconv 很可惜 hanziconv 有過度換詞的問題。

2024年改用 charset-normalizer 與開放中文轉換 opencc-python-reimplemented

~~Universal Encoding Detector 偶有偵測編碼錯誤的情況，經大量測試大多數情況都不會有問題，大量轉碼時請注意輸出的原始編碼。~~

由於現在大多數人的檔案都已採用 UTF-8編碼，故無大量測試charset-normalizer機會。

Install 安裝

Linux 或 Mac :

Linux應該都內建Python 3，若沒有請自行安裝。舊版本要下python3 ，新版直接執行 python
安裝 charset-normalizer 及 opencc-python-reimplemented

> sudo pip install -r requirements.txt 
#或是
> sudo pip install -U charset-normalizer opencc-python-reimplemented

Windows:

安裝Python for Windows ，請選擇 Python 3.9之後版本
安裝 Universal Encoding Detector 與 Jianfan ，因 Jianfan被從pypi移除，只能手動安裝

> sudo pip install -r requirements.txt 
#或是
> sudo pip install -U charset-normalizer opencc-python-reimplemented

Reference 參考資料及函式庫

Python简繁转换

Jianfan

Universal Encoding Detector

開放中文轉換

Unicode In Python, Completely Demystified

Charset Detection, for Everyone 👋 charset-normalizer

開放中文轉換 opencc-python-reimplemented

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Introduction

Install 安裝

Linux 或 Mac :

Windows:

Files

README.md

Latest commit

History

README.md

File metadata and controls

Introduction

Install 安裝

Linux 或 Mac :

Windows: