Python-zhuyin - An open source Python library that provides a unified interface for converting between Chinese pinyin and Zhuyin (bopomofo)

Last update: Dec 29, 2022

Overview

Python-Zhuyin (pyzhuyin): Zhuyin and pinyin conversion

Introduction 介紹

pyzhuyin is an open source Python library that provides a unified interface for converting between Chinese pinyin and Zhuyin (bopomofo), in both directions.

Installation 安裝

pip install pyzhuyin

Usage 使用

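from pyzhuyin import pinyin_to_zhuyin, zhuyin_to_pinyin

# Pinyin (numeric tones) to zhuyin
assert pinyin_to_zhuyin("lu3") == "ㄌㄨˇ"
assert pinyin_to_zhuyin("dan4") == "ㄉㄢˋ"
# map() returns an iterator in Python 3, so wrap it in list() before comparing
assert list(map(pinyin_to_zhuyin, ["lu3", "dan4"])) == ["ㄌㄨˇ", "ㄉㄢˋ"]

# Zhuyin to pinyin; the neutral tone comes back as 5
assert zhuyin_to_pinyin("ㄌㄩˊ") == "lü2"
assert zhuyin_to_pinyin("˙ㄗ") == "zi5"
# u_to_v=True writes ü as v
assert [zhuyin_to_pinyin(z, u_to_v=True) for z in ["ㄌㄩˊ", "˙ㄗ"]] == ["lv2", "zi5"]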
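Both functions work on one syllable at a time, so converting a whole phrase means looping over its syllables. The sketch below is one possible way to do that. The helper name convert_phrase is hypothetical and not part of pyzhuyin. It takes the converter as an argument and assumes only that the converter raises ValueError for a syllable it cannot map, as described in the Notes section.

```python
from typing import Callable, List, Optional, Tuple


def convert_phrase(
    phrase: str, convert: Callable[[str], str]
) -> Tuple[List[Optional[str]], List[str]]:
    """Convert each whitespace-separated syllable of `phrase` with `convert`.

    Hypothetical helper, not part of pyzhuyin. Returns the converted
    syllables (None where conversion failed) and the syllables that failed.
    """
    results: List[Optional[str]] = []
    failed: List[str] = []
    for syllable in phrase.split():
        try:
            results.append(convert(syllable))
        except ValueError:
            # Assumption: the converter signals an unknown syllable with ValueError.
            results.append(None)
            failed.append(syllable)
    return results, failed
```

With pyzhuyin installed, a call such as convert_phrase("lu3 dan4", pinyin_to_zhuyin) should yield the two zhuyin syllables from the assertions above and an empty failure list. Collecting failures instead of stopping at the first one is a design choice that suits batch clean-up of user input.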

Testing 測試

Run the following command from the root of the project to run the test suite:

python3 -m unittest

Notes 備註

Only numeric tones are supported for pinyin.
- e.g. "lu3" instead of "lǔ"
The neutral tone is represented as 5.
- e.g. "˙ㄗ" -> "zi5"
For pinyin_to_zhuyin:
- raises ValueError if no corresponding zhuyin is found
- internally converts every v to ü
For zhuyin_to_pinyin:
- raises ValueError if no corresponding pinyin is found
- pass u_to_v=True to get v instead of ü in the output (e.g. "lv2")
Erhua (兒化音, the "-r" suffix) is not supported because it cannot be represented in the zhuyin system as a single "combo" word.
- e.g. "公園兒" -> "gong1 yuanr2" -> "ㄍㄨㄥㄩㄢㄦˊ" (not allowed)
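Because only numeric tones are accepted, input written with tone marks has to be normalized first. The following is a minimal sketch of one way to do that with the standard library. The function tone_marks_to_numeric is hypothetical and not part of pyzhuyin, and it assumes the input is a single, well-formed pinyin syllable.

```python
import unicodedata

# Combining marks that carry the four pinyin tones after NFD decomposition.
_TONE_MARKS = {
    "\u0304": "1",  # macron, first tone
    "\u0301": "2",  # acute, second tone
    "\u030c": "3",  # caron, third tone
    "\u0300": "4",  # grave, fourth tone
}


def tone_marks_to_numeric(syllable: str) -> str:
    """Turn a tone-marked pinyin syllable into numeric-tone form.

    Hypothetical helper, not part of pyzhuyin. A syllable without a tone
    mark is treated as neutral tone (5).
    """
    tone = "5"
    kept = []
    for ch in unicodedata.normalize("NFD", syllable):
        if ch in _TONE_MARKS:
            tone = _TONE_MARKS[ch]
        else:
            # Keeps base letters and the diaeresis of ü.
            kept.append(ch)
    # Recompose so that u + diaeresis becomes a single ü again.
    return unicodedata.normalize("NFC", "".join(kept)) + tone
```

For example, tone_marks_to_numeric("lǔ") gives "lu3", which is the form pinyin_to_zhuyin expects according to the notes above.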

Data Sources 資料來源

Ministry of Education, R.O.C. (中華民國教育部). Revised Mandarin Chinese Dictionary (《重編國語辭典修訂本》), version 2015_20210928.

URL: https://dict.revised.moe.edu.tw/

Licensed under CC BY-ND 3.0 TW

Author 作者

Raymond Ku
