/Chinese-Example-Sentences

63k Chinese sentences with simplified, traditional, pinyin and english translation for offline use

Primary LanguagePython

Chinese-Example-Sentences

Chinese sentences with simplified, traditional, pinyin and english translation for offline use in app. Sentences data taken from Tatoeba, pinyin and traditional characters generated using python module pinyin_jyutping_sentence and hanziconv. Translation generated using Google sheets.

Total 63352 sentences in db and tsv file.

Downloads

TSV file

This is tab separated file.

| id | Simplified | Traditional | Pinyin | English |

10	我不知道。	我不知道。	wǒ bù zhīdào 。	I do not know.

sqlite .db file

The sen_data.db contains table examples with id, simplified, traditional, pinyin, english.

Usage

Get two random sentences with pinyin, traditional characters and translation
View read_2_random_sen.py

Create

  1. Download sentences database from Tatoeba
  2. Use Google translate to translate the sentences
  3. Use Python module pinyin_jyutping_sentence and hanziconv to generate pinyin and traditional characters for sentences
  4. Use gen_sen.py and write data to .tsv file
  5. Use tsv_to_db.py python code to create databases.

View this to create tsv and db files with translation for other language

Simple 中文