- scraping (downloading raw HTML)
- extracting (bs4)
- text_to_json (converting flat text files to JSON files)
- dictionary (build an offline Chinese dictionary with over 120K words)
- Idioms dictionary
- Top Chinese Characters
- Top 5000 rank
- Top 1500 rank
- spacy_parsing
- Vue sites
- Character frequency
- Reader