/Chinese-NLP_Buddhist-Texts

This notebook explores using the SpaCy Chinese model to tokenize and analyze parts of speech for a short Buddhist text. Mistakes in the tags were corrected by hand to train an improved model for Pre-Modern Literary Chinese Texts. It was built with Pandas, SpaCy, and Xiaoquan Kong's Chinese SpaCy model. It implements a Part of Speech tagger and visualizations using diSplaCy.

Primary LanguageHTML

Binder

Chinese-NLP_Buddhist-Texts

This notebook uses the SpaCy Chinese model to tokenize and analyze parts of speech for a short Buddhist text. Mistakes in the tags were corrected by hand to train an improved model for Pre-Modern Literary Chinese Texts. It was built with Pandas, SpaCy, and Xiaoquan Kong's Chinese SpaCy model. It implements a Part of Speech tagger and visualizations using diSplaCy.

MyBinder Jupyter Notebook | Static HTML