mhshih

Chinese Language Processing

Pinned Repositories

AI_Tutorial
Rocling2019 AI Tutorial file
Language:Jupyter Notebook
Chinese-Word-Vectors
100+ pre-trained Chinese word vectors
Language:Python
mhshih.github.io
Language:HTML
nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
NLPCC-WordSeg-Weibo
Language:Python
Susing-Piauki
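Takes full-Han-character (全漢) and full-romanization (全羅) Taiwanese text as input, aligns the two, and tags each word with its part of speech
Language:Python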
web
Language:Python

mhshih's Repositories

mhshih/ArticutAPI_Taigi
Taigi CWS/POS/NER natural language processing tool with Articut as kernel.
Language:Python
mhshih/mhshih.github.io
Language:HTML
mhshih/nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
mhshih/NLPCC-WordSeg-Weibo
Language:Python
mhshih/Susing-Piauki
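Takes full-Han-character (全漢) and full-romanization (全羅) Taiwanese text as input, aligns the two, and tags each word with its part of speech
Language:Python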
mhshih/AI_Tutorial
Rocling2019 AI Tutorial file
Language:Jupyter Notebook
mhshih/Alpaca-CoT
We extend CoT data to Alpaca to boost its reasoning ability. We are constantly expanding our collection of instruction-tuning data and integrating more LLMs for easy use.
Language:Jupyter Notebook
mhshih/ArticutAPI
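API of Articut, a Chinese word segmenter with semantic part-of-speech tagging. Word segmentation (斷詞, also called 分詞) is the foundation of Chinese information processing. Articut uses no machine learning and needs no data model; relying only on the grammar rules of modern vernacular Chinese, it achieves an F1-measure above 91% and a recall above 96% on SIGHAN 2005.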
mhshih/bert
TensorFlow code and pre-trained models for BERT
Language:Python
mhshih/Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model. A low-resource Chinese LLaMA + LoRA approach, with a structure modeled on Alpaca.
mhshih/chineseQIE
mhshih/Disfactory
Language:Python
mhshih/engine
Corpus engine of PTT-Corpus
Language:Python
mhshih/fChartExamples2
Categorized examples for fChart 6.0 and later
mhshih/G0HWcrawler
Language:Python
mhshih/hue7jip8
A compiled list of corpora for Taiwanese (Taigi), indigenous languages, and Hakka
mhshih/interactive-tutorials
Interactive Tutorials
mhshih/ladsbook
Linguistic Analysis and Data Science
mhshih/MALINDO_Morph
Morphological dictionary for Malay/Indonesian
Language:Python
mhshih/mhshih2.github.io
mhshih/moedict-data-twblg
Data files for the Dictionary of Frequently-Used Taiwan Minnan (臺灣閩南語常用詞辭典)
Language:Perl
mhshih/NLP18
mhshih/NLP2022
Language:Python
mhshih/overleaf
A web-based collaborative LaTeX editor
mhshih/python2023
Language:Python
mhshih/readr-data
We will open the data for the news
mhshih/sketch_diff
Language:Python
mhshih/Susing-Kuhuat-Piautiau
Taiwanese (Taigi) part-of-speech tagging, syntax, and tone sandhi
Language:Python
mhshih/Taiwanese-Corpora.github.io
Language:Python
mhshih/tcsl
Language:Python
