Pinned Repositories
aho-corasick
Aho-Corasick的Java实现,针对Ascii优化,支持Unicode。
AhoCorasickDoubleArrayTrie
An extremely fast implementation of Aho Corasick algorithm based on Double Array Trie.
CS224n
CS224n: Natural Language Processing with Deep Learning Assignments Winter, 2017
HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
hanlp-lucene-plugin
HanLP中文分词Lucene插件,支持包括Solr在内的基于Lucene的系统
LDA4j
A Java implemention of LDA(Latent Dirichlet Allocation)
multi-criteria-cws
Simple Solution for Multi-Criteria Chinese Word Segmentation
pyhanlp
中文分词
TextRank
TextRank算法提取关键词的Java实现
Viterbi
An implementation of HMM-Viterbi Algorithm 通用的维特比算法实现
hankcs's Repositories
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
hankcs/pyhanlp
中文分词
hankcs/AhoCorasickDoubleArrayTrie
An extremely fast implementation of Aho Corasick algorithm based on Double Array Trie.
hankcs/CS224n
CS224n: Natural Language Processing with Deep Learning Assignments Winter, 2017
hankcs/Viterbi
An implementation of HMM-Viterbi Algorithm 通用的维特比算法实现
hankcs/multi-criteria-cws
Simple Solution for Multi-Criteria Chinese Word Segmentation
hankcs/hanlp-lucene-plugin
HanLP中文分词Lucene插件,支持包括Solr在内的基于Lucene的系统
hankcs/TreebankPreprocessing
Python scripts preprocessing Penn Treebank and Chinese Treebank
hankcs/ID-CNN-CWS
Source codes and corpora of paper "Iterated Dilated Convolutions for Chinese Word Segmentation"
hankcs/udacity-deep-learning
Assignments for Udacity Deep Learning class with TensorFlow in PURE Python, not IPython Notebook
hankcs/BERT-token-level-embedding
Generate BERT token level embedding without pain
hankcs/HanLPAndroidDemo
HanLP Android Demo
hankcs/sub-character-cws
Sub-Character Representation Learning
hankcs/gohanlp
Golang RESTful Client for HanLP
hankcs/distributed-bert
TensorFlow code and pre-trained models for BERT
hankcs/DeepBiaffineParserMXNet
An experimental implementation of biaffine parser using MXNet
hankcs/OpenCC-to-HanLP
无损转换OpenCC词典为HanLP格式
hankcs/gluon-nlp
NLP made easy
hankcs/bolt_splits
Split Broad Operational Language Translation corpus into train/dev/test set
hankcs/keras-rl2
Reinforcement learning with tensorflow 2 keras
hankcs/web-data
The repo to host all the web data including images for documents in dmlc projects.
hankcs/appworld
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Paper.
hankcs/data-science
Practical Approaches to Data Science with Text
hankcs/dataflow-evaluation-toolkit
hankcs/discourse-elasticsearch
discourse plugin to support elasticsearch
hankcs/dsa-java
Data Structures and Algorithms in Java
hankcs/elit
Evolution of Language and Information Technology
hankcs/mini_racer
Minimal embedded v8
hankcs/OpenDF
Code to reproduce LREC Paper Simplifying Semantic Annotations of SMCalFlow
hankcs/swne
Switchboard Named Entity Corpus