xiholix's Stars
TheAlgorithms/Python
All Algorithms implemented in Python
Angel-ML/angel
A Flexible and Powerful Parameter Server for large-scale machine learning
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
jlevy/the-art-of-command-line
Master the command line, in one page
julioasotodv/spark-tree-plotting
A simple tool for plotting Spark ML's Decision Trees
dylanaraps/pure-bash-bible
📖 A collection of pure bash alternatives to external processes.
ttacon/libphonenumber
golang port of Google's libphonenumber
amueller/introduction_to_ml_with_python
Notebooks and code for the book "Introduction to Machine Learning with Python"
fchollet/deep-learning-with-python-notebooks
Jupyter notebooks for the code samples of the book "Deep Learning with Python"
graphframes/graphframes
hankcs/HanLP
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
thunlp/THULAC
An Efficient Lexical Analyzer for Chinese
shibing624/pycorrector
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
crownpku/Awesome-Chinese-NLP
A curated list of resources for Chinese NLP 中文自然语言处理相关资料
stanfordnlp/GloVe
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
facebookresearch/fastText
Library for fast text representation and classification.
facebookresearch/SentEval
A python tool for evaluating the quality of sentence embeddings.
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
zhezhaoa/ngram2vec
Four word embedding models implemented in Python. Supporting arbitrary context features
Embedding/Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量
openai/finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
allenai/bilm-tf
Tensorflow implementation of contextualized word representations from bi-directional language models
jacobeisenstein/gt-nlp-class
Course materials for Georgia Tech CS 4650 and 7650, "Natural Language"
YerevaNN/DIIN-in-Keras
Reproducing Densely Interactive Inference Network in Keras
RedditSota/state-of-the-art-result-for-machine-learning-problems
This repository provides state of the art (SoTA) results for all machine learning problems. We do our best to keep this repository up to date. If you do find a problem's SoTA result is out of date or missing, please raise this as an issue or submit Google form (with this information: research paper name, dataset, metric, source code and year). We will fix it immediately.
erickrf/multiffn-nli
Implementation of the multi feed-forward network architecture by Parikh et al. (2016) for Natural Language Inference.
thu-coai/ecm
This project is a tensorflow implement of our work, ECM (emotional chatting machine).
wepe/O2O-Coupon-Usage-Forecast
1st Place Solution for O2O Coupon Usage Forecast
shenweichen/DeepCTR
Easy-to-use,Modular and Extendible package of deep-learning based CTR models .