orientedlin's Stars
xai-org/grok-1
Grok open release
PKUFlyingPig/cs-self-learning
计算机自学指南
pybind/pybind11
Seamless operability between C++11 and Python
carefree0910/carefree-data
A data processing module implemented with numpy
openai/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
ymcui/MacBERT
Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)
fastnlp/fastNLP
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
swabhs/open-sesame
A frame-semantic parsing system based on a softmax-margin SegRNN.
globalwordnet/OMW
The Open Multilingual Wordnet
cisnlp/simalign
Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)
facebookresearch/MUSE
A library for Multilingual Unsupervised or Supervised word Embeddings
ringgaard/sling
SLING - A natural language frame semantics parser
google/sling
SLING - A natural language frame semantics parser
facebookresearch/KILT
Library for Knowledge Intensive Language Tasks
doccano/doccano
Open source annotation tool for machine learning practitioners.
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
microsoft/nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
airaria/TextBrewer
A PyTorch-based knowledge distillation toolkit for natural language processing
CLUEbenchmark/CLUEPretrainedModels
高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型
brightmart/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
CLUEbenchmark/CLUE
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Tencent/TurboTransformers
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
carefree0910/carefree-learn
Deep Learning ❤️ PyTorch
HIT-SCIR/ltp
Language Technology Platform
brightmart/albert_zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
carefree0910/carefree-ml
carefree-ml implemented Machine Learning algorithms with numpy, mainly for educational use
carefree0910/carefree-toolkit
Some commonly used functions and modules
facebookresearch/LAMA
LAnguage Model Analysis