theshypig's Stars
THUDM/CodeGeeX4
CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.
ggerganov/llama.cpp
LLM inference in C/C++
wgwang/awesome-LLMs-In-China
Large language models (LLMs) in China
web-arena-x/visualwebarena
VisualWebArena is a benchmark for multimodal agents.
codefuse-ai/codefuse-evaluation
Industrial-level evaluation benchmarks for coding LLMs across the full life cycle of AI-native software development. An enterprise-grade evaluation system for code LLMs, released progressively.
langgptai/LangGPT
LangGPT: Empowering everyone to become a prompt expert! 🚀 Structured Prompt, Language of GPT
Alfred0622/HypR
A benchmark corpus for the ASR hypothesis revising task
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
natureLanguageQing/medical_ner
Building a medical named entity recognition dataset
NLPatVCU/medaCy
🏥 Medical Text Mining and Information Extraction with spaCy
microsoft/CyBERTron-LM
CyBERTron-LM is a project which collects some pre-trained Transformer-based models.
microsoft/COCO-LM
[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
nghuyong/Chinese-text-correction-papers
text correction papers
wolfgarbe/SymSpell
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
mmxgn/spacy-clausie
Implementation of the ClausIE information extraction system for python+spacy
yixiu00001/LSTM-CRF-medical
Builds a medical entity recognition model, including dictionaries and corpus annotation; implemented in Python
OmkarPathak/pyresparser
A simple resume parser used for extracting information from resumes
SparkJiao/Retrieval-based-Pre-training-for-Machine-Reading-Comprehension
Source code of the paper - REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-training
google-research-datasets/dstc8-schema-guided-dialogue
The Schema-Guided Dialogue Dataset
monologg/JointBERT
PyTorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"
iflytek/MiniRBT
MiniRBT (a series of small Chinese pre-trained models)
ymcui/Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, plus local CPU/GPU training and deployment
amazon-science/multiatis
Data and code for the paper "End-to-End Slot Alignment and Recognition for Cross-Lingual NLU" (Accepted to EMNLP 2020)
lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pre-trained models, large models, multimodal models, and large language models
ZhuiyiTechnology/roformer-v2
Upgraded version of RoFormer
snipsco/snips-nlu
Snips Python library to extract meaning from text
JohnSnowLabs/nlu
1 line for thousands of State of The Art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.
huggingface/blog
Public repo for HF blog posts