mymsimple's Stars
PKU-YuanGroup/ChatLaw
ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型
bojone/word-discovery
速度更快、效果更好的中文新词发现
yangjianxin1/Firefly
Firefly: 大模型训练工具,支持训练Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
meta-llama/llama
Inference code for Llama models
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
FreedomIntelligence/GrammarGPT
The code and data for GrammarGPT.
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
InternLM/InternLM-techreport
shibing624/textgen
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLOOM,GPT2,Seq2Seq,BART,T5,UDA等模型的训练和预测,开箱即用。
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
OpenBMB/BMTools
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
THUDM/GLM
GLM (General Language Model)
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
zhuweiyou/chatgpt-api
封装 OpenAI 网页版最新 ChatGPT 接口, 不需要使用 API Key, 完全免费
blcuicall/CCL2023-CLTC
CCL 2023 汉语学习者文本纠错评测
houbb/word-checker
🇨🇳🇬🇧Chinese and English word spelling corrector.(中文易错别字检测,中文拼写检测纠正。英文单词拼写校验工具)
Tencent/Forward
A library for high performance deep learning inference on NVIDIA GPUs.
PaddlePaddle/ERNIE
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
exaloop/codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
CLUEbenchmark/CLUEDatasetSearch
搜索所有中文NLP数据集,附常用英文NLP数据集
futantan/OpenGpt
Create your own ChatGPT App in seconds.
google-research/lasertagger
OpenPPL/ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
huggingface/blog
Public repo for HF blog posts
google-research/pegasus
ZhuiyiTechnology/WoBERT
以词为基本单位的中文BERT
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.