lpty's Stars
InsaneLife/ChineseNLPCorpus
中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
chaoswork/sft_datasets
开源SFT数据集整理,随时补充
lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
EwingYangs/awesome-open-gpt
Collection of Open Source Projects Related to GPT,GPT相关开源项目合集🚀、精选🔥🔥
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
mosaicml/llm-foundry
LLM training code for Databricks foundation models
shadowsocks/go-shadowsocks2
Modern Shadowsocks in Go
tom-snow/wechat-windows-versions
保存微信历史版本
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
nebuly-ai/optimate
A collection of libraries to optimise AI model performances
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
CVI-SZU/Linly
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
MorvanZhou/Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
facebookresearch/metaseq
Repo for external large-scale work
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
TsinghuaAI/CPM-2-Pretrain
Code for CPM-2 Pre-Train
alibaba/ChatUI
The UI design language and React library for Conversational UI
brightmart/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
dennybritz/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
asyml/texar
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
jma127/pyltr
Python learning to rank (LTR) toolkit
CLUEbenchmark/CLUEDatasetSearch
搜索所有中文NLP数据集,附常用英文NLP数据集
thunlp/PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
frida/frida
Clone this repo to build Frida
fatedier/frp
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.