Heepo
Machine Learning | Large Language Models | NLP | Search | Recommendation
Beijing University of Posts and TelecommunicationsBeijing
Heepo's Stars
lllyasviel/Fooocus
Focus on prompting and generating
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
karpathy/llama2.c
Inference Llama 2 in one file of pure C
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
stanfordnlp/stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
codemayq/chinese-chatbot-corpus
中文公开聊天语料库
hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
CVI-SZU/Linly
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
Farama-Foundation/chatarena
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
bigscience-workshop/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
google-research/deduplicate-text-datasets
microsoft/Llama-2-Onnx
1e0ng/simhash
A Python Implementation of Simhash Algorithm
CLUEbenchmark/CLUECorpus2020
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
MiuLab/TC-Bot
User Simulation for Task-Completion Dialogues
r-three/t-few
Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"
bigscience-workshop/data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
LowinLi/transformers-stream-generator
This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/Transformers.
aplmikex/deduplication_mnbvc
文本去重
mosaicml/llm-eval-dashboard
A streamlit app for visualizing LLM evals.
dunovank/jupyterlab_darkside_theme
Dark theme for JupyterLab v4.0+
sheng-kai-wang/DST4LLM
DST(Dialogue State Tracker) for LLM(Large Language Model)
znhy1024/ProToCo