Heepo

Machine Learning | Large Language Models | NLP | Search | Recommendation

Beijing University of Posts and TelecommunicationsBeijing

Heepo's Stars

lllyasviel/Fooocus
Focus on prompting and generating
Language:Python40.7k 314 1.5k5.7k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python32.3k 205 5k4k
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Language:Python18.3k 185 7311.9k
karpathy/llama2.c
Inference Llama 2 in one file of pure C
Language:C17.3k 193 2202.1k
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Language:Python10.3k 92 7641k
stanfordnlp/stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
Language:Python7.3k 140 895889
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
Language:Python4.1k 41 395293
codemayq/chinese-chatbot-corpus
中文公开聊天语料库
Language:Python4k 75 18788
hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
Language:Python3.7k 32 374471
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
3.4k 64 54240
CVI-SZU/Linly
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型；ChatFlow中文对话模型；中文OpenLLaMA模型；NLP预训练/指令微调数据集
Language:Python3k 51 134233
LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
2.9k 72 38664
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
2.5k 46 3161
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Language:Jupyter Notebook2.3k 32 88154
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
Language:Python1.6k 15 8177
Farama-Foundation/chatarena
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
Language:Python1.3k 18 23129
bigscience-workshop/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language:Python1.3k 24 144214
google-research/deduplicate-text-datasets
Language:Rust1.1k 13 41110
microsoft/Llama-2-Onnx
Language:Python1k 338 2692
1e0ng/simhash
A Python Implementation of Simhash Algorithm
Language:Python976 22 43222
CLUEbenchmark/CLUECorpus2020
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
917 21 1282
MiuLab/TC-Bot
User Simulation for Task-Completion Dialogues
Language:OpenEdge ABL805 44 15296
r-three/t-few
Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"
Language:Python428 8 3259
bigscience-workshop/data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
Language:Jupyter Notebook301 24 1240
LowinLi/transformers-stream-generator
This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/Transformers.
Language:Python95 2 1214
aplmikex/deduplication_mnbvc
文本去重
Language:Python65 2 26
mosaicml/llm-eval-dashboard
A streamlit app for visualizing LLM evals.
Language:Python39 4 06
dunovank/jupyterlab_darkside_theme
Dark theme for JupyterLab v4.0+
Language:CSS21 3 20
sheng-kai-wang/DST4LLM
DST(Dialogue State Tracker) for LLM(Large Language Model)
Language:Java19 1 00
znhy1024/ProToCo
Language:Python6 1 03

Heepo

Heepo's Stars

lllyasviel/Fooocus

hiyouga/LLaMA-Factory

ymcui/Chinese-LLaMA-Alpaca

karpathy/llama2.c

Lightning-AI/litgpt

stanfordnlp/stanza

baichuan-inc/Baichuan2

codemayq/chinese-chatbot-corpus

hiyouga/ChatGLM-Efficient-Tuning

esbatmop/MNBVC

CVI-SZU/Linly

LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words

Zjh-819/LLMDataHub

FasterDecoding/Medusa

hkust-nlp/ceval

Farama-Foundation/chatarena

bigscience-workshop/Megatron-DeepSpeed

google-research/deduplicate-text-datasets

microsoft/Llama-2-Onnx

1e0ng/simhash

CLUEbenchmark/CLUECorpus2020

MiuLab/TC-Bot

r-three/t-few

bigscience-workshop/data-preparation

LowinLi/transformers-stream-generator

aplmikex/deduplication_mnbvc

mosaicml/llm-eval-dashboard

dunovank/jupyterlab_darkside_theme

sheng-kai-wang/DST4LLM

znhy1024/ProToCo