houyili's Stars
OpenLMLab/GAOKAO-Bench
GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.
OpenLMLab/GAOKAO-Bench-2023
GAOKAO-Bench-2023 is a supplement to GAOKAO-Bench, a dataset to evaluate large language models.
state-spaces/mamba
Mamba SSM architecture
esbatmop/MNBVC
MNBVC (Massive Never-ending BT Vast Chinese corpus), an ultra-large-scale Chinese corpus: 40T of data, comparable to the data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (stylized internet slang). It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wikis, classical poetry, lyrics, product descriptions, jokes, anecdotes, chat logs, and more.
jxxghp/MoviePilot
An automated NAS media library management tool
LLaMafia/llamafia.github
databricks/megablocks
microsoft/Tutel
Tutel MoE: An Optimized Mixture-of-Experts Implementation
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
triton-lang/triton
Development repository for the Triton language and compiler
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
kyegomez/FlashAttention20
Get down and dirty with FlashAttention 2.0 in PyTorch: plug-and-play, no complex CUDA kernels
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
deepseek-ai/DeepSeek-LLM
DeepSeek LLM: Let there be answers
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
tatsu-lab/gpt_paper_assistant
GPT4 based personalized ArXiv paper assistant bot
Alibaba-NLP/EcomGPT
An Instruction-tuned Large Language Model for E-commerce
GPT-Fathom/GPT-Fathom
GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well as OpenAI's earlier models on 20+ curated benchmarks under aligned settings.
google-research/xtreme
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.
NVIDIA/apex
A PyTorch extension: tools for easy mixed-precision and distributed training in PyTorch
pluto-junzeng/C4-zh
A large-scale Chinese corpus
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
bigscience-workshop/data-preparation
Code used for sourcing and cleaning the BigScience ROOTS corpus
alibaba/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
THUDM/GLM
GLM (General Language Model)
zhenbench/z-bench
Z-Bench 1.0 by ZhenFund (真格基金): a muggle's Chinese test set for large language models. Z-Bench is an LLM prompt dataset for non-technical users, developed by an enthusiastic AI-focused team at ZhenFund.
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
baichuan-inc/Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
meta-llama/llama
Inference code for Llama models