cenyk1230
Please find the ChatGLM app at https://chatglm.cn/ and the MaaS platform at https://bigmodel.cn/
Tsinghua Science Park
cenyk1230's Stars
chatchat-space/Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent application built with Langchain and LLMs such as ChatGLM, Qwen, and Llama.
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
facebookresearch/fastText
Library for fast text representation and classification.
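A minimal text-classification sketch using fastText's Python bindings; `train.txt` is a hypothetical file in fastText's `__label__` format.

```python
import fasttext

# train.txt is a hypothetical file where each line looks like
# "__label__positive great movie", i.e. fastText's supervised format.
model = fasttext.train_supervised(input="train.txt", epoch=5, lr=0.5)

# predict returns the top-k labels and their probabilities.
labels, probs = model.predict("this library is fast", k=1)
print(labels, probs)
```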
karpathy/llm.c
LLM training in simple, raw C/CUDA
state-spaces/mamba
Mamba SSM architecture
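A minimal usage sketch of the `Mamba` block from the `mamba_ssm` package, closely following the repo's README; a CUDA device is required by the fused kernels, and the hyperparameters here are the README defaults.

```python
import torch
from mamba_ssm import Mamba

batch, seq_len, dim = 2, 64, 16
x = torch.randn(batch, seq_len, dim).to("cuda")

model = Mamba(
    d_model=dim,  # model dimension
    d_state=16,   # SSM state expansion factor
    d_conv=4,     # local convolution width
    expand=2,     # block expansion factor
).to("cuda")

y = model(x)  # the block maps a sequence to a sequence of the same shape
assert y.shape == x.shape
```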
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
huggingface/trl
Train transformer language models with reinforcement learning.
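A minimal supervised fine-tuning sketch with TRL's `SFTTrainer`; constructor arguments have shifted across TRL versions, so treat the exact signature as an assumption. The dataset is assumed to have a `text` column.

```python
from datasets import load_dataset
from trl import SFTTrainer

# Small slice of IMDB as a stand-in corpus; recent TRL versions train on
# the "text" column by default (an assumption about your TRL version).
dataset = load_dataset("imdb", split="train[:1%]")

trainer = SFTTrainer(
    model="facebook/opt-350m",  # model name or a preloaded model object
    train_dataset=dataset,
)
trainer.train()
```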
huggingface/text-generation-inference
Large Language Model Text Generation Inference
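A client-side sketch against TGI's `/generate` endpoint; it assumes a server is already running (e.g. via the official Docker image) and listening on localhost:8080, which is an assumption here.

```python
import requests

# Assumes a text-generation-inference server listening on port 8080.
resp = requests.post(
    "http://localhost:8080/generate",
    json={
        "inputs": "What is Deep Learning?",
        "parameters": {"max_new_tokens": 64, "temperature": 0.7},
    },
)
print(resp.json()["generated_text"])
```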
yangjianxin1/Firefly
Firefly: a training toolkit for large models, supporting Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models.
THUDM/GLM-4
GLM-4 series: open multilingual multimodal chat LMs.
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
mindspore-ai/mindspore
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
yizhongw/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
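A hedged sketch of the Self-Instruct bootstrapping loop as described in the paper, not the repo's actual code; `llm_complete` is a hypothetical stand-in for any completion API, and the substring filter is a placeholder for the paper's ROUGE-L similarity check.

```python
import random

def llm_complete(prompt: str) -> str:
    # Hypothetical stand-in for an LLM completion call; returns a canned
    # instruction here so the sketch runs without network access.
    return "Translate the following sentence into French."

seed_tasks = [
    "Write a haiku about autumn.",
    "Summarize the following paragraph in one sentence.",
]

task_pool = list(seed_tasks)
for _ in range(10):  # bootstrapping rounds
    # Prompt the model with a few sampled tasks and ask for a new one.
    examples = "\n".join(random.sample(task_pool, k=2))
    prompt = f"Here are some tasks:\n{examples}\nCome up with a new task:"
    candidate = llm_complete(prompt).strip()
    # Keep the candidate only if it is sufficiently novel (placeholder
    # for the paper's ROUGE-L based deduplication).
    if candidate and all(candidate not in t and t not in candidate for t in task_pool):
        task_pool.append(candidate)

print(task_pool)
```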
esbatmop/MNBVC
MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, aimed at matching the 40T of data used to train ChatGPT. MNBVC covers not only mainstream culture but also niche subcultures and even "Martian" internet slang, and includes news, essays, novels, books, magazines, papers, scripts, forum posts, wiki pages, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and every other form of plain-text Chinese data.
modelscope/data-juicer
Making data higher-quality, juicier, and more digestible for any large models! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷
jeinlee1991/chinese-llm-benchmark
A Chinese LLM capability leaderboard: it currently covers 128 models, including commercial models such as ChatGPT, GPT-4o, Google Gemini, Baidu ERNIE Bot, Alibaba Tongyi Qianwen, Baichuan, iFlytek Spark, SenseTime SenseChat, and MiniMax, as well as open-source models such as qwen2.5, llama3.1, glm4, InternLM2.5, openbuddy, and AquilaChat. It provides not only capability-score rankings but also the raw outputs of every model!
Zjh-819/LLMDataHub
A quick guide to trending instruction fine-tuning datasets
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
PhoebusSi/Alpaca-CoT
We unify the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-Tuning) for easy use, making a fine-tuning platform that researchers can pick up quickly. We welcome open-source enthusiasts to open any meaningful PR on this repo and integrate as many LLM-related technologies as possible.
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
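A sketch of LoRAX's per-request adapter routing: the server keeps one base model resident and loads LoRA adapters on demand, selected by an `adapter_id` in the request parameters (the endpoint shape follows TGI's `/generate`). The host, port, and adapter name below are assumptions.

```python
import requests

# Assumes a LoRAX server on localhost:8080 serving a shared base model;
# "some-org/some-lora-adapter" is a hypothetical adapter identifier.
resp = requests.post(
    "http://localhost:8080/generate",
    json={
        "inputs": "Classify the sentiment: I loved this film.",
        "parameters": {
            "adapter_id": "some-org/some-lora-adapter",  # per-request LoRA
            "max_new_tokens": 32,
        },
    },
)
print(resp.json()["generated_text"])
```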
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
OpenLLMAI/OpenRLHF
An easy-to-use, scalable, and high-performance RLHF framework (70B+ PPO full tuning, iterative DPO, LoRA, and Mixtral)
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
gkamradt/LLMTest_NeedleInAHaystack
Simple retrieval from LLMs at various context lengths to measure accuracy
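A hedged sketch of the needle-in-a-haystack test itself rather than the repo's harness: bury a known fact at a chosen depth in filler context, then check whether the model's answer recovers it. The commented `llm_complete` is a hypothetical completion call.

```python
def build_haystack(filler: str, needle: str, depth: float, length: int) -> str:
    """Insert `needle` at relative `depth` (0.0-1.0) into `length` chars of filler."""
    haystack = (filler * (length // len(filler) + 1))[:length]
    pos = int(depth * length)
    return haystack[:pos] + " " + needle + " " + haystack[pos:]

needle = "The best thing to do in San Francisco is eat a sandwich in Dolores Park."
context = build_haystack(
    "The quick brown fox jumps over the lazy dog. ", needle, depth=0.5, length=4000
)
prompt = context + "\n\nWhat is the best thing to do in San Francisco?"
print("needle placed at char", context.index("Dolores Park"))

# answer = llm_complete(prompt)  # hypothetical LLM call
# print("retrieved" if "Dolores Park" in answer else "missed")
```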
hikariming/chat-dataset-baseline
A hand-curated Chinese dialogue dataset, plus a snippet of ChatGLM fine-tuning code
redotvideo/mamba-chat
Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
NVIDIA/nccl-tests
NCCL Tests
THUDM/LongAlign
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
THUDM/NaturalCodeBench
NaturalCodeBench (Findings of ACL 2024)