K-945

K-945's Stars

sail-sg/CPO
Language:Python25
lansinuote/Simple_LLM_DPO
Language:Jupyter Notebook5910
YuxiXie/MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
Language:Jupyter Notebook9613
Duxiaoman-DI/XuanYuan
轩辕：度小满中文金融对话大模型
Language:Python1k88
JsonBorn7/llm-hero
Language:Jupyter Notebook323
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
Language:HTML9.2k892
ysymyth/ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
Language:Jupyter Notebook1.9k188
datawhalechina/tiny-universe
《大模型白盒子构建指南》：一个全手搓的Tiny-Universe
Language:Python1k96
datawhalechina/llms-from-scratch-cn
仅需Python基础，从0构建大语言模型；从0逐步构建GLM4\Llama3\RWKV6，深入理解大模型原理
Language:Jupyter Notebook1.1k156
LirongWu/GraphMixup
Code for ECML-PKDD 2022 paper "GraphMixup: Improving Class-Imbalanced Node Classification by Reinforcement Mixup and Self-supervised Context Prediction"
Language:Python205
datawhalechina/so-large-lm
大模型基础: 一文了解大模型基础知识
2.5k226
wdndev/llm_interview_note
主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题
Language:HTML2.6k306
Lordog/dive-into-llms
《动手学大模型Dive into LLMs》系列编程实践教程
3.2k279
datawhalechina/self-llm
《开源大模型食用指南》基于Linux环境快速部署开源大模型，更适合**宝宝的部署教程
Language:Jupyter Notebook7.9k946
google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Language:Python2.8k583
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13.1k1k
OpenLMLab/LEval
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
Language:Python34114
lyogavin/airllm
AirLLM 70B inference with single 4GB GPU
Language:Jupyter Notebook4k335
belladoreai/llama-tokenizer-js
JS tokenizer for LLaMA 1 and 2
Language:JavaScript33022
futuredialchallenge/2024-RAG
A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge
Language:Python83
liyucheng09/Selective_Context
Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.
Language:Python27212
RUCAIBox/BAMBOO
Language:Python313
acl-org/acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
Language:TeX670171
MediaBrain-SJTU/MING
明医 (MING)：中文医疗问诊大模型
Language:Python811103
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python36.4k4.5k
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language:Python30.9k3.8k
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
Language:Python5.2k500
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Language:Python43.6k5.2k
Ethan-yt/guwen-models
GuwenModels: 古文自然语言处理模型合集, 收录互联网上的古文相关模型及资源. A collection of Classical Chinese natural language processing models, including Classical Chinese related models and resources on the Internet.
Language:Python14619
mingdachen/SummScreen
SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022)
Language:Python342