K-945's Stars
sail-sg/CPO
lansinuote/Simple_LLM_DPO
YuxiXie/MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
Duxiaoman-DI/XuanYuan
轩辕:度小满中文金融对话大模型
JsonBorn7/llm-hero
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
ysymyth/ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
datawhalechina/tiny-universe
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
datawhalechina/llms-from-scratch-cn
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理
LirongWu/GraphMixup
Code for ECML-PKDD 2022 paper "GraphMixup: Improving Class-Imbalanced Node Classification by Reinforcement Mixup and Self-supervised Context Prediction"
datawhalechina/so-large-lm
大模型基础: 一文了解大模型基础知识
wdndev/llm_interview_note
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
Lordog/dive-into-llms
《动手学大模型Dive into LLMs》系列编程实践教程
datawhalechina/self-llm
《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合**宝宝的部署教程
google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
OpenLMLab/LEval
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
lyogavin/airllm
AirLLM 70B inference with single 4GB GPU
belladoreai/llama-tokenizer-js
JS tokenizer for LLaMA 1 and 2
futuredialchallenge/2024-RAG
A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge
liyucheng09/Selective_Context
Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.
RUCAIBox/BAMBOO
acl-org/acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
MediaBrain-SJTU/MING
明医 (MING):中文医疗问诊大模型
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Ethan-yt/guwen-models
GuwenModels: 古文自然语言处理模型合集, 收录互联网上的古文相关模型及资源. A collection of Classical Chinese natural language processing models, including Classical Chinese related models and resources on the Internet.
mingdachen/SummScreen
SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022)