victorShawFan
A gradStudent of FDU_ major in computer science_ interested in knowledge graph and natural language processing 知乎名:蜡笔小熊猫
Fudan UniversityShanghai
victorShawFan's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
meta-llama/llama3
The official Meta Llama 3 GitHub site
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
microsoft/DeepSpeedExamples
Example models using DeepSpeed
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
CLUEbenchmark/CLUE
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
huggingface/course
The Hugging Face course on Transformers
zyds/transformers-code
手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube
OpenLLMAI/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
huggingface/pytorch-openai-transformer-lm
🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI
OpenLMLab/MOSS-RLHF
MOSS-RLHF
bilibili/Index-1.9B
A SOTA lightweight multilingual LLM
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
HIT-SCIR/Chinese-Mixtral-8x7B
中文Mixtral-8x7B(Chinese-Mixtral-8x7B)
KMnO4-zx/huanhuan-chat
Chat-甄嬛是利用《甄嬛传》剧本中所有关于甄嬛的台词和语句,基于ChatGLM2进行LoRA微调得到的模仿甄嬛语气的聊天语言模型。
lansinuote/More_Simple_Reinforcement_Learning
LearnPrompt/LLMs-cookbook
Examples and guides for using the LLMs
lansinuote/Transformer_Example
MikeGu721/AgentGroup
ZHZisZZ/modpo
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
lansinuote/Simple_RLHF
CLUEbenchmark/SuperCLUE-Role
SuperCLUE-Role中文原生角色扮演测评基准
lansinuote/Simple_RLHF_tiny
victorShawFan/OpenRLHF_add_simpo
添加了simpo方法的OpenRLHF,个人修改,原仓库链接:https://github.com/OpenLLMAI/OpenRLHF