victorShawFan

A gradStudent of FDU_ major in computer science_ interested in knowledge graph and natural language processing 知乎名：蜡笔小熊猫

Fudan UniversityShanghai

victorShawFan's Stars

hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python35.4k 215 5.4k4.4k
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
Language:C++31.8k 479 2.5k3.7k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python27.3k 228 2653.1k
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language:Python16.6k 110 1.1k1.6k
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
Language:Python15.7k 132 6151.9k
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13.8k 98 181.1k
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
Language:Python6.5k 59 334461
microsoft/DeepSpeedExamples
Example models using DeepSpeed
Language:Python6.1k 75 5391k
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Language:Python5.7k 67 128504
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
Language:Python4.1k 41 395298
CLUEbenchmark/CLUE
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
Language:Python4k 90 100540
huggingface/course
The Hugging Face course on Transformers
Language:MDX2.3k 51 156758
zyds/transformers-code
手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube
Language:Jupyter Notebook2.2k 17 12311
OpenLLMAI/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language:Python1.8k 21 179168
huggingface/pytorch-openai-transformer-lm
🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI
Language:Python1.5k 92 39285
OpenLMLab/MOSS-RLHF
MOSS-RLHF
Language:Python1.3k 34 53101
bilibili/Index-1.9B
A SOTA lightweight multilingual LLM
Language:Python916 9 2547
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
Language:Python756 8 2445
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
Language:Python734 8 7451
HIT-SCIR/Chinese-Mixtral-8x7B
中文Mixtral-8x7B（Chinese-Mixtral-8x7B）
Language:Python642 15 3031
KMnO4-zx/huanhuan-chat
Chat-甄嬛是利用《甄嬛传》剧本中所有关于甄嬛的台词和语句，基于ChatGLM2进行LoRA微调得到的模仿甄嬛语气的聊天语言模型。
Language:Python523 4 2546
lansinuote/More_Simple_Reinforcement_Learning
Language:Jupyter Notebook302 1 172
LearnPrompt/LLMs-cookbook
Examples and guides for using the LLMs
Language:Jupyter Notebook256 6 227
lansinuote/Transformer_Example
Language:Python153 3 155
MikeGu721/AgentGroup
Language:Python83 2 13
ZHZisZZ/modpo
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
Language:Python59 2 34
lansinuote/Simple_RLHF
Language:Jupyter Notebook52 1 010
CLUEbenchmark/SuperCLUE-Role
SuperCLUE-Role中文原生角色扮演测评基准
23 3 20
lansinuote/Simple_RLHF_tiny
Language:Jupyter Notebook3 1 0
victorShawFan/OpenRLHF_add_simpo
添加了simpo方法的OpenRLHF，个人修改，原仓库链接：https://github.com/OpenLLMAI/OpenRLHF
Language:Python30

victorShawFan

victorShawFan's Stars

hiyouga/LLaMA-Factory

facebookresearch/faiss

meta-llama/llama3

huggingface/peft

THUDM/ChatGLM2-6B

naklecha/llama3-from-scratch

InternLM/InternLM

microsoft/DeepSpeedExamples

baichuan-inc/Baichuan-7B

baichuan-inc/Baichuan2

CLUEbenchmark/CLUE

huggingface/course

zyds/transformers-code

OpenLLMAI/OpenRLHF

huggingface/pytorch-openai-transformer-lm

OpenLMLab/MOSS-RLHF

bilibili/Index-1.9B

ContextualAI/HALOs

princeton-nlp/SimPO

HIT-SCIR/Chinese-Mixtral-8x7B

KMnO4-zx/huanhuan-chat

lansinuote/More_Simple_Reinforcement_Learning

LearnPrompt/LLMs-cookbook

lansinuote/Transformer_Example

MikeGu721/AgentGroup

ZHZisZZ/modpo

lansinuote/Simple_RLHF

CLUEbenchmark/SuperCLUE-Role

lansinuote/Simple_RLHF_tiny

victorShawFan/OpenRLHF_add_simpo