pskun's Stars
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
ollama/ollama-python
Ollama Python library
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
llm-attacks/llm-attacks
Universal and Transferable Attacks on Aligned Language Models
modelscope/swift
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
google-research/deduplicate-text-datasets
yandex/YaFSDP
YaFSDP: Yet another Fully Sharded Data Parallel
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
alibaba/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
epfLLM/Megatron-LLM
distributed trainer for LLMs
feifeibear/LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
huggingface/cosmopedia
hao-ai-lab/Consistency_LLM
[ICML 2024] CLLMs: Consistency Large Language Models
whitzard-ai/jade-db
"他山之石、可以攻玉":复旦白泽智能发布面向国内开源和国外商用大模型的Demo数据集JADE-DB
ZigeW/data_management_LLM
Collection of training data management explorations for large language models
FlagOpen/FlagData
PKU-Alignment/align-anything
Align Anything: Training All-modality Model with Feedback
Aligner2024/aligner
Achieving Efficient Alignment through Learned Correction
RLHFlow/Directional-Preference-Alignment
Directional Preference Alignment
SparkJiao/llama-pipeline-parallel
A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to copy code and launch discussions about the problems you have encoured.
FreedomIntelligence/FastLLM
Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];