pskun's Stars
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
RLHFlow/Directional-Preference-Alignment
Directional Preference Alignment
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
hao-ai-lab/Consistency_LLM
[ICML 2024] CLLMs: Consistency Large Language Models
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
feifeibear/LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
huggingface/trl
Train transformer language models with reinforcement learning.
d8ahazard/sd_dreambooth_extension
continue-revolution/sd-webui-segment-anything
Segment Anything for Stable Diffusion WebUI
beichenzbc/Long-CLIP
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
pickxiguapi/Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
BlackSamorez/tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
coder/code-server
VS Code in the browser
gitpod-io/openvscode-server
Run upstream VS Code on a remote machine with access through a modern web browser from any device, anywhere.
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
OpenMatch/ActiveRAG
This is the code repo for our paper "Revealing the Treasures of Knowledge via Active Learning".
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
AnswerDotAI/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
chujiezheng/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving