pskun

Currently at @IDEA-CCNL as an algorithm engineer.

@IDEA-CCNL · Shenzhen, China

pskun's Stars

ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Language: Go · 91.6k · 7.2k
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
Language:Python52258
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
Language: Python · 3.7k · 312
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language: Python · 31.7k · 3.9k
RLHFlow/Directional-Preference-Alignment
Directional Preference Alignment
463
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
Language:Python70558
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Language: Jupyter Notebook · 2.2k · 148
hao-ai-lab/Consistency_LLM
[ICML 2024] CLLMs: Consistency Large Language Models
Language:Python34016
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language: Python · 6.5k · 583
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python · 19.5k · 2.2k
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
12k · 768
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Language: Python · 1.4k · 110
feifeibear/LLMSpeculativeSampling
Fast inference from large language models via speculative decoding
Language:Python52551
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Language:Python61342
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Language: Python · 10.1k · 797
huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python · 9.6k · 1.2k
d8ahazard/sd_dreambooth_extension
Language: Python · 1.9k · 282
continue-revolution/sd-webui-segment-anything
Segment Anything for Stable Diffusion WebUI
Language: Python · 3.4k · 205
beichenzbc/Long-CLIP
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
Language:Python61630
pickxiguapi/Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
Language:Python291
BlackSamorez/tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
Language:Python61838
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Language: Jupyter Notebook · 14.9k · 1.4k
coder/code-server
VS Code in the browser
Language: TypeScript · 67.8k · 5.6k
gitpod-io/openvscode-server
Run upstream VS Code on a remote machine with access through a modern web browser from any device, anywhere.
Language: TypeScript · 5k · 432
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language: Jupyter Notebook · 37.6k · 4k
OpenMatch/ActiveRAG
This is the code repo for our paper "Revealing the Treasures of Knowledge via Active Learning".
Language:Python895
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language: Python · 2.1k · 206
AnswerDotAI/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Language: Python · 2.9k · 199
chujiezheng/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
Language:Jinja47847
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Language: Cuda · 1.2k · 110
