pskun

Currently at @IDEA-CCNL as an algorithm engineer.

@IDEA-CCNLShenzhen, China

pskun's Stars

ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Language:Go91.5k 544 4.5k7.2k
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language:Python31.7k 201 4.9k3.9k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python19.5k 160 1.5k2.2k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
12k 270 109768
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language:Python6.5k 70 104583
ollama/ollama-python
Ollama Python library
Language:Python4.1k 31 147341
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Language:Python3.8k 33 513302
llm-attacks/llm-attacks
Universal and Transferable Attacks on Aligned Language Models
Language:Python3.3k 34 94465
modelscope/swift
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Language:Python2.7k 19 758245
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Language:Jupyter Notebook2.2k 41 54148
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Language:Python2k 44 125139
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Language:Python1.4k 13 413110
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
Language:Python1.4k 14 868
google-research/deduplicate-text-datasets
Language:Rust1.1k 13 41108
yandex/YaFSDP
YaFSDP: Yet another Fully Sharded Data Parallel
Language:Python824 18 341
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
Language:Python705 19 2958
alibaba/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Language:Python673 9 13294
epfLLM/Megatron-LLM
distributed trainer for LLMs
Language:Python526 18 5976
feifeibear/LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
Language:Python524 2 1451
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
Language:Python519 16 7058
huggingface/cosmopedia
Language:Python426 11 1143
hao-ai-lab/Consistency_LLM
[ICML 2024] CLLMs: Consistency Large Language Models
Language:Python340 9 1016
whitzard-ai/jade-db
"他山之石、可以攻玉"：复旦白泽智能发布面向国内开源和国外商用大模型的Demo数据集JADE-DB
296 3 219
ZigeW/data_management_LLM
Collection of training data management explorations for large language models
270 5 126
FlagOpen/FlagData
Language:Python254 4 1630
PKU-Alignment/align-anything
Align Anything: Training All-modality Model with Feedback
Language:Python109 6 1328
Aligner2024/aligner
Achieving Efficient Alignment through Learned Correction
Language:Python104 1 75
RLHFlow/Directional-Preference-Alignment
Directional Preference Alignment
45 3 42
SparkJiao/llama-pipeline-parallel
A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to copy code and launch discussions about the problems you have encoured.
Language:Python45 1 62
FreedomIntelligence/FastLLM
Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];
Language:Python32 2 03

pskun

pskun's Stars

ollama/ollama

hiyouga/LLaMA-Factory

haotian-liu/LLaVA

BradyFU/Awesome-Multimodal-Large-Language-Models

huggingface/lerobot

ollama/ollama-python

InternLM/xtuner

llm-attacks/llm-attacks

modelscope/swift

google-research/big_vision

huggingface/datatrove

argilla-io/distilabel

XueFuzhao/OpenMoE

google-research/deduplicate-text-datasets

yandex/YaFSDP

RLHFlow/RLHF-Reward-Modeling

alibaba/Pai-Megatron-Patch

epfLLM/Megatron-LLM

feifeibear/LLMSpeculativeSampling

NVIDIA/NeMo-Aligner

huggingface/cosmopedia

hao-ai-lab/Consistency_LLM

whitzard-ai/jade-db

ZigeW/data_management_LLM

FlagOpen/FlagData

PKU-Alignment/align-anything

Aligner2024/aligner

RLHFlow/Directional-Preference-Alignment

SparkJiao/llama-pipeline-parallel

FreedomIntelligence/FastLLM