RifleZhang's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
meta-llama/llama3
The official Meta Llama 3 GitHub site
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
OpenBMB/MiniCPM-o
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
LLaVA-VL/LLaVA-NeXT
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
openai/simple-evals
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation via the lmms-eval module.
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
baaivision/Emu3
Next-Token Prediction is All You Need
GAIR-NLP/O1-Journey
O1 Replication Journey
PKU-YuanGroup/LLaVA-CoT
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
laekov/fastmoe
A fast MoE impl for PyTorch
Zefan-Cai/KVCache-Factory
Unified KV Cache Compression Methods for Auto-Regressive Models
BradyFU/Video-MME
✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
xiaoachen98/Open-LLaVA-NeXT
An open-source implementation for training LLaVA-NeXT.
RLHF-V/RLAIF-V
[CVPR'25] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
RLHF-V/RLHF-V
[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
TideDra/VL-RLHF
An RLHF infrastructure for vision-language models
NVIDIA/ngpt
Normalized Transformer (nGPT)
ByungKwanLee/Meteor
[NeurIPS 2024] Official PyTorch implementation of Meteor (Mamba-based traversal of rationale), which improves vision-language performance across diverse capabilities.
thu-pacman/FasterMoE
luyug/magix
Supercharge huggingface transformers with model parallelism.
hamishivi/EasyLM
Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, finetuning, evaluating, and serving LLMs in JAX/Flax.
RifleZhang/LLaVA-Reasoner-DPO
HazyResearch/train-tk
train with kittens!
shawntan/stickbreaking-attention
Stick-breaking attention
hyhieu/easy_pybind
princeton-nlp/PTP
Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073