canqin001's Stars
ggerganov/llama.cpp
LLM inference in C/C++
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
meta-llama/llama3
The official Meta Llama 3 GitHub site
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
ikatyang/emoji-cheat-sheet
A markdown version emoji cheat sheet
modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
dmlc/decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
mbzuai-oryx/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
ProjectNUWA/DragNUWA
NVlabs/DoRA
[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation
phohenecker/switch-cuda
A simple bash script for switching between installed versions of CUDA.
Vchitect/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Vision-CAIR/MiniGPT4-video
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
RaivoKoot/Video-Dataset-Loading-Pytorch
Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
bfshi/scaling_on_scales
When do we not need larger vision models?
tsb0601/MMVP
Karine-Huang/T2I-CompBench
[NeurIPS 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation
valeoai/Maskgit-pytorch
Unofficial MaskGIT reproduction in PyTorch
zzxslp/SoM-LLaVA
[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
BenchCouncil/AIGCBench
Official repo for AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AI
heliossun/SQ-LLaVA
Visual self-questioning for large vision-language assistant.
uncbiag/UniLMMV