FlyingRoastDuck

xmu

FlyingRoastDuck's Stars

mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Language: Jupyter Notebook 37.6k 392 674k
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Language: Python 31.7k 201 4.9k3.9k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language: Python 25.4k 195 4.1k5.2k
Vision-CAIR/MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language: Python 25.3k 218 4592.9k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language: Python 21.7k 185 4872.1k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python 19.5k 160 1.5k2.2k
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Language: Python 11.3k 160 3001k
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
Language: Python 11.1k 120 2101.1k
liguodongiot/llm-action
This project aims to share the technical principles behind large models, along with hands-on practical experience.
Language: HTML 9.4k 81 21917
pytorch-labs/gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
Language: Python 5.6k 63 98504
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
Language: Jupyter Notebook 5.1k 61 379329
luban-agi/Awesome-AIGC-Tutorials
Curated tutorials and resources for Large Language Models, AI Painting, and more.
3.8k 29 2257
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
3.2k 133 18193
ray-project/llm-applications
A comprehensive guide to building RAG-based LLM applications for production.
Language: Jupyter Notebook 1.7k 17 12222
YangLing0818/RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Language: Jupyter Notebook 1.7k 26 5194
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Language: Cuda 1.2k 16 106110
MichalGeyer/plug-and-play
Official PyTorch implementation of “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)
Language: Python 905 9 1757
DirtyHarryLYL/LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
830 52 1435
kakaobrain/torchgpipe
A GPipe implementation in PyTorch
Language: Python 807 33 3398
Raudaschl/rag-fusion
Language: Python 779 9 795
songweige/rich-text-to-image
Rich-Text-to-Image Generation
Language: Python 755 20 1563
MaximeVandegar/Papers-in-100-Lines-of-Code
Implementation of papers in 100 lines of code.
Language: Python 716 22 9103
muzairkhattak/multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
Language: Python 635 7 8147
TencentARC/Open-MAGVIT2
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
Language: Python 627 20 3524
Yangyi-Chen/Multimodal-AND-Large-Language-Models
Paper list on multimodal and large language models, kept as a personal record of papers I read from the daily arXiv listings.
517 20 432
csyxwei/ELITE
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)
Language: Python 511 43 2030
genforce/freecontrol
Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"
Language: Python 423 27 1213
anosorae/IRRA
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)
Language: Python 194 2 4127
ZrrSkywalker/LLaMA-Adapter
Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
82 4 17
NVlabs/dream-in-4d
Official PyTorch implementation of "A Unified Approach for Text- and Image-guided 4D Scene Generation", [CVPR 2024]
Language: Python 56 9 13
