Pinned Repositories
ADM-evaluation-suite-pytorch
anime-resume
AutoencoderKL
Awesome-Visual-Tokenizers
📖 This is a repository for organizing papers, code, and other resources related to visual tokenizers.
control-lora-v3
DiT-Muon
EDiT
GenTron
Transfusion-Legacy
lmms-finetune
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v, etc.
lavinal712's Repositories
lavinal712/AutoencoderKL
lavinal712/DiT-Muon
lavinal712/control-lora-v3
lavinal712/ADM-evaluation-suite-pytorch
lavinal712/Awesome-Visual-Tokenizers
📖 This is a repository for organizing papers, code, and other resources related to visual tokenizers.
lavinal712/Transfusion
lavinal712/Transfusion-Legacy
lavinal712/EDiT
lavinal712/anime-resume
lavinal712/Transfusion_PyTorchLightning
lavinal712/VA-VAE
lavinal712/Awesome-Galgame-Imouto
lavinal712/Awesome-Personalized-Image-Generation
A collection of resources on personalized image generation.
lavinal712/clip-recon
lavinal712/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
lavinal712/fast-DiT
Fast Diffusion Models with Transformers
lavinal712/flow_mar
lavinal712/imagenet-download
lavinal712/IQA-PyTorch
👁️ 🖼️ 🔥 PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM (Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
lavinal712/linear-probing
lavinal712/lmms-finetune
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v, etc.
lavinal712/omini-kontext
An inference and training framework for multiple image inputs in Flux Kontext dev
lavinal712/QLIP
[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation
lavinal712/REPA
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
lavinal712/soap_sort
lavinal712/sudoku-generator
lavinal712/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
lavinal712/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
lavinal712/VQGAN-pytorch
lavinal712/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism