lavinal712

Master degree candidate of USTC

Pinned Repositories

ADM-evaluation-suite-pytorch
Language:Python61
anime-resume
Language:TypeScript10
AutoencoderKL
Language:Python52 1 11
Awesome-Visual-Tokenizers
📖 This is a repository for organizing papers, codes and other resources related to visual tokenizers.
30
control-lora-v3
Language:Python11 1 20
DiT-Muon
Language:Python131
EDiT
Language:Python2 1 10
GenTron
Language:Python9 2 00
Transfusion-Legacy
Language:Python3 1 10
lmms-finetune
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.
Language:Python352 8 6240

lavinal712's Repositories

lavinal712/AutoencoderKL
Language:Python52 1 11
lavinal712/DiT-Muon
Language:Python131
lavinal712/control-lora-v3
Language:Python11 1 20
lavinal712/ADM-evaluation-suite-pytorch
Language:Python61
lavinal712/Awesome-Visual-Tokenizers
📖 This is a repository for organizing papers, codes and other resources related to visual tokenizers.
30
lavinal712/Transfusion
Language:Python3
lavinal712/Transfusion-Legacy
Language:Python3 1 10
lavinal712/EDiT
Language:Python2 1 10
lavinal712/anime-resume
Language:TypeScript10
lavinal712/Transfusion_PyTorchLightning
Language:Python11
lavinal712/VA-VAE
Language:Python10
lavinal712/Awesome-Galgame-Imouto
1
lavinal712/Awesome-Personalized-Image-Generation
A collection of resources on personalized image generation.
lavinal712/clip-recon
Language:Python
lavinal712/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language:Python
lavinal712/fast-DiT
Fast Diffusion Models with Transformers
Language:Python
lavinal712/flow_mar
Language:Python
lavinal712/imagenet-download
Language:Python
lavinal712/IQA-PyTorch
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
Language:Python
lavinal712/linear-probing
Language:Python
lavinal712/lmms-finetune
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.
Language:Python
lavinal712/omini-kontext
An inference and training framework for multiple image input in Flux Kontext dev
Language:Jupyter Notebook
lavinal712/QLIP
[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation
lavinal712/REPA
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Language:Python
lavinal712/soap_sort
Language:Python
lavinal712/sudoku-generator
Language:Python
lavinal712/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
lavinal712/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python0 0
lavinal712/VQGAN-pytorch
Language:Python
lavinal712/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Language:Python