alfredplpl

Research Scientist. Interests: data science, machine learning, robotics, neuroscience

CyberAgent, Inc., Japan

alfredplpl's Stars

kohya-ss/musubi-tuner
Language: Python 1089
iejMac/video2dataset
Easily create large video dataset from video urls
Language: Python 55967
microsoft/TRELLIS
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Language: Python 6k 375
CelebV-HQ/CelebV-HQ
[ECCV 2022] CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
Language: Python 40529
PKU-YuanGroup/LLaVA-CoT
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Language: Python 1.7k 60
NVIDIA/Cosmos-Tokenizer
A suite of image and video neural tokenizers
Language: Python 1.1k 27
kijai/ComfyUI-MochiWrapper
Language: Python 73162
genmoai/mochi
The best OSS video generation models
Language: Python 2.6k 266
lucidrains/rectified-flow-pytorch
Implementation of rectified flow and some of its followup research / improvements in Pytorch
Language: Python 2199
yukara-ikemiya/friendly-stable-audio-tools
Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.
Language: Python 15511
a-r-r-o-w/finetrainers
Memory-optimized training scripts for video models based on Diffusers
Language: Python 65068
TheDenk/cogvideox-controlnet
Simple Controlnet module for CogvideoX model.
Language: Jupyter Notebook 946
SerialLain3170/AwesomeAnimeResearch
Papers, repository and other data about anime or manga research. Please let me know if you have information that the list does not include.
1.1k 70
aigc-apps/CogVideoX-Fun
📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.
Language: Python 59540
EvolvingLMMs-Lab/lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Language: Python 2.2k 179
dailenson/One-DM
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
Language: Python 32331
zhenyuw16/GenArtist
Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"
Language: Jupyter Notebook 936
baaivision/Emu3
Next-Token Prediction is All You Need
Language: Python 2k 77
willisma/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
Language: Python 72842
shiml20/FlowTurbo
Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"
Language: Jupyter Notebook 623
IsaacGuan/3D-VAE
A variational autoencoder for volumetric shape generation
Language: Python 4010
openai/simple-evals
Language: Python 2.1k 183
Taited/clip-score
Quick scripts to calculate CLIP text-image similarity
Language: Python 20718
LuChengTHU/dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
Language: Python 1.6k 119
reppy4620/diffusion
My implementation of diffusion (like) models
Language: Python 102
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language: Python 4.1k 245
discus0434/metrics-utils
Language: Python 1
EricGuo5513/momask-codes
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
Language: Python 90072
Vchitect/Vchitect-2.0
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Language: Python 66818
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
Language: Python 4.9k 429
