Arseny5
Master's student at Skoltech and HSE | Generative models and Multimodal foundations | Founder AI Knowledge Club
Skoltech & HSE Moscow, Russia
Arseny5's Stars
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
haoheliu/AudioLDM2
Text-to-Audio/Music Generation
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
NVlabs/edm
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
atong01/conditional-flow-matching
TorchCFM: a Conditional Flow Matching library
Yujun-Shi/DragDiffusion
[CVPR2024, Highlight] Official code for DragDiffusion
Ruixxxx/Awesome-Vision-Mamba-Models
[Official Repo] Visual Mamba: A Survey and New Outlooks
dongzhuoyao/awesome-flow-matching
A summary of related works about flow matching, stochastic interpolants
AaronZ345/StyleSinger
PyTorch Implementation of StyleSinger(AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis
lucidrains/mmdit
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
GTSinger/GTSinger
Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
dunnolab/awesome-in-context-rl
Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —
malbergo/stochastic-interpolants
VikhrModels/effective_llm_alignment
Effective LLM Alignment Toolkit
openvpi/SingingVocoders
A collection of neural vocoders suitable for singing voice synthesis tasks.
HannesStark/dirichlet-flow-matching
ChenYi99/EgoPlan
RakitinDen/ODE-SDE-Generative-Models
Tutorial on generative models based on ordinary (ODE) and stochastic (SDE) differential equations
Atmyre/RAVE
Code for paper RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement
lucidrains/scaling-vin-pytorch
Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group
justkolesov/Wasserstein1Benchmark
A set of tests for evaluating large-scale algorithms for Wasserstein-1 transport computation (NeurIPS'22).
sakharok13/Aligning-Stable-Diffusion-with-Noise-Conditioned-Perception
leffff/flow-matching-experiments
Simple tests of novel flow based methods
maxnygma/equivariant-gnn
My take on E(n) Equivariant Graph Neural Networks
ai-forever/emotional-fbc4.0-aij24
Emotional FusionBrain Challenge 4.0 - dev
makriot/harry-potter-chatbot
FSE Project: Chatbot (LLama 3.2), trained on "Harry Potter" movies
WladGrm/Bridge_Matching
Brownian Bridge matching generative model with Euler-Maruyama SDE solver