moiseshorta
❉ Sound artist, creative technologist and electronic musician working with generative A.I. From México, based in Berlin.
moiseshorta.audio
Berlin
moiseshorta's Stars
erew123/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but it supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, and wav file maintenance. It can also be used with third-party software via JSON calls.
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
moises-ai/moises-db
Moises Source Separation Public Dataset
kevinamiri/elevenlabs-react-example
elevenlabs react example
elevenlabs/elevenlabs-js
The official JavaScript (Node) library for ElevenLabs Text to Speech.
magic-research/piecewise-rectified-flow
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator
minzwon/sota-music-tagging-models
black-forest-labs/flux
Official inference repo for FLUX.1 models
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
yandex-research/vqdm
Official repository for the paper "VQDM: Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization"
NVlabs/edm2
Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
gpt-omni/mini-omni
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
parlance-zz/dualdiffusion
Fourier Dual Diffusion
myscience/x-lstm
Pytorch implementation of the xLSTM model by Beck et al. (2024)
styalai/xLSTM-pytorch
An easy-to-use implementation of xLSTM
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
JMGaljaard/VGGish-pytorch
fisheggg/LVNS-RAVE
swesterfeld/audiowmark
Audio Watermarking
zwl666666/infusion
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting
SonyCSLParis/music2latent
Encode and decode audio samples to/from compressed latent representations!
EmilianPostolache/stable-audio-controlnet
Fine-tune Stable Audio Open with DiT ControlNet.
DamRsn/NeuralNote
Audio Plugin for Audio to MIDI transcription using deep learning.
hacksider/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
aik2mlj/polyffusion
Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Agentic-Learning-AI-Lab/procreate-diffusion-public
Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"
yxlllc/ReFlow-VAE-SVC
dl4to/dl4to
DL4TO is a Python library for 3D topology optimization that is based on PyTorch and allows easy integration with neural networks.