moiseshorta

❉ Sound artist, creative technologist and electronic musician working with generative A.I. From México, based in Berlin.

moiseshorta.audio · Berlin

moiseshorta's Stars

erew123/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, but it supports a variety of advanced features, such as a settings page, low-VRAM support, DeepSpeed, a narrator, model finetuning, custom models, and WAV file maintenance. It can also be used with third-party software via JSON calls.
Language:HTML86398
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python · 33.4k · 4.1k
moises-ai/moises-db
Moises Source Separation Public Dataset
Language:Python1065
kevinamiri/elevenlabs-react-example
ElevenLabs React example
Language:TypeScript115
elevenlabs/elevenlabs-js
The official JavaScript (Node) library for ElevenLabs Text to Speech.
Language:TypeScript10711
magic-research/piecewise-rectified-flow
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator
Language:Jupyter Notebook41226
minzwon/sota-music-tagging-models
Language:Python39263
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language: Python · 13.5k · 950
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
Language: Python · 1.4k · 109
yandex-research/vqdm
Official repository for the paper "VQDM: Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization"
Language:Python141
NVlabs/edm2
Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)
Language:Python47919
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language: Python · 10.3k · 661
gpt-omni/mini-omni
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
Language: Python · 2.3k · 233
parlance-zz/dualdiffusion
Fourier Dual Diffusion
Language:Python211
myscience/x-lstm
PyTorch implementation of the xLSTM model by Beck et al. (2024)
Language:Python11513
styalai/xLSTM-pytorch
An easy-to-use implementation of xLSTM
Language:Python181
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language: Python · 2k · 197
JMGaljaard/VGGish-pytorch
Language:Python171
fisheggg/LVNS-RAVE
Language:Python8
swesterfeld/audiowmark
Audio Watermarking
Language:C++37673
zwl666666/infusion
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting
Language:Python10
SonyCSLParis/music2latent
Encode and decode audio samples to/from compressed latent representations!
Language:Python1166
EmilianPostolache/stable-audio-controlnet
Fine-tune Stable Audio Open with DiT ControlNet.
Language:Python1533
DamRsn/NeuralNote
Audio Plugin for Audio to MIDI transcription using deep learning.
Language: C++ · 1.3k · 66
hacksider/Deep-Live-Cam
Real-time face swap and one-click video deepfake using only a single image
Language: Python · 34.9k · 4.9k
aik2mlj/polyffusion
Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls
Language:Python718
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Language: Python · 1.8k · 107
Agentic-Learning-AI-Lab/procreate-diffusion-public
Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"
31
yxlllc/ReFlow-VAE-SVC
Language:Python9214
dl4to/dl4to
DL4TO is a Python library for 3D topology optimization that is based on PyTorch and allows easy integration with neural networks.
Language:Jupyter Notebook403