mathigatti
ML Engineer and Creative Coder. I like to make computers talk and sing.
Buenos Aires, Argentina
mathigatti's Stars
f/awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
mlfoundations/open_clip
An open source implementation of CLIP.
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
MubertAI/Mubert-Text-to-Music
A simple notebook demonstrating prompt-based music generation via Mubert API
prophesier/diff-svc
Singing Voice Conversion via diffusion model
readbeyond/aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
huggingface/setfit
Efficient few-shot learning with Sentence Transformers
nat/natbot
Drive a browser with GPT-3
bfelbo/DeepMoji
State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.
timojl/clipseg
This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".
marl/crepe
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
pettarin/forced-alignment-tools
A collection of links and notes on forced alignment tools
sophiefy/Sovits
An unofficial implementation of the combination of Soft-VC and VITS
bshall/soft-vc
Soft speech units for voice conversion
amrrs/stable-diffusion-prompt-inpainting
This project helps you do prompt-based inpainting without having to paint the mask - using Stable Diffusion and Clipseg
ServiceNow/picard
PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models. PICARD is a ServiceNow Research project that was started at Element AI.
ofirpress/self-ask
Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"
WX-Wei/HarmoF0
howard1337/S2VC
Kyubyong/mtp
Multi-lingual Text Processing
patriceguyot/Yin
Fast Python implementation of the Yin algorithm: a fundamental frequency estimator
buttplugio/buttplug-py
Python implementation of core message system and client for the Buttplug Sex Toy Protocol Standard
summerstay/true_poetry
Poetry generator by gpt-2 with meter and rhyme constraints.
Mainakdeb/text-2-cellular-automata
:brain: Neural Cellular Automata + CLIP
00sapo/OpenEWLD
A Public Domain Leadsheet Dataset
yoyololicon/eva
A screaming vocal samples dataset.
sebasgverde/music-geometry-eval
A python library to automatically evaluate music tonality based on geometry