hayk-corpusant

hayk-corpusant's Stars

Lightning-AI/pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
Language: Python 28k 247 7.1k 3.3k
apple/coremltools
Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
Language: Python 4.3k 122 1.4k 626
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Language: Python 4.3k 63 94222
espeak-ng/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
Language: C 4.1k 101 1k 870
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Language: Python 2.7k 46 0 168
Stability-AI/stable-audio-tools
Generative models for conditional audio generation
Language: Python 2.5k 43 85231
nerfstudio-project/gsplat
CUDA accelerated rasterization of gaussian splatting
Language: Python 1.8k 44 185220
pytorch/executorch
On-device AI across mobile, embedded and edge for PyTorch
Language: C++ 1.7k 54 387288
symforce-org/symforce
Fast symbolic computation, code generation, and nonlinear optimization for robotics
Language: C++ 1.4k 43 255143
gnobitab/InstaFlow
⚡ InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
Language: Python 1.1k 43 2636
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Language: Python 1.1k 27 73101
gemelo-ai/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Language: Python 761 33 4686
horseee/DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
Language: Python 752 16 4335
DoMusic/Hybrid-Net
Real-time audio to chords, lyrics, beat, and melody.
Language: Python 654 5 227
Audio-AGI/WavJourney
WavJourney: Compositional Audio Creation with LLMs
Language: Python 513 25 145
shansongliu/M2UGen
This is the official repository for M2UGen
Language: Jupyter Notebook 437 10 1138
mir-aidj/all-in-one
All-In-One Music Structure Analyzer
Language: Python 401 9 1241
maxrmorrison/torchcrepe
PyTorch implementation of the CREPE pitch tracker
Language: Python 397 9 2661
diffusion-classifier/diffusion-classifier
Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training
Language: Python 388 16 3028
bytedance/uss
PyTorch implementation of Universal Source Separation with Weakly Labelled Data.
Language: Python 323 12 1215
YuanGongND/whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
Language: Python 312 10 3125
VinAIResearch/XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
Language: Python 294 10 2135
spotify-research/llark
Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.
Language: Jupyter Notebook 290 7 722
google-ai-edge/ai-edge-torch
Supporting PyTorch models with the Google AI Edge TFLite runtime.
Language: Jupyter Notebook 281 29 5036
seungheondoh/lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
Language: Python 265 8 1032
descriptinc/audiotools
Object-oriented handling of audio data, with GPU-powered augmentations, and more.
Language: Python 216 28 1837
locuslab/ect
Consistency Models Made Easy
Language: Python 195 6 137
tchambon/IADB
Official implementation of IADB (Iterative α-(de)Blending: a Minimalist Deterministic Diffusion Model), published at SIGGRAPH 2023.
Language: Python 149 4 0 14
microsoft/fadtk
A simple library for Fréchet Audio Distance (FAD) calculation
Language: Python 137 6 820
ryeoat3/gomin
GOMIN: Gaudio Open Mel-spectrogram Inversion Network
Language: Python 109 6 0 6