Pinned Repositories
Angry-Bomb
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
awesome-optical-flow-algorithm
A curated list of resources dedicated to optical flow algorithms. Feel free to make PRs to contribute.
basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
beat-tracking-tcn
An implementation of Davies & Böck's beat-tracking temporal convolutional network
Beat-Transformer
Code for the ISMIR 2022 paper: Beat Transformer: Demixed Beat and Downbeat Tracking with Dilated Self-Attention
BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
blender-mcp
deep-attentive-time-warping
Deep Attentive Time Warping, published in Pattern Recognition (2023).
GlideWebpDecoder
A Glide WebpDecoder integration library for decoding and displaying WebP images
alvingao's Repositories
alvingao/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
alvingao/basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
alvingao/beat-tracking-tcn
An implementation of Davies & Böck's beat-tracking temporal convolutional network
alvingao/Beat-Transformer
Code for the ISMIR 2022 paper: Beat Transformer: Demixed Beat and Downbeat Tracking with Dilated Self-Attention
alvingao/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
alvingao/blender-mcp
alvingao/deep-attentive-time-warping
Deep Attentive Time Warping, published in Pattern Recognition (2023).
alvingao/groove2groove
Code for "Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data"
alvingao/html-midi-player
🎹 Play and display MIDI files on the web
alvingao/librosa
Python library for audio and music analysis
alvingao/llamacoder
Open source Claude Artifacts – built with Llama 3.1 405B
alvingao/M3U8Parser
alvingao/madmom
Python audio and music signal processing library
alvingao/min-char-rnn-pytorch
PyTorch implementation of min-char-rnn.py from Andrej Karpathy
alvingao/musicinformationretrieval.com
Instructional notebooks on music information retrieval.
alvingao/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
alvingao/melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
alvingao/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
alvingao/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
alvingao/nodejs-template
alvingao/Nominate
macOS app for renaming PDFs based on their contents
alvingao/riffusion
Stable diffusion for real-time music generation
alvingao/shadcn-multi-select-component
A multi-select component designed with shadcn/ui
alvingao/strapi-sdk-js
📦 JavaScript SDK for your Strapi API
alvingao/sullyo-webapp-starter
A monorepo template for building webapps - perfect for LLMs
alvingao/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
alvingao/TRELLIS
Official repo for the paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
alvingao/vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
alvingao/waveglow
A Flow-based Generative Network for Speech Synthesis
alvingao/wavenet_vocoder
WaveNet vocoder