Pinned Repositories
AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
bark
🔊 Text-Prompted Generative Audio Model
CLAP
Contrastive Language-Audio Pretraining
compound-word-transformer
Official implementation of compound word transformer (AAAI'21)
FaceDetection
Everything Face Detection and ARKit related
GiftCardsAR
View card balances using ARKit 2 and image tracking.
jukebox
Code for the paper "Jukebox: A Generative Model for Music"
Mubert-Text-to-Music
A simple notebook demonstrating prompt-based music generation via Mubert API
musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
bluenucleus's Repositories
bluenucleus/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
bluenucleus/bark
🔊 Text-Prompted Generative Audio Model
bluenucleus/CLAP
Contrastive Language-Audio Pretraining
bluenucleus/compound-word-transformer
Official implementation of compound word transformer (AAAI'21)
bluenucleus/FaceDetection
Everything Face Detection and ARKit related
bluenucleus/GiftCardsAR
View card balances using ARKit 2 and image tracking.
bluenucleus/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
bluenucleus/Mubert-Text-to-Music
A simple notebook demonstrating prompt-based music generation via Mubert API
bluenucleus/musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
bluenucleus/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
bluenucleus/Project-AiR
Using AudioKit
bluenucleus/test
bluenucleus/open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
bluenucleus/tango
Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"
bluenucleus/vq-voice-swap
Voice swapping with VQ-VAE and diffusion models
bluenucleus/Waveformer
An efficient architecture for real-time target sound extraction.