wayne391's Stars
labuladong/fucking-algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
katspaugh/wavesurfer.js
Audio waveform player
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
naomiaro/waveform-playlist
Multitrack Web Audio editor and player with canvas waveform preview. Set cues, fades and shift multiple tracks in time. Record audio tracks or provide audio annotations. Export your mix to AudioBuffer or WAV! Add effects from Tone.js. Project inspired by Audacity.
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Natooz/MidiTok
MIDI / symbolic music tokenizers for Deep Learning models 🎶
zhvng/open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
mir-aidj/all-in-one
All-In-One Music Structure Analyzer
mjhydri/BeatNet
BeatNet is state-of-the-art (Real-Time) and Offline joint music beat, downbeat, tempo, and meter tracking system using CRNN and particle filtering. (ISMIR 2021's paper implementation).
spotify-research/llark
Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.
chrisdonahue/sheetsage
Transcribe music into lead sheets!
seungheondoh/lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
XinhaoMei/WavCaps
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
Ashvala/AQUA-Tk
AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)
HackAudio/PointToPoint_LT
Circuit modeling software for audio signal processing
sony/hFT-Transformer
Pytorch implementation of automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture (hFT-Transformer).
MWM-io/nansypp
Unofficial implementation of NANSY++ in Pytorch Lightning
Kikyo-16/coco-mulla-repo
Official source codes of coco-mulla
Natooz/music-modeling-time-duration
Code of the paper "Impact of time and note duration tokenizations on deep learning symbolic music modeling" (ISMIR 2023)