wayne391

貓咪 :3

@ailabstwTaiwan

wayne391's Stars

labuladong/fucking-algorithm
刷算法全靠套路，认准 labuladong 就够了！English version supported! Crack LeetCode, not only how, but also why.
Language:Markdown124k 2.3k 82523.1k
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
Language:Python73.8k 456 7.1k5.8k
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python32.8k 349 2955.1k
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python19.9k 189 3562k
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language:Python11.8k 136 195814
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python11.5k 106 8231k
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Language:C++9.3k 150 3.5k2k
katspaugh/wavesurfer.js
Audio waveform player
Language:TypeScript8.3k 165 2.1k1.6k
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language:Python7.8k 77 488552
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++7.1k 82 1.4k758
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Language:Python4.2k 62 89213
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Language:Python2.3k 41 98218
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
1.9k 171 466
naomiaro/waveform-playlist
Multitrack Web Audio editor and player with canvas waveform preview. Set cues, fades and shift multiple tracks in time. Record audio tracks or provide audio annotations. Export your mix to AudioBuffer or WAV! Add effects from Tone.js. Project inspired by Audacity.
Language:JavaScript1.4k 64 132283
LAION-AI/CLAP
Contrastive Language-Audio Pretraining
Language:Python1.2k 28 78118
descriptinc/descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Language:Python963 26 5679
Natooz/MidiTok
MIDI / symbolic music tokenizers for Deep Learning models 🎶
Language:Python604 7 8177
zhvng/open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
Language:Python489 16 2558
mir-aidj/all-in-one
All-In-One Music Structure Analyzer
Language:Python348 10 632
mjhydri/BeatNet
BeatNet is state-of-the-art (Real-Time) and Offline joint music beat, downbeat, tempo, and meter tracking system using CRNN and particle filtering. (ISMIR 2021's paper implementation).
Language:Python295 9 2646
spotify-research/llark
Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.
Language:Jupyter Notebook264 7 723
chrisdonahue/sheetsage
Transcribe music into lead sheets!
Language:Python254 11 2143
seungheondoh/lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
Language:Python237 8 729
XinhaoMei/WavCaps
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
Language:Python182 5 2510
Ashvala/AQUA-Tk
AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)
Language:Python88 3 36
HackAudio/PointToPoint_LT
Circuit modeling software for audio signal processing
Language:C++78 4 02
sony/hFT-Transformer
Pytorch implementation of automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture (hFT-Transformer).
Language:Python65 3 210
MWM-io/nansypp
Unofficial implementation of NANSY++ in Pytorch Lightning
Language:Python42 8 34
Kikyo-16/coco-mulla-repo
Official source codes of coco-mulla
Language:Python18 4 11
Natooz/music-modeling-time-duration
Code of the paper "Impact of time and note duration tokenizations on deep learning symbolic music modeling" (ISMIR 2023)
Language:Python91

wayne391

wayne391's Stars

labuladong/fucking-algorithm

yt-dlp/yt-dlp

karpathy/nanoGPT

facebookresearch/audiocraft

BlinkDL/RWKV-LM

Dao-AILab/flash-attention

NVIDIA/TensorRT

katspaugh/wavesurfer.js

facebookresearch/xformers

NVIDIA/TensorRT-LLM

luosiallen/latent-consistency-model

haoheliu/AudioLDM

archinetai/audio-ai-timeline

naomiaro/waveform-playlist

LAION-AI/CLAP

descriptinc/descript-audio-codec

Natooz/MidiTok

zhvng/open-musiclm

mir-aidj/all-in-one

mjhydri/BeatNet

spotify-research/llark

chrisdonahue/sheetsage

seungheondoh/lp-music-caps

XinhaoMei/WavCaps

Ashvala/AQUA-Tk

HackAudio/PointToPoint_LT

sony/hFT-Transformer

MWM-io/nansypp

Kikyo-16/coco-mulla-repo

Natooz/music-modeling-time-duration