dramaticmeow's Stars
LTH14/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
yutto-dev/yutto
:ice_cube: A cute and willful Bilibili video downloader (bilili V2)
Yikai-Liao/symusic
A swift and unified toolkit for symbolic music processing
OpenRL-Lab/Wandb_Tutorial
How to use wandb?
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
facebookresearch/libri-light
Dataset for lightly supervised training using the LibriVox audiobook recordings. https://librivox.org/
ZhangXInFD/SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
LetheSec/HuggingFace-Download-Accelerator
Uses HuggingFace's official download tool for high-speed downloads from a mirror site.
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
facebookresearch/fairseq2
FAIR Sequence Modeling Toolkit 2
microsoft/fadtk
A simple library for Fréchet Audio Distance (FAD) calculation
voidful/Codec-SUPERB
Audio Codec Speech processing Universal PERformance Benchmark
KinWaiCheuk/Jointist
Official Implementation of Jointist
LargeWorldModel/LWM
qjfoidnh/BaiduPCS-Go
Built on the original iikira/BaiduPCS-Go, with added support for saving files from share links and rapid-upload links.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
minzwon/musicfm
irahorecka/youtube2audio
GUI application to download YouTube videos as annotated MP3 or MP4 files
yangdongchao/UniAudio
The Open Source Code of UniAudio
HumanSignal/awesome-data-labeling
A curated list of awesome data labeling tools
EmulationAI/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
haoheliu/AudioLDM2
Text-to-Audio/Music Generation
yizhilll/MERT
Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".
meta-llama/llama
Inference code for Llama models
affige/genmusic_demo_list
a list of demo websites for automatic music generation research
MCQTSS/MCQTSS_QQMusic
QQ Music parser
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
salesforce/CodeGen
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)