Cathy0610's Stars
BytedanceSpeech/seed-tts-eval
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
haoheliu/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
echocatzh/MTFAA-Net
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
radames/Real-Time-Latent-Consistency-Model
App showcasing multiple real-time diffusion models pipelines with Diffusers
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
mdeff/fma
FMA: A Dataset For Music Analysis
iranroman/musicinformationretrieval.com
Instructional notebooks on music information retrieval.
source-separation/tutorial
Tutorial covering Open Source tools for Source Separation.
marl/crepe
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
Music-and-Culture-Technology-Lab/omnizart
Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.
libAudioFlux/audioFlux
A library for audio and music analysis, feature extraction.
McDevon/music-hack
Python scripts for altering music files with beat detection. Reverse beats, turn song into swing, etc.
generic-beat-detector/GBD
IoT Framework for Generic and Realtime Music Beat Detection
chrvadala/music-beat-detector
music-beat-detector is a library that analyzes a music stream and detects any beat. It can be used to control lights or any magic effect by the music wave.
dodiku/AudioOwl
Fast and simple music and audio analysis using RNN in Python 🕵️♀️ 🥁
adamstark/BTrack
A Real-Time Beat Tracker
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
datawhalechina/llm-universe
本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/
espnet/espnet
End-to-End Speech Processing Toolkit
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
lorenmt/mtan
The implementation of "End-to-End Multi-Task Learning with Attention" [CVPR 2019].
spotify-research/cosernn
Code for the paper "Contextual and Sequential User Embeddings for Large-Scale Music Recommendation".
keunwoochoi/music4all_contrib
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
openai/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
ethanjperez/film
FiLM: Visual Reasoning with a General Conditioning Layer
magenta/mt3
MT3: Multi-Task Multitrack Music Transcription
secdr/research-method
论文写作与资料分享