Chiuqyan's Stars
CoinCheung/pytorch-loss
label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful
xserrat/docker-facebook-demucs
Dockerized Facebook Demucs library, packaged to make it easy to run
zhenye234/CoMoSpeech
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
kinglegendzzh/chordPrediction
Music composition tool (chord prediction algorithm based on Markov chains)
liusongxiang/Large-Audio-Models
Keeps track of large models in the audio domain, including speech, singing, music, etc.
mimbres/YourMT3
multi-task and multi-track music transcription for everyone
yamathcy/ISMIR-2024-Papers
yamathcy/music-deeplearning-japanese
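Deep Learning × Music Information Processing study group @ University of Tsukuba, Human and Sound Informatics Laboratory (人と音の情報学研究室)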
yamathcy/awesome-music-informatics
A curated list of awesome articles, tutorials, libraries, webpages, etc.
JunyaoHu/common_metrics_on_video_quality
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
RickyL-2000/ROSVOT
Robust Singing Voice Transcription and MIDI Extraction
openvpi/SOME
SOME: Singing-Oriented MIDI Extractor.
bloodraven66/aai_pta_transformers
sanderwood/melodyt5
MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]
varyshare/easy_slam_tutorial
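The first Chinese-language tutorial on the theory and practice of implementing visual SLAM from scratch, written in Python. Covers ORB feature point extraction, epipolar geometry, visual odometry back-end optimization, and real-time 3D map reconstruction. An easy, practical SLAM tutorial (Python). Also covers image processing and Otsu binarization. More tutorials are available on my CSDN blog.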
scaperot/the-BPM-detector-python
BPM detection for audio files (currently just .wav). Takes in the whole file, and prints out the BPM.
Music-and-Culture-Technology-Lab/omnizart
Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.
hartmetzls/audio_to_midi
A CNN which converts piano audio to a simplified MIDI format
DamRsn/NeuralNote
Audio Plugin for Audio to MIDI transcription using deep learning.
FudanDISC/DISC-LawLLM
DISC-LawLLM, an intelligent legal system utilizing large language models (LLMs) to provide a wide range of legal services
ORI-Muchim/AudioSR-Upsampling
AudioSR-Upsampling (any -> 48kHz)
jbhuang0604/awesome-tips
fancyfrees/Jobs
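IT headhunter fancyfrees. My WeChat: fancyfrees; if you are interested in working together, please add me on WeChat. We mostly work directly with PE/VC firms (for example: Matrix Partners China (经纬), IDG, GGV, Trustbridge Partners (挚信资本), 钟鼎创投, etc.), and we help recruit for some of their portfolio companies, especially the reliable ones expected to go public within the next few years. Examples: DiDi, Mogujie, Ele.me, Momo, Papaya Mobile, Liulishuo (英语流利说), etc. We focus on IT and internet technology positions. Technical areas: Java/Python/Ruby on Rails/PHP/Node.js/JavaScript/HTML/QA/iOS/Android/NLP, algorithms, anti-cheating algorithms, big data, artificial intelligence, machine learning, audio and video algorithms, cloud computing, etc.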
ddlBoJack/Speech-Resources
Labs, companies, resources, internships, etc. in the speech field; recommendations and self-nominations are welcome
Dsqvival/hierarchical-structure-analysis
Algorithm and Data for paper "Automatic Detection of Hierarchical Structure and Influence of Structure on Melody, Harmony and Rhythm in Popular Music"
microsoft/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
CJLU-source/Synthesizer-V-FE
Synthesizer V Free Editor
ckycky3/CMT-pytorch
Chord-Conditioned Melody Transformer
stakira/OpenUtau
Open singing synthesis platform / Open source UTAU successor