Pinned Repositories
AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
espnet
End-to-End Speech Processing Toolkit
CALL-proto
ESPNet_asr_egs
Public examples for ESPnet2 demonstration
Interspeech2024_DiscreteSpeechChallenge
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
Music-Project
Project for music
subjective-eval-sample-generate
For subjective evaluation sample preparation
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Muskits
An open-source music processing toolkit
ftshijt's Repositories
ftshijt/Interspeech2024_DiscreteSpeechChallenge
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
ftshijt/subjective-eval-sample-generate
For subjective evaluation sample preparation
ftshijt/speech_evaluation
A toolkit dedicated to speech evaluation.
ftshijt/ESPNet_asr_egs
Public examples for ESPnet2 demonstration
ftshijt/dscore
Diarization scoring tools.
ftshijt/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
ftshijt/ds-baseball
ftshijt/CSrankings
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
ftshijt/DL21_samples
ftshijt/espnet
End-to-End Speech Processing Toolkit
ftshijt/ESPnet-ST-v2-examples
ftshijt/espnet_model_zoo
ESPnet Model Zoo
ftshijt/ESPnet_st_egs
Public examples for ESPnet2 demonstration
ftshijt/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
ftshijt/ftshijt
ftshijt/LibriMix
An open source dataset for source separation
ftshijt/Muskits
An open-source music processing toolkit
ftshijt/newlang-tech
A guide to building language technology in new languages.
ftshijt/notebook
ftshijt/openslr
Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository
ftshijt/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
ftshijt/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
ftshijt/PublicLectureSlides
ftshijt/Puebla_Nahuatl_Split
ftshijt/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
ftshijt/shinjiwlab.github.io
ftshijt/SimulEval
SimulEval: A General Evaluation Toolkit for Simultaneous Translation
ftshijt/svs_demo
ftshijt/SVS_system
A system for singing voice synthesis
ftshijt/Totonac_Split