Pinned Repositories
contentvec
speech self-supervised representations
SUPIR
SUPIR aims to develop practical algorithms for photo-realistic image restoration in the wild. An online demo is also available at suppixel.ai.
whisper.cpp
Port of OpenAI's Whisper model in C/C++
tensorflowbook
Source code for each chapter of the TensorFlow tutorial
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
egy-arb-dialect-id
Egyptian / Modern Standard Arabic language identification system
VI-SVS
Singing Voice Synthesis based on VITS, different from VISinger
LabelMakr
A GUI toolkit for SVS label generation. Relies heavily on SOFA & Whisper to generate HTK-style force-aligned labels, with a focus on singing.
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
msaf
Music Structure Analysis Framework
li-henan's Repositories
li-henan/tensorflowbook
Source code for each chapter of the TensorFlow tutorial