li-henan

Pinned Repositories

contentvec
speech self-supervised representations
Language:Python467 11 3037
SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Language:Python4.3k 67 142379
whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C++35.4k 312 1.4k3.6k
tensorflowbook
tensorflow教程每个章节的源码
Language:Python0 0 00
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python12.1k 137 7091.3k
egy-arb-dialect-id
Egyptian / Modern Standard Arabic language identification system
Language:Python6 4 14
VI-SVS
Singing Voice Synthesis based on VITS, different from VISinger
Language:Python187 8 1431
LabelMakr
A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singing.
Language:Python27 2 57
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language:Jupyter Notebook5.2k 62 390337
msaf
Music Structure Analysis Framework
Language:Python498 24 9179