lixikun's Stars
Panxuefeng-loongson/javacpp-presets
The missing Java distribution of native C++ libraries
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.
rhasspy/piper
A fast, local neural text to speech system
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
2noise/ChatEval
Identify speakers with stable voice timbre.
mpetazzoni/sse.js
A flexible Server-Sent Events EventSource polyfill for JavaScript
katspaugh/wavesurfer.js
Audio waveform player
Rikorose/DeepFilterNet
Noise suppression using deep filtering
toly1994328/FlutterUnit
[Flutter Collection Guide App] The unity of Flutter, the unity of coders.
h2database/h2database
H2 is an embeddable RDBMS written in Java.
2noise/ChatTTS
A generative speech model for daily dialogue.
stream-labs/desktop
Free and open source streaming software built on OBS and Electron.
modelscope/FunClip
Open-source, accurate, and easy-to-use video speech recognition and clipping tool, with integrated LLM-based AI clipping.
pika-online/funasr_seaco_paraformer_onnx_with_timestamp
Fixes the bug in funasr where seaco-paraformer has no timestamps after being exported to ONNX
OpenFlutter/fluwx
WeChat SDK for Flutter.
chinue/Fast-SSIM
Fast SSIM and PSNR algorithms for Python, with a 30x speedup for SSIM and 10x for PSNR
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
unit-mesh/auto-dev
🧙AutoDev: The AI-powered coding wizard (AI-driven programming assistant) with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀
ggerganov/llama.cpp
LLM inference in C/C++
facebookresearch/audioseal
Localized watermarking for AI-generated speech audio, with state-of-the-art robustness and a very fast detector
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
leanflutter/window_manager
This plugin allows Flutter desktop apps to resize and reposition the window.
iDvel/rime-ice
Rime configuration: 雾凇拼音 (rime-ice) | A Simplified Chinese dictionary under long-term maintenance
happyDom/dyyRime
This is the Rime input method configuration package that dyy uses personally; testing and corrections are welcome
fabiancelik/rich-voice-editor
Rich Voice Editor: Quill Rich Text Editor Enhancements to support SSML
RoyJames/room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
apache/fury
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
Zz-ww/SadTalker-Video-Lip-Sync
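This project implements Wav2lip video lip-shape synthesis based on SadTalkers. It takes a video file as input and generates speech-driven lip shapes, with a configurable enhancement mode for the facial region that enhances the synthesized lip (face) area and improves the clarity of the generated lips. It uses the DAIN deep-learning frame-interpolation algorithm to fill in frames of the generated video, adding motion transitions for the synthesized lips between frames so that they look smoother, more realistic, and more natural.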