wxbool

wxbool's Stars

huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Language:Python127k 1.1k 15k25.2k
ollama/ollama
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
Language:Go72.3k 440 3.1k5.3k
huggingface/candle
Minimalist ML framework for Rust
Language:Rust14k 148 581776
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Language:Jupyter Notebook7k 87 104694
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Language:Python4.1k 59 157515
lich0821/WeChatFerry
Low-level framework for WeChat bots that can connect to large models such as Gemini, ChatGPT, ChatGLM, iFlytek Spark, and Tigerbot. WeChat Robot Hook.
Language:C++2.9k 47 108528
huggingface/parler-tts
Inference and training library for high-quality TTS models.
Language:Python2.7k 45 41281
xinyu1205/recognize-anything
Open-source and strong foundation image recognition models.
Language:Jupyter Notebook2.5k 26 142235
facebookresearch/denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques, applied directly to the raw waveform, that further improve model performance and generalization.
Language:Python1.6k 38 149295
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
Language:Python1.5k 34 109165
pemistahl/lingua-go
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
Language:Go1.1k 11 3464
nihui/realsr-ncnn-vulkan
RealSR super resolution implemented with ncnn library
Language:C1.1k 29 47111
declare-lab/tango
A family of diffusion models for text-to-audio generation.
Language:Python941 25 4272
showlab/Image2Paragraph
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
Language:Python767 11 2851
QPT-Family/QPT
[In closed beta] QPT - A Python to EXE tool (Python packaging) dedicated to helping open-source projects reach the wider internet more easily.
Language:Python699 8 9780
axodox/axodox-machinelearning
This repository contains a pure C++ ONNX implementation of multiple offline AI models, such as StableDiffusion (1.5 and XL), ControlNet, Midas, HED and OpenPose.
Language:C++597 13 1634
muesli/kmeans
k-means clustering algorithm implementation written in Go
Language:Go449 9 1252
gotranspile/cxgo
Tool for transpiling C to Go.
Language:Go272 5 4020
axodox/unpaint
A simple Windows / Xbox app for generating AI images with Stable Diffusion.
Language:C++261 11 4011
KdaiP/StableTTS
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
Language:Python261 26 1227
numediart/EmoV-DB
The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems
Language:Python232 9 520
uthree/tinyvc
a lightweight voice conversion
Language:Python679
jack139/go-infer
Go framework for DL model inference and API deployment
Language:Go50 1 010
chenyangMl/keyword-spot
End-to-end voice wake-up toolbox, from model training to model inference.
Language:Python487
NextAudioGen/ultimatevocalremover_api
API for a Vocal Remover that uses Deep Neural Networks.
Language:Python41 2 95
instant-high/wav2lip-onnx-HQ
Full version of wav2lip-onnx including face alignment and face enhancement and more...
Language:Python286
spkgyk/RTFS-Net
Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024
Language:Python28 1 23
Okrio/tinyrecurrentunet
Real-Time De-noising and De-reverbing with Tiny Recurrent UNet
Language:Python21 1 012
yvonwin/qwen2.cpp
qwen2 and llama3 cpp implementation
Language:C++21 1 40
fxkt-tech/liv
Friendly ffmpeg wrapper for Go.
Language:Go3 1 02