dzcmingdi's Stars
open-mmlab/PowerPaint
[ECCV 2024] PowerPaint, a high-quality, versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model.
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
cvat-ai/datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
deepfakes/faceswap
Deepfakes Software For All
microsoft/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
google/visqol
Perceptual Quality Estimator for speech and audio
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
floodsung/Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech!
hojonathanho/diffusion
Denoising Diffusion Probabilistic Models
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
aliutkus/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
CareyWang/sub-web
tindy2013/subconverter
Utility to convert between various subscription formats
rhasspy/piper
A fast, local neural text to speech system
espeak-ng/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
massgravel/Microsoft-Activation-Scripts
Open-source Windows and Office activator featuring HWID, Ohook, KMS38, and Online KMS activation methods, along with advanced troubleshooting.
TensorSpeech/TensorFlowTTS
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
pytorch/serve
Serve, optimize and scale PyTorch models in production
xinghaochen/awesome-hand-pose-estimation
Awesome work on hand pose estimation/tracking
BobLiu20/YOLOv3_PyTorch
Full implementation of YOLOv3 in PyTorch
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
SteamDatabase/GameTracking-Dota2
📥 Game Tracker: Dota 2
tankvn/videojs-total-skin
Nice skin for VideoJS