szhaomsft

Pinned Repositories

AD-NeRF
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".
Language:Python0 1 00
api-ai-english-asr-model
Api.ai English Speech Recognition (ASR) Model for Kaldi
0 1 00
asr-server
FastCGI support for Kaldi ASR
Language:C++0 1 00
Automatic-Prosody-Annotation
Language:Python0 0 00
Awesome-Chatbot
Awesome Chatbot Projects,Corpus,Papers,Tutorials.
Language:Python0 1 00
azure-docs
Open source documentation of Microsoft Azure
Language:PowerShell0 1 00
bark
🚀 BARK INFINITY 🎶 Power Up The Bark Text-prompted Generative Audio Model
Language:Python0 0 00
Boost-for-Android
Android port of Boost C++ Libraries
Language:C++0 1 00
CNTK
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
Language:C++0 1 00
Cognitive-Speech-SRSample
Community samples to use Cognitive SR services
Language:C#3 1 23

szhaomsft's Repositories

szhaomsft/Cognitive-Speech-SRSample
Community samples to use Cognitive SR services
Language:C#3 1 23
szhaomsft/AD-NeRF
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".
Language:Python0 1 00
szhaomsft/Automatic-Prosody-Annotation
Language:Python0 0 00
szhaomsft/azure-docs
Open source documentation of Microsoft Azure
Language:PowerShell0 1 00
szhaomsft/bark
🚀 BARK INFINITY 🎶 Power Up The Bark Text-prompted Generative Audio Model
Language:Python0 0 00
szhaomsft/Boost-for-Android
Android port of Boost C++ Libraries
Language:C++0 1 00
szhaomsft/concentus.oggfile
Implementing support for reading/writing .opus audio files using Concentus
Language:C#0 1 00
szhaomsft/contextualLoss
The Contextual Loss
Language:Python1 0
szhaomsft/corert
This repo contains CoreRT, a .NET Core runtime optimized for AOT (ahead of time compilation) scenarios, with the accompanying compiler toolchain.
Language:C#1 0
szhaomsft/CortanaSkillsKit
Create with the Cortana Skills Kit.
Language:PowerShell1 0
szhaomsft/crepe
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)
Language:Python0 0
szhaomsft/dotnet-docs-samples
.NET code samples used on https://cloud.google.com
szhaomsft/LruCacheNet
A fast, generic, thread-safe Least Recently Used (LRU) cache for .NET Standard.
Language:C#1 0
szhaomsft/NAudio
Audio and MIDI library for .NET
Language:C#
szhaomsft/noi
Language:C++2 0
szhaomsft/obfuscar
Open source obfuscation tool for .NET assemblies
Language:C#1 0
szhaomsft/OpenTN
open source text normalizer
2 0
szhaomsft/pinyin4net
pinyin4net is a .net library supporting convertion between Chinese characters and Pinyin systems.
Language:C#1 0
szhaomsft/protobuf-android
port protobuf to android
Language:C++1 0
szhaomsft/SadTalker
（CVPR 2023）SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language:Python0 0
szhaomsft/stable-diffusion-webui
Stable Diffusion web UI
szhaomsft/tensorflow
Computation using data flow graphs for scalable machine learning
Language:C++1 0
szhaomsft/testdata
szhaomsft/torchcrepe
Pytorch implementation of the CREPE pitch tracker
Language:Python0 0
szhaomsft/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
Language:Python0 0
szhaomsft/tts-scores
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
Language:Python0 0
szhaomsft/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
Language:Python0 0
szhaomsft/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
szhaomsft/voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
Language:Python0 0
szhaomsft/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python0 0