gandolfxu

Baidu, Beijing

Pinned Repositories

api-samples
Code samples for YouTube APIs, including the YouTube Data API, YouTube Analytics API, and YouTube Live Streaming API. The repo contains language-specific directories that contain the samples.
Language: Java
approximate
Approximate discrete values and numbers
Language: Haskell
audino
Open source audio annotation tool for humans™
Language: JavaScript
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
Language: Python
audioset_tagging_cnn
Language: Python
audiotsm
A Python library for real-time audio time-scale modification procedures
Language: Python
chinese-xinhua
Chinese Xinhua Dictionary database, including xiehouyu (two-part allegorical sayings), idioms, words, and characters.
Language: Python
CMSIS_5
CMSIS Version 5 Development Repository
Language: C
ConsistencyVC-voive-conversion
Uses a jointly trained speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
Language: Python
dabnn
dabnn is an accelerated binary neural network inference framework for mobile platforms
Language: C++

gandolfxu's Repositories

gandolfxu/api-samples
Code samples for YouTube APIs, including the YouTube Data API, YouTube Analytics API, and YouTube Live Streaming API. The repo contains language-specific directories that contain the samples.
Language: Java
gandolfxu/approximate
Approximate discrete values and numbers
Language: Haskell
gandolfxu/audino
Open source audio annotation tool for humans™
Language: JavaScript
gandolfxu/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
Language: Python
gandolfxu/audioset_tagging_cnn
Language: Python
gandolfxu/chinese-xinhua
Chinese Xinhua Dictionary database, including xiehouyu (two-part allegorical sayings), idioms, words, and characters.
Language: Python
gandolfxu/CMSIS_5
CMSIS Version 5 Development Repository
Language: C
gandolfxu/ConsistencyVC-voive-conversion
Uses a jointly trained speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
Language: Python
gandolfxu/dabnn
dabnn is an accelerated binary neural network inference framework for mobile platforms
Language: C++
gandolfxu/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
Language: Python
gandolfxu/digital-table-of-general-standard-chinese-characters
Digitalization of the Table of General Standard Chinese Characters
gandolfxu/ERNN-for-speech-enhancement
gandolfxu/espnet
End-to-End Speech Processing Toolkit
gandolfxu/face_recognition
The world's simplest facial recognition API for Python and the command line
Language: Python
gandolfxu/faiss
A library for efficient similarity search and clustering of dense vectors.
gandolfxu/fmath
Fast log and exp functions for x86/x64 SSE
gandolfxu/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
Language: Python
gandolfxu/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Language: Shell
gandolfxu/kaldi-io-for-python
Python functions for reading Kaldi data formats. Useful for rapid prototyping with Python.
Language: Python
gandolfxu/musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in PyTorch
gandolfxu/neural_sp
End-to-end ASR/LM implementation with PyTorch
gandolfxu/pinyin-data
Pinyin data for Chinese characters
Language: Python
gandolfxu/rhubarb-lip-sync
Rhubarb Lip Sync is a command-line tool that automatically creates 2D mouth animation from voice recordings. You can use it for characters in computer games, in animated cartoons, or in any other project that requires animating mouths based on existing recordings.
gandolfxu/rnnoise
Recurrent neural network for audio noise reduction
gandolfxu/so-vits-svc
SoftVC VITS Singing Voice Conversion
gandolfxu/speech
A PyTorch Implementation of End-to-End Models for Speech-to-Text
Language: Python
gandolfxu/TextParser
TTS Chinese and English text analysis
gandolfxu/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
gandolfxu/vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Language: Python
gandolfxu/wenet
Transformer based ASR Engine.