Pinned Repositories
api-samples
Code samples for YouTube APIs, including the YouTube Data API, YouTube Analytics API, and YouTube Live Streaming API. The repo contains language-specific directories that contain the samples.
approximate
Approximate discrete values and numbers
audino
Open source audio annotation tool for humans™
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
audioset_tagging_cnn
audiotsm
A python library for real-time audio time-scale modification procedures
chinese-xinhua
:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。
CMSIS_5
CMSIS Version 5 Development Repository
ConsistencyVC-voive-conversion
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
dabnn
dabnn is an accelerated binary neural networks inference framework for mobile platform
gandolfxu's Repositories
gandolfxu/api-samples
Code samples for YouTube APIs, including the YouTube Data API, YouTube Analytics API, and YouTube Live Streaming API. The repo contains language-specific directories that contain the samples.
gandolfxu/approximate
Approximate discrete values and numbers
gandolfxu/audino
Open source audio annotation tool for humans™
gandolfxu/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
gandolfxu/audioset_tagging_cnn
gandolfxu/chinese-xinhua
:orange_book: 中华新华字典数据库。包括歇后语,成语,词语,汉字。
gandolfxu/CMSIS_5
CMSIS Version 5 Development Repository
gandolfxu/ConsistencyVC-voive-conversion
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
gandolfxu/dabnn
dabnn is an accelerated binary neural networks inference framework for mobile platform
gandolfxu/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
gandolfxu/digital-table-of-general-standard-chinese-characters
Digitalization of the Table of General Standard Chinese Characters
gandolfxu/ERNN-for-speech-enhancement
gandolfxu/espnet
End-to-End Speech Processing Toolkit
gandolfxu/face_recognition
The world's simplest facial recognition api for Python and the command line
gandolfxu/faiss
A library for efficient similarity search and clustering of dense vectors.
gandolfxu/fmath
fast log and exp functions for x86/x64 SSE
gandolfxu/jukebox
Code for the paper "Jukebox: A Generative Model for Music"
gandolfxu/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
gandolfxu/kaldi-io-for-python
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
gandolfxu/musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
gandolfxu/neural_sp
End-to-end ASR/LM implementation with PyTorch
gandolfxu/pinyin-data
汉字拼音数据
gandolfxu/rhubarb-lip-sync
Rhubarb Lip Sync is a command-line tool that automatically creates 2D mouth animation from voice recordings. You can use it for characters in computer games, in animated cartoons, or in any other project that requires animating mouths based on existing recordings.
gandolfxu/rnnoise
Recurrent neural network for audio noise reduction
gandolfxu/so-vits-svc
SoftVC VITS Singing Voice Conversion
gandolfxu/speech
A PyTorch Implementation of End-to-End Models for Speech-to-Text
gandolfxu/TextParser
TTS Chinese and English text analysis
gandolfxu/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
gandolfxu/vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
gandolfxu/wenet
Transformer based ASR Engine.