daxiangpanda

UESTC, Sichuan Province, China

Pinned Repositories

acdemic
Language:Python0 2 00
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python00
android_nfc_book
A repo for code samples in my android-nfc-book
Language:Java0 2 00
assimp
Official Open Asset Import Library Repository. Loads 40+ 3D file formats into one unified and clean data structure.
Language:C++0 2 00
awesome-deep-learning-music
List of articles related to deep learning applied to music
Language:TeX1 1 00
interviewforprogrammers
Language:Python1 2 00
maoyan
Language:Python1 1 00
shiyanba
Language:Python1 2 00
stable-diffusion-webui
Stable Diffusion web UI
Language:Python1 1 00
tacotronv2_wavernn_chinese
Chinese speech synthesis with Tacotron V2 + WaveRNN (TensorFlow + PyTorch)
Language:Python1 1 00

daxiangpanda's Repositories

daxiangpanda/stable-diffusion-webui
Stable Diffusion web UI
Language:Python1 1 00
daxiangpanda/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Python00
daxiangpanda/audiocraft_plus
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Language:Python0 0
daxiangpanda/bark-training-cloning
for training the model
Language:Jupyter Notebook0 0
daxiangpanda/carefree-creator
An AI-powered creator for everyone.
Language:Jupyter Notebook1 0
daxiangpanda/CLAP
Contrastive Language-Audio Pretraining
daxiangpanda/DiffSinger
PyTorch Implementation of DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis (TTS Extension)
Language:Python1 0
daxiangpanda/DiffSinger-1
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Forked and maintained by the OpenVPI community
Language:Python1 0
daxiangpanda/disable-flutter-tls-verification
A Frida script that disables Flutter's TLS verification
Language:C++0 0
daxiangpanda/dream-textures
Stable Diffusion built-in to the Blender shader editor
Language:Python1 0
daxiangpanda/facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Language:Python0 0
daxiangpanda/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
daxiangpanda/Games
Home Page Link:
Language:JavaScript0 0
daxiangpanda/lobe-chat
🤖 Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.
Language:TypeScript0 0
daxiangpanda/MDM
MDM
Language:Python1 0
daxiangpanda/metahuman-stream
Real time interactive streaming digital human
Language:Python0 0
daxiangpanda/midi-js-soundfonts
Pre-rendered General MIDI soundfonts that can be used immediately with MIDI.js
1 0
daxiangpanda/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
Language:Python1 0
daxiangpanda/OpenVoice
Instant voice cloning by MyShell.
Language:Python0 0
daxiangpanda/PaddleSpeech
Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System and End-to-End Speech Simultaneous Translation.
Language:C++1 0
daxiangpanda/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
Language:Jupyter Notebook1 0
daxiangpanda/ppg-vc
PPG-Based Voice Conversion
Language:Python1 0
daxiangpanda/python_template
daxiangpanda/roop
one-click deepfake (face swap)
Language:Python0 0
daxiangpanda/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook1 0
daxiangpanda/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:Python0 0
daxiangpanda/UniAudio
The Open Source Code of UniAudio
Language:Python0 0
daxiangpanda/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python1 0
daxiangpanda/vits
VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai
Language:Python0 0
daxiangpanda/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:C++1 0