shinshoji01

Chinese University of Hong Kong, ShenzhenChina

Pinned Repositories

2D_marker_detection_using_convolutional_layers
It contains 2D marker detection using convolutional layers and pooling layers.
Language:Python0 1 00
AM_with_GAN_for_melspectrogram
This repository is to introduce the application of Activation Maximization for audio-domain data.
Language:Jupyter Notebook1 1 00
gonken-lesson
Lessons provided in Gonsalves Laboratory
Language:Jupyter Notebook2 2 64
GonKen-Lesson_Sho
It contains the lessons I created for Gonsalves AI laboratory.
Language:Jupyter Notebook00
Latent_Conditional_GAN
This repository is to introduce our research, LCGAN.
Language:Jupyter Notebook2 0 00
MacST-project-page
20
research_blog
声フェチ野郎の音声生成録(https://shinshoji01.hatenablog.com/) で紹介してるソースコード
Language:Jupyter Notebook1 0 00
Style-Restricted_GAN
This repository is to introduce our model, Style-Restricted GAN.
Language:Jupyter Notebook9 1 02
Text-Hierarchical-ED
This is an official implementation of our paper published in ICASSP 2024.
6 3 20
text2speech-website
This repository contains the implementation of the website with speech synthesis.
Language:Python00

shinshoji01's Repositories

shinshoji01/Text-Hierarchical-ED
This is an official implementation of our paper published in ICASSP 2024.
6 3 20
shinshoji01/Latent_Conditional_GAN
This repository is to introduce our research, LCGAN.
Language:Jupyter Notebook2 0 00
shinshoji01/MacST-project-page
20
shinshoji01/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language:Jupyter Notebook0 0 00
shinshoji01/dbViz
The official PyTorch implementation - Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective (CVPR'22).
Language:Python0 0 00
shinshoji01/Docker
Language:Dockerfile0 1 00
shinshoji01/AN-SSDT-Demo
Language:HTML
shinshoji01/beaqlejs
*BeaqleJS* provides a framework to create browser based listening tests and is purely based on open web standards like HTML5 and Javascript.
Language:JavaScript1
shinshoji01/emotion2vec
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Language:Python0 0
shinshoji01/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Language:Python
shinshoji01/GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Language:Python0 0
shinshoji01/Hierarchical-ED-Demo
Language:Jupyter Notebook1 0
shinshoji01/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language:Python0 0
shinshoji01/MacST-Demo
Language:HTML1 0
shinshoji01/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Language:Jupyter Notebook0 0
shinshoji01/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
Language:Python0 0
shinshoji01/SECap
Language:Python0 0
shinshoji01/seq2seq-EVC
This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage sequence-to-sequence training.
shinshoji01/seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
Language:Jupyter Notebook0 0
shinshoji01/sho_util
Language:Python1 0
shinshoji01/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Language:Jupyter Notebook
shinshoji01/SpeechGPT
SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities.
shinshoji01/Tacotron-pytorch
Tacotron series TTS model implemented with Pytorch
Language:Python0 0
shinshoji01/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language:Jupyter Notebook0 0
shinshoji01/Text-Sequential-ED-Demo
Language:Jupyter Notebook1 0
shinshoji01/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python0 0
shinshoji01/vall-e
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Language:Python0 0
shinshoji01/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Language:Python0 0
shinshoji01/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Language:Python0 0
shinshoji01/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)
Language:Python0 0