ZihanJin
Daytime programmer nighttime linguist | University of Edinburgh SLP | Machine Learning Engineer at Resemble AI | TTS | NLP
Resemble AI Toronto
ZihanJin's Stars
jlevy/the-art-of-command-line
Master the command line, in one page
binhnguyennus/awesome-scalability
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
faif/python-patterns
A collection of design patterns/idioms in Python
GokuMohandas/Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
norvig/pytudes
Python programs, usually short, of considerable difficulty, to perfect particular skills.
ManimCommunity/manim
A community-maintained Python framework for creating mathematical animations.
fishaudio/fish-speech
Brand new TTS solution
microsoft/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
holyshell/Books
Some special ebooks
NVIDIA/BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
PacktPublishing/Python-for-Finance-Cookbook
Python for Finance Cookbook, published by Packt
google/visqol
Perceptual Quality Estimator for speech and audio
SamuraiT/mecab-python3
:snake: mecab-python. you can find original version here:http://taku910.github.io/mecab/
MuQiuJun-AI/bert4pytorch
超轻量级bert的pytorch版本,大量中文注释,容易修改结构,持续更新
HLTSingapore/Emotional-Speech-Data
This is the GitHub page for publicly available emotional speech data.
feizc/MLE-LLaMA
Multi-language Enhanced LLaMA
quadrismegistus/prosodic
Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.
Kyubyong/g2pK
g2pK: g2p module for Korean
TEAMuP-dev/audacitorch
PyTorch wrappers for using your model in audacity!
nsu-ai/russian_g2p
Accentor and transcriptor for Russian language
typst/hypher
Separates words into syllables.
SMART-TTS/SMART-G2P
teddykoker/cryptopunks-gan
Simple SN-GAN to generate CryptoPunks
ErikEkstedt/TurnGPT
TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog
ErikEkstedt/VoiceActivityProjection
Voice Activity Projection Models: Self-supervised learning of Turn-taking Events
iisys-hof/HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repository it is possible to automatically recreate the dataset. It is also possible to add more speakers to the processing pipeline.
Laurence-Cullen/cuneiform
Machine translation and word embeddings of cuneiform corpuses