undeadyequ
Deep Learning and speech processing Enthusiasts. Also interesting in ML and real-time processing
@JiaoTong UniversityJapan
Pinned Repositories
CRAFT-pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
espnet
End-to-End Speech Processing Toolkit
FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
FineGrainedImageClassification
forecasting_japanese_election
forecasting_japanese_election_clean
kaldi
This is now the official location of the Kaldi project.
luo_blog
undeadyequ's Repositories
undeadyequ/CRAFT-pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
undeadyequ/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
undeadyequ/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
undeadyequ/espnet
End-to-End Speech Processing Toolkit
undeadyequ/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
undeadyequ/FineGrainedImageClassification
undeadyequ/forecasting_japanese_election
undeadyequ/forecasting_japanese_election_clean
undeadyequ/kaldi
This is now the official location of the Kaldi project.
undeadyequ/luo_blog
undeadyequ/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
undeadyequ/protest-detection-violence-estimation
Implementation of the model used in the paper Protest Activity Detection and Perceived Violence Estimation from Social Media Images (ACM Multimedia 2017)
undeadyequ/protest_issue_classification
A tool for trianing your own protest issue classfication model.
undeadyequ/rosenxuan.github.io
undeadyequ/ser_model
Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)
undeadyequ/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
undeadyequ/tango
Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"
undeadyequ/undeadyequ.github.io
undeadyequ/vim_config
Recommand Vim configuration