Park323's Stars
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
JaeYeopHan/Interview_Question_for_Beginner
:boy: :girl: Technical-Interview guidelines written for those who started studying programming. I wish you all the best. :space_invader:
brightmart/nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
codertimo/BERT-pytorch
Google AI 2018 BERT pytorch implementation
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
boost-devs/ai-tech-interview
👩‍💻👨‍💻 AI engineer technical interview study group (⭐️ 1k+)
CjangCjengh/vits
VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai
ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
rishikksh20/ViViT-pytorch
Implementation of ViViT: A Video Vision Transformer
DmitryRyumin/ICASSP-2023-24-Papers
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
kiyoungkim1/LMkor
Pretrained Language Models for Korean
rtzr/Awesome-Korean-Speech-Recognition
A list of Korean speech recognition (STT) APIs, with performance benchmarks for each.
daveshap/PlainTextWikipedia
Convert Wikipedia database dumps into plaintext files
kssteven418/Squeezeformer
[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
1ytic/warp-rnnt
CUDA-Warp RNN-Transducer
HamadYA/GhostFaceNets
This repository contains the official implementation of GhostFaceNets, State-Of-The-Art lightweight face recognition models.
pgcorpus/gutenberg
Pipeline to generate the Standardized Project Gutenberg Corpus
jasonppy/PromptingWhisper
Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation
ga642381/Speech-Prompts-Adapters
This repository surveys papers focusing on prompting and adapters for speech processing.
Hazqeel09/ellzaf_ml
Bridging Research and Practice with PyTorch
BladeTransformerLLC/OvercookedGPT
An OpenAI gym environment to evaluate the ability of LLMs (e.g., GPT-4, Claude) in long-horizon reasoning and task planning in dynamic multi-agent settings.
hyeonsangjeon/computing-Korean-STT-error-rates
A Python function package that computes the character error rate (CER) and word error rate (WER) of Korean STT recognizer output transcripts
kurianbenoy/whisper_normalizer
A python package for whisper normalizer
thunlp/SubCharTokenization
haoxiangsnr/llm-tse
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)
ouor/vits
VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai
mispchallenge/MISP-2023-Challenge-Baseline
koreanAI/2023-Korean-AI-Competition
2023 Korean AI Competition
dmlguq456/PIT_CSS
dual-path multi-channel network for speech separation
coalboss/ChatCLR2024-TargetSpeakerLipreading