sinhat98

I'm an ML engineer in Japan. My interests are Deep Learning, Speech Processing, and Spoken Dialogue Systems.

CyberAgent, Inc.Tokyo, Sibuya

Pinned Repositories

adapter-wavlm
Language:Python42 3 88
Aivis
💠 Aivis: AI Voice Imitation System
Language:Python0 0 00
DialogueMock
Language:Python0 1 00
distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language:Python00
espnet
End-to-End Speech Processing Toolkit
Language:Python00
fastapi-beginner
Language:Python0 1 00
go-basics
🐳 Go basic lesson
Language:Go00
icefall
Language:Python00
nishika-competition
nishikaコンペの再現コード
Language:Python2 1 00
SERwithWavLM
Language:Python1 1 00

sinhat98's Repositories

sinhat98/adapter-wavlm
Language:Python42 3 88
sinhat98/nishika-competition
nishikaコンペの再現コード
Language:Python2 1 00
sinhat98/SERwithWavLM
Language:Python1 1 00
sinhat98/Aivis
💠 Aivis: AI Voice Imitation System
Language:Python0 0 00
sinhat98/DialogueMock
Language:Python0 1 00
sinhat98/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language:Python00
sinhat98/espnet
End-to-End Speech Processing Toolkit
Language:Python00
sinhat98/fastapi-beginner
Language:Python0 1 00
sinhat98/go-basics
🐳 Go basic lesson
Language:Go00
sinhat98/icefall
Language:Python00
sinhat98/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Language:Python00
sinhat98/llm-endpoint
Language:Python0 1 00
sinhat98/python-dev
Language:Jupyter Notebook0 1 00
sinhat98/sherpa-onnx
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter
Language:C++00
sinhat98/skills-secure-code-game
My clone repository
Language:Python0 1 00
sinhat98/VGGFace2-pytorch
PyTorch Face Recognizer based on 'VGGFace2: A dataset for recognising faces across pose and age'
Language:Python0 0 00
sinhat98/moshi
A modern JSON library for Kotlin and Java.
sinhat98/Style-Bert-VITS2
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.
Language:Python0 0

sinhat98

Pinned Repositories

adapter-wavlm

Aivis

DialogueMock

distil-whisper

espnet

fastapi-beginner

go-basics

icefall

nishika-competition

SERwithWavLM

sinhat98's Repositories

sinhat98/adapter-wavlm

sinhat98/nishika-competition

sinhat98/SERwithWavLM

sinhat98/Aivis

sinhat98/DialogueMock

sinhat98/distil-whisper

sinhat98/espnet

sinhat98/fastapi-beginner

sinhat98/go-basics

sinhat98/icefall

sinhat98/LLaMA-Omni

sinhat98/llm-endpoint

sinhat98/python-dev

sinhat98/sherpa-onnx

sinhat98/skills-secure-code-game

sinhat98/VGGFace2-pytorch

sinhat98/moshi

sinhat98/Style-Bert-VITS2