HKAB

I love painting 🎨 and reading 📖

FTechViet Nam

HKAB's Stars

2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python33.4k 191 5833.6k
karpathy/LLM101n
LLM101n: Let's build a Storyteller
30.9k 2.6k 01.7k
roboflow/supervision
We write your reusable computer vision tools. 💜
Language:Python24.6k 162 4531.8k
aristocratos/btop
A monitor of resources
Language:C++22k 117 639670
overleaf/overleaf
A web-based collaborative LaTeX editor
Language:JavaScript14.5k 211 1.1k1.5k
kyutai-labs/moshi
Language:Python7.1k 80 96552
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Language:Python6.5k 54 218568
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
Language:Python3.9k 44 162349
sjpiper145/MakerSkillTree
A repository of Maker Skill Trees and templates to make your own.
Language:Jinja3k 57 18133
ChrisBuilds/terminaltexteffects
TerminalTextEffects (TTE) is a terminal visual effects engine, application, and Python library.
Language:Python3k 13 2756
libAudioFlux/audioFlux
A library for audio and music analysis, feature extraction.
Language:C3k 34 16122
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Language:Python2.7k 30 52185
zhengkyl/qrframe
code-based qr code designer
Language:TypeScript2.6k 5 373
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
Language:Python2k 15 3369
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
Language:Cuda1.9k 32 3291
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Language:Python1.4k 33 8993
pemistahl/lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Language:Python1.2k 10 9145
k2-fsa/icefall
Language:Python974 48 688310
olcf/cuda-training-series
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
Language:Cuda650 18 0236
clu0/unet.cu
UNet diffusion model in pure CUDA
Language:Cuda590 3 027
nyrahealth/CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
Language:Python500 14 1923
google-ai-edge/ai-edge-torch
Supporting PyTorch models with the Google AI Edge TFLite runtime.
Language:Jupyter Notebook410 29 9852
facebookresearch/muavic
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation
Language:Python372 13 2332
microsoft/onnxruntime-training-examples
Examples for using ONNX Runtime for model training.
Language:C#321 44 5462
te42kyfo/gpu-benches
collection of benchmarks to measure basic GPU capabilities
Language:Jupyter Notebook273 9 1141
leduckhai/MultiMed
Multilingual Multitask Multipurpose Medical Speech Recognition
Language:Python91 5 115
CisMine/Guide-NVIDIA-Tools
NVIDIA tools guide
Language:Cuda722
echocatzh/conv-stft
A STFT/iSTFT written up in PyTorch using 1D Convolutions
Language:Python26 2 111
leimao/Nsight-Compute-Docker-Image
Nsight Compute in Docker
Language:Dockerfile11 2 0
tuyen-tran1/VN-SLU
A Vietnamese Spoken Language Understanding
1

HKAB

HKAB's Stars

2noise/ChatTTS

karpathy/LLM101n

roboflow/supervision

aristocratos/btop

overleaf/overleaf

kyutai-labs/moshi

Ucas-HaoranWei/GOT-OCR2.0

FunAudioLLM/SenseVoice

sjpiper145/MakerSkillTree

ChrisBuilds/terminaltexteffects

libAudioFlux/audioFlux

ictnlp/LLaMA-Omni

zhengkyl/qrframe

facebookresearch/schedule_free

HazyResearch/ThunderKittens

QwenLM/Qwen2-Audio

pemistahl/lingua-py

k2-fsa/icefall

olcf/cuda-training-series

clu0/unet.cu

nyrahealth/CrisperWhisper

google-ai-edge/ai-edge-torch

facebookresearch/muavic

microsoft/onnxruntime-training-examples

te42kyfo/gpu-benches

leduckhai/MultiMed

CisMine/Guide-NVIDIA-Tools

echocatzh/conv-stft

leimao/Nsight-Compute-Docker-Image

tuyen-tran1/VN-SLU