cjw414's Stars
huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
huggingface/datasets
๐ค The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
huggingface/transformers
๐ค Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
huggingface/peft
๐ค PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
HeegyuKim/open-korean-instructions
์ธ์ด๋ชจ๋ธ์ ํ์ตํ๊ธฐ ์ํ ๊ณต๊ฐ ํ๊ตญ์ด instruction dataset๋ค์ ๋ชจ์๋์์ต๋๋ค.
huggingface/community-events
Place where folks can contribute to ๐ค community events
sindresorhus/awesome-whisper
๐ Awesome list for Whisper โ an open-source AI-powered speech recognition system developed by OpenAI
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Beomi/KoAlpaca
KoAlpaca: ํ๊ตญ์ด ๋ช ๋ น์ด๋ฅผ ์ดํดํ๋ ์คํ์์ค ์ธ์ด๋ชจ๋ธ
FFmpeg/FFmpeg
Mirror of https://git.ffmpeg.org/ffmpeg.git
chirlu/sox
SoX, Swiss Army knife of sound processing
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
facebookresearch/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
sarulab-speech/jtubespeech
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
lumaku/ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
shirayu/whispering
Streaming transcriber with whisper
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
georgian-io/Knowledge-Distillation-Toolkit
:no_entry: [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.
ICT-VGL/ICT-FaceKit
ICT's Vision and Graphics Lab's morphable face model and toolkit
microsoft/Deep3DFaceReconstruction
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)
spite/FaceMeshFaceGeometry
FaceMeshFaceGeometry for FaceMesh
CMU-Perceptual-Computing-Lab/MonocularTotalCapture
Code for CVPR19 paper "Monocular Total Capture: Posing Face, Body and Hands in the Wild"
TimoBolkart/voca
This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.