cjw414

Korea UniversitySeoul, Korea

cjw414's Stars

huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Language:Python3.5k267
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Language:Python19k2.6k
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python132k26.2k
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language:Python15.7k1.5k
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
Language:Jupyter Notebook7.6k465
HeegyuKim/open-korean-instructions
언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.
Language:Python33124
huggingface/community-events
Place where folks can contribute to 🤗 community events
Language:Jupyter Notebook395101
sindresorhus/awesome-whisper
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
1.2k56
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
Language:Python17.3k1.3k
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook46.5k5.5k
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python8.5k1.4k
Beomi/KoAlpaca
KoAlpaca: 한국어 명령어를 이해하는 오픈소스 언어모델
Language:Jupyter Notebook1.5k240
FFmpeg/FFmpeg
Mirror of https://git.ffmpeg.org/ffmpeg.git
Language:C44.8k12k
chirlu/sox
SoX, Swiss Army knife of sound processing
Language:C689112
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Language:Python1.8k186
facebookresearch/AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
Language:Python52044
sarulab-speech/jtubespeech
Language:Python20846
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language:Python3.9k387
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook5.9k751
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
Language:Python81.8k6.4k
lumaku/ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
Language:Python31829
shirayu/whispering
Streaming transcriber with whisper
Language:Python68653
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language:Python67.1k7.9k
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language:Python2.2k479
georgian-io/Knowledge-Distillation-Toolkit
:no_entry: [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.
Language:Python13625
ICT-VGL/ICT-FaceKit
ICT's Vision and Graphics Lab's morphable face model and toolkit
Language:Python641111
microsoft/Deep3DFaceReconstruction
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)
Language:Python2.2k441
spite/FaceMeshFaceGeometry
FaceMeshFaceGeometry for FaceMesh
Language:JavaScript40063
CMU-Perceptual-Computing-Lab/MonocularTotalCapture
Code for CVPR19 paper "Monocular Total Capture: Posing Face, Body and Hands in the Wild"
Language:C++659121
TimoBolkart/voca
This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.
Language:Python1.1k271

cjw414

cjw414's Stars

huggingface/distil-whisper

huggingface/datasets

huggingface/transformers

huggingface/peft

01-ai/Yi

HeegyuKim/open-korean-instructions

huggingface/community-events

sindresorhus/awesome-whisper

Anjok07/ultimatevocalremovergui

facebookresearch/segment-anything

speechbrain/speechbrain

Beomi/KoAlpaca

FFmpeg/FFmpeg

chirlu/sox

iver56/audiomentations

facebookresearch/AudioMAE

sarulab-speech/jtubespeech

snakers4/silero-vad

pyannote/pyannote-audio

yt-dlp/yt-dlp

lumaku/ctc-segmentation

shirayu/whispering

openai/whisper

s3prl/s3prl

georgian-io/Knowledge-Distillation-Toolkit

ICT-VGL/ICT-FaceKit

microsoft/Deep3DFaceReconstruction

spite/FaceMeshFaceGeometry

CMU-Perceptual-Computing-Lab/MonocularTotalCapture

TimoBolkart/voca