akshatdewan's Stars
wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
ina-foss/inaSpeechSegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
thuhcsi/FlatTN
Chinese Text Normalization and Dataset
monatis/label-snd
Easily label sound datasets!
monatis/asr-annotation-bot
Simple Telegram bot to annotate and varify automatic speech recognition datasets
CAMeL-Lab/camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.
yuboona/Chinese-Punctuation-Restoration-with-Bert-CNN-RNN
A Bert-CNN-LSTM model for punctuation restoration
nkrnrnk/BertPunc
SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model
lightly-ai/lightly
A python library for self-supervised learning on images.
facebookresearch/libri-light
dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.
flashlight/flashlight
A C++ standalone library for machine learning
eldar/deepcut
Multi Person Pose Estimation
DeepLabCut/DeepLabCut
Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
mdn/web-dictaphone
A sample MDN app that uses getUserMedia and MediaRecorder API for recording audio snippets, and The Web Audio API for visualizations.
mosaicml/composer
Supercharge Your Model Training
flagist0/reverso_context_api
Simple Python API for Reverso Context
SpeechColab/GigaSpeech
Large, modern dataset for speech recognition
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
bitextor/bitextor
Bitextor generates translation memories from multilingual websites
google/sparrowhawk
speechio/chinese_text_normalization
Chinese text normalization for speech processing
stanford-crfm/mistral
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.
soshial/text-normalization
Python tool for normilizing text and text canonicalization (DISCONTINUED)
312shan/Text-Normalization-in-pyTorch
pyTorch implementation for Text Normalization Challenge
EFord36/normalise
A module for normalising text.
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
facebookresearch/AugLy
A data augmentations library for audio, image, text, and video.