sogris's Stars
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
ufal/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
ahmetoner/whisper-asr-webservice
OpenAI Whisper ASR Webservice API
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
fergiemcdowall/stopword
A module for Node.js and the browser that takes in text and strips it of stopwords
tensorflow/models
Models and examples built with TensorFlow
timesler/facenet-pytorch
Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models
notAI-tech/NudeNet
Lightweight nudity detection
UKPLab/EasyNMT
Easy to use, state-of-the-art Neural Machine Translation for 100+ languages
ofkindness/winston-sql-transport
Winston universal SQL transport for logging
ofkindness/winston-postgres-transport
A Winston transport for PostgreSQL.
ipazc/mtcnn
MTCNN face detection implementation for TensorFlow, as a PIP package.
serengil/tensorflow-101
TensorFlow 101: Introduction to Deep Learning
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
stas-demydiuk/ewpe-smart-mqtt
MQTT bridge for EWPE Smart powered devices
vercel/pkg
Package your Node.js project into an executable
mxbi/youtube8m-2019
5th Place Solution to 3rd YouTube-8M Video Understanding Challenge (Last Top GB Model)
flashlight/wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
spencermountain/compromise
modest natural-language processing
davisking/dlib-models
Trained model files for dlib example programs.
miha-skalic/youtube8mchallenge
1st place solution to Kaggle's 2018 YouTube-8M Video Understanding Challenge
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
surmon-china/videojs-player
@videojs player component for @vuejs(3) and React.
gdiepen/face-recognition
Repository for face recognition related work
davisking/dlib
A toolkit for making real world machine learning and data analysis applications in C++
ageitgey/face_recognition
The world's simplest facial recognition API for Python and the command line