maboa
Web Developer - Project Manager - Media Technologist
Happyworm / Hyperaudio / BIF / TheirStoryFlorence, Italy
maboa's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Vaibhavs10/insanely-fast-whisper
goenning/google-indexing-script
Script to get your site indexed on Google in less than 48 hours
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
jplayer/jPlayer
jPlayer : HTML5 Audio & Video for jQuery
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
NUKnightLab/TimelineJS3
TimelineJS v3: A Storytelling Timeline built in JavaScript. http://timeline.knightlab.com
cubiq/ComfyUI_InstantID
yeates/PromptFix
[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo
pyannote/pyannote-video
Face detection, tracking and clustering in videos
bugbakery/transcribee
open source audio and video transcription software
federicotorrielli/BetterWhisperX
Better WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
hyperaudio/hyperaudio-lite
Hyperaudio Lite - a Super-lightweight Interactive Transcript Player
theirstory/gliner-spacy
A spaCy wrapper for GliNER
smlum/scription
An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech
arseneyr/wasm-media-encoders
MP3 and Ogg Vorbis encoders for the browser and Node
ablwr/media-collection-viewer
visualizations/charts for media collections, based on mediainfo
bbc/stt-align-node
node version of stt-align https://github.com/bbc/stt-align by Chris Baume - R&D.
hyperaudio/wordpress-hyperaudio
a wordpress plugin to display interactive transcripts
Digital-Creativity-Labs/CuttingRoom-v0.0
Cutting Room is a plugin for Unity developed by Digital Creativity Labs which allows creators to build and deploy Object Based Media productions.
wjbmattingly/weaviate-filter
A package for creating GraphQL filters for Weaviate
BadIdeaFactory/skyppy
speech is overrated - skyp it
OpenEditor/openeditor
An application that allows creation and correction of automated transcriptions of media.
theirstory/spacy-whisper
This is a spaCy pipeline that takes a Whisper output and builds a spaCy Doc container
theirstory/timedtext-player
hyperaudio/wordpress-hyperaudio-pro
A repo for the pro version of the Hyperaudio Wordpress plugin