jadedgnome's Stars
abi/screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
VAST-AI-Research/TripoSR
lkwq007/stablediffusion-infinity
Outpainting with Stable Diffusion on an infinite canvas
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
OpenCodeInterpreter/OpenCodeInterpreter
OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. It significantly enhances code generation capabilities by integrating execution and iterative refinement functionalities.
nodiscc/awesome-linuxaudio
[mirror] A list of software and resources for professional audio/video/live events production on Linux.
KoljaB/RealtimeTTS
Converts text to speech in realtime
sindresorhus/awesome-whisper
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
pluja/whishper
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Jieyab89/OSINT-Cheat-sheet
OSINT cheat sheet, list OSINT tools, dataset, article, book and OSINT tips
ComfyWorkflows/ComfyUI-Launcher
Run any ComfyUI workflow w/ ZERO setup.
AndrewVeee/nucleo-ai
An AI assistant beyond the chat box.
DolbyIO/awesome-audio
A curated list of awesome audio technology resources for developers
taesiri/ArXivQA
WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/)
NaomiProject/Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
ModuleArt/quick-screen-recorder
Lightweight desktop screen recorder for Windows.
tomchang25/whisper-auto-transcribe
Auto transcribe tool based on whisper
WiNE-iNEFF/Simple_Prompt_Generator
Simple prompt generator for Midjourney, DALLe, Stable and Disco Diffusion, and etc.
lextrack/Simple-Screen-Recorder
Simple and easy-to-use screen recorder for Windows. With a built-in file merge tool.
microsoft/simulated-trial-and-error
bnsantoso/sub-to-audio
Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.
VaibhavCodeClub/learn
Learning app for kids
TrelisResearch/one-click-llms
One click templates for inferencing Language Models
Alex313031/cog-chromium
Cog Chrome App - Fork with more info, changed colours, manifest update, and Chromium name :)
zeke/all-the-public-replicate-models
Metadata for all the public models on Replicate, bundled up into an npm package.
reekystive/puppeteer-tab-recorder
Record browser tabs with audio.
CodingTrain/coding-train-transcripts
A project to collect transcripts from Coding Train videos