bmilde's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
httpie/cli
🥧 HTTPie CLI — modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more.
spotify/annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
ggerganov/ggml
Tensor library for machine learning
bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
jonaswinkler/paperless-ng
A supercharged version of paperless: scan, index and archive all your physical documents
KoboldAI/KoboldAI-Client
For GGUF support, see KoboldCPP: https://github.com/LostRuins/koboldcpp
azlux/log2ram
ramlog like for systemd (Put log into a ram folder)
state-spaces/s4
Structured state space sequence models
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
bugbakery/audapolis
an editor for spoken-word audio with automatic transcription
divamgupta/stable-diffusion-tensorflow
Stable Diffusion in TensorFlow / Keras
zhanymkanov/fastapi_production_template
FastAPI Template with Docker, Postgres
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
jitsi/jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
qunash/stable-diffusion-2-gui
Lightweight Stable Diffusion v 2.1 web UI: txt2img, img2img, depth2img, inpaint and upscale4x.
shobrook/stackexplain
Explain your error message with ChatGPT
acrogenesis/macchanger
Change your mac address, for OS X
kssteven418/Squeezeformer
[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
vtsynergy/CU2CL
A prototype CUDA-to-OpenCL source-to-source translator, built on the Clang compiler framework
Felflare/rpunct
📝An easy-to-use package to restore punctuation of the text.
embium/solverecaptchas
An async Python library to automate solving ReCAPTCHA v2 using Playwright.
jezs00/pycasso
A system to send AI generated art to an E-Paper display through a Raspberry PI unit
Jeremy-Fuller/Prompts
A sample of prompts from Lexica art