bmilde

bmilde's Stars

AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python141k 1.1k 7.6k26.6k
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C34.9k 313 1.3k3.6k
httpie/cli
🥧 HTTPie CLI — modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more.
Language:Python33.6k 85 8763.7k
spotify/annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Language:C++13.2k 318 3981.2k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language:Python11.7k 206 2.3k2.4k
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language:Python11k 183 1.9k1.8k
ggerganov/ggml
Tensor library for machine learning
Language:C++11k 127 4071k
bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Language:Python9.1k 92 201513
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python8.7k 133 1.1k1.4k
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Language:Python7.7k 143 47666
jonaswinkler/paperless-ng
A supercharged version of paperless: scan, index and archive all your physical documents
Language:Python5.4k 53 671356
KoboldAI/KoboldAI-Client
For GGUF support, see KoboldCPP: https://github.com/LostRuins/koboldcpp
Language:Python3.5k 68 275749
azlux/log2ram
ramlog like for systemd (Put log into a ram folder)
Language:Shell2.6k 60 170193
state-spaces/s4
Structured state space sequence models
Language:Jupyter Notebook2.4k 52 134285
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language:Python2.2k 44 397483
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
Language:Python1.9k 39 43167
bugbakery/audapolis
an editor for spoken-word audio with automatic transcription
Language:TypeScript1.7k 26 25138
divamgupta/stable-diffusion-tensorflow
Stable Diffusion in TensorFlow / Keras
Language:Python1.6k 25 49227
zhanymkanov/fastapi_production_template
FastAPI Template with Docker, Postgres
Language:Python993 14 20136
sooftware/conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Language:Python946 9 37174
jitsi/jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Language:Python614 15 4694
qunash/stable-diffusion-2-gui
Lightweight Stable Diffusion v 2.1 web UI: txt2img, img2img, depth2img, inpaint and upscale4x.
Language:Jupyter Notebook607 14 1973
shobrook/stackexplain
Explain your error message with ChatGPT
Language:Python516 5 431
acrogenesis/macchanger
Change your mac address, for OS X
Language:Ruby319 13 1443
kssteven418/Squeezeformer
[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Language:Python246 15 419
vtsynergy/CU2CL
A prototype CUDA-to-OpenCL source-to-source translator, built on the Clang compiler framework
Language:C++189 18 932
Felflare/rpunct
📝An easy-to-use package to restore punctuation of the text.
Language:Python107 4 670
embium/solverecaptchas
An async Python library to automate solving ReCAPTCHA v2 using Playwright.
Language:Python104 4 824
jezs00/pycasso
A system to send AI generated art to an E-Paper display through a Raspberry PI unit
Language:Python73 8 656
Jeremy-Fuller/Prompts
A sample of prompts from Lexica art
71 6 014