TroySK's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Pythagora-io/gpt-pilot
The first real AI developer
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
Aider-AI/aider
aider is AI pair programming in your terminal
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
princeton-nlp/SWE-agent
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.
stackblitz/bolt.new
Prompt, run, edit, and deploy full-stack web applications
mediar-ai/screenpipe
library & platform to build, distribute, monetize ai apps that have the full context (like rewind, granola, etc.), open source, 100% local, developer friendly. 24/7 screen, mic, keyboard recording and control
catdad/canvas-confetti
🎉 performant confetti animation in the browser
easydiffusion/easydiffusion
Easiest 1-click way to create beautiful artwork on your PC using AI, with no tech knowledge. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
nilsherzig/LLocalSearch
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed.
MadcowD/ell
A language model programming library.
asg017/sqlite-vec
A vector search SQLite extension that runs anywhere!
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
Dataherald/dataherald
Interact with your SQL database, Natural Language to SQL using LLMs
ChrisBuilds/terminaltexteffects
TerminalTextEffects (TTE) is a terminal visual effects engine, application, and Python library.
fixie-ai/ultravox
A fast multimodal LLM for real-time voice
fastai/lm-hackers
Hackers' Guide to Language Models
lexbor/lexbor
Lexbor is development of an open source HTML Renderer library. https://lexbor.com
McGill-NLP/webllama
Llama-3 agents that can browse the web by following instructions and talking to you
OpenAutoCoder/Agentless
Agentless🐱: an agentless approach to automatically solve software development problems
transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
mustafaaljadery/lightning-whisper-mlx
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
RafalWilinski/cloudflare-rag
Fullstack "Chat with your PDFs" RAG (Retrieval Augmented Generation) app built fully on Cloudflare
khmyznikov/pwa-install
Installation dialog for Progressive Web Application. Provides a more convenient user experience and fixes the lack of native dialogs in some browsers.
py-pdf/benchmarks
Benchmarking PDF libraries
EriCongMa/awesome-transformer-ocr
This repository is created to share current progress of transformer based optical character recognition(OCR). Welcome to share~