Arbitrate3280's Stars
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
deedy5/duckduckgo_search
Search for words, documents, images, videos, news, maps and text translation using the DuckDuckGo.com search engine. Downloading files and images to a local hard drive.
CatchTheTornado/pdf-extract-api
Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
coollabsio/coolify
An open-source & self-hostable Heroku / Netlify / Vercel alternative.
MightyMoud/sidekick
Bare metal to production ready in mins; your own fly server on your VPS.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
jhj0517/Whisper-WebUI
A Web UI for easy subtitle using whisper model.
WoeUSB/WoeUSB-ng
WoeUSB-ng is a simple tool that enable you to create your own usb stick windows installer from an iso image or a real DVD. This is a rewrite of original WoeUSB.
emuplace/sudachi.emuplace.app
ClassicOldSong/moonlight-android
GameStream client for Android
voideditor/void
deskflow/deskflow
Deskflow lets you share one mouse and keyboard between multiple computers on Windows, macOS and Linux. It's like a software KVM (but without video).
ClassicOldSong/Apollo
Sunshine fork
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
AIHawk-co/Auto_Jobs_Applier
Auto_Jobs_Applier by AIHawk is an Agen that automates the jobs application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated and personalized way.
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
emcf/thepipe
Extract clean data from anywhere, powered by vision-language models ⚡
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
cline/cline
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Aider-AI/aider
aider is AI pair programming in your terminal
runtipi/runtipi
Runtipi is a homeserver for everyone! One command setup, one click installs for your favorites self-hosted apps. ✨
chiteroman/PlayIntegrityFix
Fix Play Integrity (and SafetyNet) verdicts.
daboynb/PlayIntegrityNEXT
itsOwen/CyberScraper-2077
A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama
janhq/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
vaamonde/pt_br-wpsoffice
Pacote de Tradução e Dicionário do WPS Office 2023 para o Linux Mint 20.x e 21.x
Kingsman44/Pixelify
Magisk module to enables pixel exclusive features and ui