qu0laz's Stars
Xetera/ghost-cursor
🖱️ Generate human-like mouse movements with puppeteer or on any 2D plane
ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
suntong/html2md
HTML to Markdown converter
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
xlang-ai/OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
mlc-ai/web-llm
High-performance In-browser LLM Inference Engine
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
lavague-ai/LaVague
Large Action Model framework to develop AI Web Agents
Agent-Tools/awesome-autonomous-web
duckduckgo/tracker-radar-collector
🕸 Modular, multithreaded, puppeteer-based crawler
ishan0102/vimGPT
Browse the web with GPT-4V and Vimium
Skyvern-AI/skyvern
Automate browser-based workflows with LLMs and Computer Vision
coder-hxl/x-crawl
Flexible Node.js AI-assisted crawler library
ultrafunkamsterdam/nodriver
Successor of Undetected-Chromedriver. Providing a blazing fast framework for web automation, webscraping, bots and any other creative ideas which are normally hindered by annoying anti bot systems like Captcha / CloudFlare / Imperva / hCaptcha
getlinksc/css-selector-tool
A low-code data extractor for websites with built in proxy and parsing capabilities. Great for testing and debugging css selectors
kaliiiiiiiiii/Selenium-Driverless
undetected Selenium without usage of chromedriver
rebrowser/rebrowser-patches
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on demand.
StreetLamb/tribe
Low code tool to rapidly build and coordinate multi-agent teams
joshuarichards001/pixels
A real-time collaborative pixel art canvas
codercurious/dun-and-bradstreet-dnb-com-scraper
Scrape companies and contacts details from D&B database based on industry, job title, employee count, entity types etc.
Bunsly/JobSpy
Jobs scraper library for LinkedIn, Indeed, Glassdoor & ZipRecruiter
resque/resque
Resque is a Redis-backed Ruby library for creating background jobs, placing them on multiple queues, and processing them later.
openfaas/faas
OpenFaaS - Serverless Functions Made Simple
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
datitran/raccoon_dataset
The dataset is used to train my own raccoon detector and I blogged about it on Medium
niderhoff/nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
openimages/dataset
The Open Images dataset
several27/FakeNewsCorpus
A dataset of millions of news articles scraped from a curated list of data sources.
mediar-ai/screenpipe
24/7 local AI screen & mic recording. Build AI apps that have the full context. Works with Ollama. Alternative to Rewind.ai. Open. Secure. You own your data. Rust.