Pinned Repositories
actor-whitepaper
This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.
apify-cli
The Apify command-line interface (CLI) helps you create, develop, build, and run Apify actors, and manage the Apify cloud platform.
apify-mcp-server
Apify MCP server (tools for web scraping, data extraction, and automation)
crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
fingerprint-suite
Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
got-scraping
An HTTP client made for scraping, based on got.
impit
Rust library for browser impersonation
mcp-server-rag-web-browser
An MCP server for the RAG Web Browser Actor
proxy-chain
Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.
apify's Repositories
apify/act-crawl-url-list
Apify actor to crawl a list of URLs
apify/apify-dev-js
Development dependencies used in all Apify projects.
apify/js-bson-32MB
Version of the bson package with the BSON object size limit changed to 32 MB
apify/keboola-base-node
Node.js support for Keboola Docker infrastructure
apify/keboola-ex-apify-docker
Docker image for the Apify extractor for Keboola Connection
apify/qtwebkit