Pinned Repositories
actor-whitepaper
This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.
apify-cli
Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
apify-mcp-server
Apify MCP server (tools for web scraping, data extraction, and automation)
crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
fingerprint-suite
Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
got-scraping
HTTP client made for scraping based on got.
impit
impit | rust library for browser impersonation
mcp-server-rag-web-browser
A MCP Server for the RAG Web Browser Actor
proxy-chain
Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.
apify's Repositories
apify/actor-page-analyzer
Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSON-LD metadata, analyzes AJAX requests, etc.
apify/actor-scraper
House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
apify/got-cjs
An action to release a CommonJS version of the popular library got, which is soon to be available only in an ESM format.
apify/better-sqlite3-with-prebuilds
Better SQLite prebuild & publish action
apify/chat-with-a-website
A simple app that lets you chat with a given website.
apify/actor-legacy-phantomjs-crawler
The actor implements the legacy Apify Crawler product. It uses PhantomJS headless browser to recursively crawl websites and extract data from them using a piece of JavaScript code.
apify/actor-example-python
Example Apify Actor written in Python
apify/crawlee-parallel-scraping-example
An example repository showcasing how you can scrape in parallel using one request queue
apify/input-schema-editor-react
Apify input schema editor written in React.js
apify/actor-example-secret-input
Example actor showcasing the secret input fields
apify/aidevworld2023
How to get clean web data for chatbots and LLMs slides and supporting materials.
apify/actor-imagediff
Returns an image containing difference of two given images.
apify/apify-web-covid-19
A list of public COVID-19 APIs to be rendered on https://apify.com/covid-19
apify/slack-messages-action
It wraps up messages sending from Apify GitHub workflows into Slack.
apify/Flowise
Drag & drop UI to build your customized LLM flow using LangchainJS
apify/playwright-test-actor
Source code for the Playwright Test public actor.
apify/actor-proxy-test
apify/actor-selenium-mocha-runner
Actor that runs Selenium based Mocha tests.
apify/apify-docs-preset
Common preset for the v2 documentation Docusaurus instances.
apify/docsearch-apify-docs
:blue_book: Tweaked version of docsearch for apify-docs
apify/komparz
Special, yet insignificant actors
apify/scrapy-migrator
A standalone POC script for wrapping Scrapy projects with Apify middleware.
apify/actor-firebase-firestore-import
apify/apify-docs-v2
Development repository for new version of docs.apify.com, archived and merged to the apify-docs repository now
apify/actor-chat-with-your-website
This Actor enables you to chat with your website like you do with your friends.
apify/apify-fig-autocomplete
Fig adds autocomplete to your terminal.
apify/apify-sdk-js-v2
fork of the SDK monorepo to be able to deploy another docusaurus instance
apify/apify-test-actors
apify/page-analyzer-ui
Interface for act-page-analyzer
apify/ps-tree
Fork of indexzero/ps-tree that also includes memory usage