apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
TypeScriptApache-2.0
Stargazers
- 0xjgvepilot-dev
- ankitjain28may@Razorpay
- cermak-petr@Apifier
- Copernicium112
- cybairflyCzech Republic
- darcyturkGreater Los Angeles
- djob
- entrptaher@code-eating-ants
- EsomNovotech Software Solutions
- FGRibreau@nobullshitbooks @netwo-io @hook0 @image-charts @killbugtoday @mailpopin / Sold @Redsmin @bringr
- gdunghi
- HemmingssonProduct Designer at Einride & Co-Founder of Not Toys
- iworkforthemSingapore
- jakubbaladaApify
- karelrochelt@amio-io
- kenyimoses
- kevinsegalSegal Industries
- kuceram
- leonardohipolitoLeonardo Hipólito
- littlelotta@shopping
- lucasyvasCanada
- nsourov@CHEQPlease
- P4l1ndr0m
- pranciskus
- rationalthug
- RobbertvermeulenPeople & Media
- SantoshSrinivas79
- smallcar88
- StorytellerCZ@LiteraryUniverse @Meteor-Community-Packages
- sukhjitsingh-impactradius
- texano00Crif SpA
- theill@familiohq
- TibboddiT
- yonnyZer0
- yunjialiSouthampton
- zatziky