BuilderIO/gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
TypeScriptISC
Issues
- 0
- 6
error TS2322
#151 opened by sc0h0 - 1
cookie example
#158 opened by cmbcbe - 0
- 0
Scrape HTML tag
#183 opened by WebdevWebi - 2
Cookies not accepted
#145 opened by stevenbaert - 0
Hi
#181 opened by chihebhichri67 - 0
- 2
- 0
match mid part of path
#173 opened by SmallDryad - 1
Multiple match patterns
#172 opened by qqaatw - 1
Crawl a Github repo
#171 opened by mena234 - 2
GPT Crawler cli Drop-in config
#168 opened by zerofill - 3
'Zod' package not found |
#123 opened by Daethyra - 1
Does gpt-crawler server always return same site?
#147 opened by kaibadash - 1
Output.json file not created
#152 opened by bourgeda - 0
- 1
how to crawler this site?match not work
#159 opened by tom6q6 - 0
Type Error
#165 opened by JoelWekesa - 0
Doesnt Extract Texts present in dropboxes
#164 opened by KumarSampurn - 0
memory usage need to be optimized
#163 opened by banditsmile - 0
PlaywrightCrawler memory problem and errors
#162 opened by lipstk - 0
sh: cross-env: command not found
#161 opened by Vickie-Liu - 0
add a username and password?
#160 opened by GentleLemon - 0
extracting text in hidden div blocks
#157 opened by udgithub - 0
How to supply read-able code to the GPT?
#155 opened by lucastobrazil - 0
Can i use Gemini model by google?
#150 opened by amrpyt - 0
How to crawl Single Page Application(SPA)
#149 opened by ouyh1111 - 11
Multiple websites at once?
#135 opened by heyfletch - 0
Multiple Selectors not Reflected in Output
#146 opened by mahdii0908 - 1
Crawl websites protected by username and password?
#136 opened by alzh666 - 2
How to crawl https://zod.dev/ ?
#124 opened by MontiL - 2
WARN PlaywrightCrawler: Reclaiming failed request back to the list or queue. Request blocked - received 429 status code.
#137 opened by Voyager3D - 7
Json too large for GPT
#113 opened by tristan-mcinnis - 1
Trying to Crawl site nothing working
#139 opened by upup666 - 0
- 2
Only one tag html for all the page
#128 opened by Th3Heavy - 1
Add for a selector to exclude elements in the site
#134 opened by razaanstha - 3
npm start issue
#127 opened by diandian0420 - 3
Add support for concurrent invocations to crawl
#120 opened by adityak74 - 0
how to add userName & passwd to gpt-crawler
#141 opened by DorakuCN - 0
ERROR PlaywrightCrawler: Request failed and reached maximum retries. Navigation timed out after 60 seconds
#140 opened by Mytraas - 2
Disallowed Special Token
#130 opened by MrAshRhodes - 0
- 0
aisa gpt
#125 opened by AZURALIF06 - 0
How to paginate for large JSON files?
#119 opened by chnsh - 0
Crawling more than max number of pages
#118 opened by dcgleason - 0
- 0
- 0
How to limit the hierarchy of pages to be crawled?
#112 opened by wywywy1990