spatie/robots-txt
Determine if a page may be crawled from robots.txt, robots meta tags and robot headers
PHPMIT
Issues
- 3
- 1
file_get_contents($source) throws an InvalidArgumentException on Websites with expired Certificates
#34 opened by osthafen - 3
Not working properly with: x-robots-tag: none
#30 opened by nnerijuss - 5
- 1
- 4
Custom UserAgent mismatches due to parseUserAgent()
#25 opened by muhci - 8
- 3
Implement Allow directive
#18 opened by BenMorel - 1
- 1
Fixes "case-insensitive"-Rule in X-robots-tag
#10 opened by RobinDev - 1
Fix nofollow or noindex check for Headers
#11 opened by RobinDev - 1
Fix Wildcard check in Robots.txt and Headers
#12 opened by RobinDev - 2
- 2
Support robot headers
#1 opened by brendt