sathish316/scrapify
ScrApify is a library to build APIs by scraping static sites and use data as models or JSON APIs. It powers APIfy which is used to create JSON APIs from any html or wikipedia page
Ruby
Issues
- 0
Use index as key
#8 opened by sathish316 - 0
find all with pagination
#15 opened by sathish316 - 0
Convert scraped HTML content to XML
#31 opened by sathish316 - 0
Pagination using next page selector
#7 opened by sathish316 - 0
conditions with <,=,> operators
#13 opened by sathish316 - 0
conditions with regex like
#14 opened by sathish316 - 0
- 0
find by id should crawl detailed content
#21 opened by sathish316 - 0
Export crawled content to csv
#22 opened by sathish316 - 0
Export crawled content to sqlite db
#23 opened by sathish316 - 0
Export crawled content to sql database
#24 opened by sathish316 - 0
Extract category attribute
#26 opened by sathish316 - 0
Download media (mp4, pdf etc) using crawler
#28 opened by sathish316 - 2
Support for Basic Authentication
#19 opened by sathish316 - 3
Support multiple html pages
#6 opened by sathish316 - 0
Support for Login with session
#20 opened by sathish316 - 0
pagination using array of pages
#29 opened by sathish316 - 1
Replace <br> tags in content with newline
#25 opened by sathish316 - 0
Attribute data types
#27 opened by sathish316 - 0
pagination using next page selector
#17 opened by sathish316 - 0
- 0
find all with conditions
#12 opened by sathish316 - 1
- 0
- 0
Validate css selector syntax
#3 opened by sathish316 - 0
Validate xpath selector syntax
#4 opened by sathish316 - 1
Tolerance to malformed XML
#9 opened by franciscolourenco