jsonld-crawler

Search for json-ld entities on a website and collect them

This utility produces a newline delimited list of jsonld entities matching the given object type from all pages of a website.

To extract 'Product's from 'exmpl.store' you can invoke the utility like so.

cargo run -- --url http://exmpl.store

For more options consult the help page with the --help option.

USAGE:
    crawler [OPTIONS] --url <URL>

OPTIONS:
    -h, --help                         Print help information
    -o, --output <OUTPUT>              [default: entities.ndjson]
    -T, --object-type <OBJECT_TYPE>    Object @type to search for (e.g. Product)
    -u, --url <URL>                    The domain name to search
    -V, --version                      Print version information