mc-crawler - A MobileCoin Network Crawler
This binary crawls the MobileCoin network and optionally provides 2 JSONs in stellarbeat.io format.
The crawler communicates with the validator nodes using RPCs provided by the mc-consensus-api, and asks each new node for the last consensus message it broadcast to the other validators. The response of the gRPC contains, among other information, the queried node's quorum set which in turn contains other validators that the crawler may have not yet seen.
The crawler, therefore, only finds validators (no watcher nodes), and will not find nodes that are not included in any validator's quorum set.
The Nodes-JSON contains the following data about every found node:
- Hostname
- Port
- Quorum Set
- Public Key
- Connectivity status
- (When available) IP-based Geolocation data, i.e. country and ISP
The Crawl Report contains the same data as the Nodes-JSON in addition to metadata about the crawl such as the duration and a timestamp.
We maintain an HTTP API at https://api.crawler.mc.trudi.group/v1 with crawl data dating back to 23rd August 2021.
The API returns a single JSON file of the crawl recorded at the requested timestamp which must be provided as an RFC 3339 and ISO 8601 date and time string.
The most recent data can be retrieved by not passing a timestamp; the oldest with https://api.crawler.mc.trudi.group/v1/2021-08-23T20:00:00.007Z.
-
Rust's
nightly-2022-04-29
compiler which will be downloaded and installed automatically -
Protobuf compiler
protoc
which can be built from source or installed using a package manager or , e.g.apt install -y protobuf-compiler
on Ubuntu
Some of the crates used in this library need the Intel SGX environment variables
SGX_MODE
and IAS_MODE
.
You can set them in your terminal like below or pass them when building the binary.
export SGX_MODE=SW IAS_MODE=DEV
Having set the environment variables, the SGX variables do not need to be passed whenever a call to a cargo subcommand is made.
Continue to the section on running the crawler.
SGX_MODE=SW IAS_MODE=DEV cargo build --release
- The environment variables are only necessary if you skipped step 2.
- The initial compilation will take several minutes due to some of the dependencies used in this project.
SGX_MODE=SW IAS_MODE=DEV cargo run --release [-- path_to_bootstrap_file --output output_directory --debug --fbas --complete]
- The environment variables are only necessary if you skipped step 2.
- Nodes that should be used to bootstrap the crawler are read from text file in which each node's URI is stored on a separate line. The default is "bootstrap.txt".
- The default output directory is set to "crawl_data".
- The crawler optionally writes a JSON with the FBAS discovered during the crawl when "fbas" is passed.
- The crawler optionally writes a JSON with additional data about the crawl when "complete" is passed.
- Debug level messages are suppressed by default.
Passing --debug results in more verbose terminal output during the crawl.
The results presented in the paper can all be reproduced using the data obtained from the crawler and the fbas_analyzer.
Refer to its documentation for installation instructions before proceeding.
Below are some example commands: (see target/release/fbas_analyzer -h
for more analysis options)
Find all minimal quorums, minimal blocking sets and minimal splitting sets and output metrics about the sizes of the node sets.
target/release/fbas_analyzer -adp mobilecoin_nodes_completed_manually_2021-08-02.json
target/release/fbas_analyzer -adp mobilecoin_nodes_completed_manually_2021-08-02.json --merge-by-org mobilecoin_organisations_2021-08-02_created_manually.json
target/release/fbas_analyzer -adp mobilecoin_nodes_completed_manually_2021-08-02.json --merge-by-isp
Find the same sets as above, but merge by countries and output lists of node lists (instead of metrics)
target/release/fbas_analyzer -ap mobilecoin_nodes_completed_manually_2021-08-02.json --merge-by-country