Dataset for the Cloud Providers Microsoft Azure and Google Cloud Platform (GCP).
Tested on OpenJDK 17.0.2 & Maven 3.6.3.
mvn clean install -DskipTests
Extracted from https://learn.microsoft.com/en-us/azure/virtual-machines/linux/compute-benchmark-scores with the responsible Web Scraper.
cd azure-web-scraper
java -jar ./target/exectuable.jar
Copy the <table>
element of the table on the website and use it in the resources with minimal reformatting.
Then run the responsible Web Scraper.
cd azure-web-scraper
java -jar ./target/exectuable.jar
Copy the <div>
element of the table on the website and use it in the resources with minimal reformatting.
Then run the responsible Web Scraper.
cd azure-web-scraper
java -jar ./target/exectuable.jar
Copied from the https://docs.google.com/spreadsheets of https://www.cloudcarbonfootprint.org/docs/embodied-emissions/.
Extracted from https://cloud.google.com/compute/docs/cpu-platforms with the responsible Web Scraper.
cd gcp-web-scraper
java -jar ./target/exectuable.jar
Queried from BigQuery and saved as CSV:
SELECT year,cloud_region,location,zone_id,cfe_region,google_cfe FROM `bigquery-public-data.google_cfe.datacenter_cfe` WHERE year=2021 LIMIT 1000
Created by hand and matched with the cloud regions. A lot of table entries could not find a match with the regions:
- Lenoir, North Carolina
- Montgomery County, Tennessee
- Jackson County, Alabama
- Papillion, Nebraska
- Mayes County, Oklahoma
Copy the <table>
element of the table on the website and use it in the resources with minimal reformatting.
Then run the responsible Web Scraper.
cd gcp-web-scraper
java -jar ./target/exectuable.jar
Extracted from https://cloud.google.com/compute/docs/regions-zones with the responsible Web Scraper.
cd gcp-web-scraper
java -jar ./target/exectuable.jar