Consider pulling from iam-dataset rather than scraping AWS' reference pages
skeggse opened this issue · 1 comment
skeggse commented
You might already be familiar with https://github.com/iann0036/iam-dataset, but it's a rather handy reference that packages up a ton of useful information about AWS IAM and is used downstream in a number of tools. You might consider ripping out the existing scraping tooling in this repository - if you don't want to maintain it - and instead sourcing the package's data from iam-dataset!
michael-k commented
They are also just scraping the AWS docs, using https://github.com/duo-labs/parliament/blob/main/utils/update_iam_data.py. 🤷 At least that's the case for the part of their dataset that's relevant for awacs.