/awesome-federal-AI-datasets

A list of high quality accessible AI-ready datasets from the US Federal government.

Primary LanguagePythonCreative Commons Zero v1.0 UniversalCC0-1.0

Awesome Federal AI datasets

A list of high quality accessible AI-ready datasets from the US Federal government.

This repo aims to provide a convenient platform for discovering these datasets, ensuring their accessibility, and upholding a standard of quality as defined by an objective criteria.

Status Dept. Agency Title
🟢 HHS NIH Clinical Trials
🟢 HHS NIH Bibliometric publication citation graph links
🟢 HHS CDC Vaccine Adverse Event Reporting System (VAERS)
🟢 HHS CDC COVID-19 Case Surveillance Public Use Data with Geography
🟢 Legislative GPO US Federal Register
🟢 HHS NIH PubMed: Biomedical publication abstract
🟢 HHS NIH PubMed Central: Full text biomedical publications
🟢 HHS NIH ExPORTER: NIH Grant funding
🟢 HHS FDA Drug, Device, Animal, and Veterinary Adverse Events
🟢 HHS CMS Open Payments Dataset Downloads
🟢 DOC USPTO US Trademarks: Filings and registration images
🟢 DOC USPTO US Patents: full text and images
🔴 GSA Regulations.gov, Federal rulemaking process
🔴 DOJ NSD Foreign Agents Registration Act : Registrants and PDFs
DOC NOAA Weather and Climate Quick Links : National Centers for Environmental Information

AI Ready scores

Questions and scores for the AI dataset can be found at AI_ready_questions.yaml.

Development

Built with 💜 by @metasemantic. Code is linted by black and conforms to standards by flake8. New projects should be added to data/datasets. To help build the YAML entry run make add, make build, then submit a PR.