open-data
There are 2334 repositories under the open-data topic.
ckan/ckan
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, and data.humdata.org, among many other sites.
okfn-brasil/serenata-de-amor
🕵 Artificial Intelligence for social control of public administration | **This repository does not receive frequent updates. Check out the README**
common-voice/common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
statsbomb/open-data
Free football data from StatsBomb
codyogden/killedbygoogle
Part guillotine, part graveyard for Google's doomed apps, services, and hardware.
mdeff/fma
FMA: A Dataset For Music Analysis
github/CodeSearchNet
Datasets, tools, and benchmarks for representation learning of code.
GSA/datagov-wptheme
Data.gov WordPress Theme (obsolete)
softwareunderground/awesome-open-geoscience
Curated from repositories that make our lives as geoscientists, hackers and data wranglers easier or just more awesome
open-thoughts/open-thoughts
Fully open data curation for reasoning models
okfn-brasil/querido-diario
📰 Brazilian government gazettes, accessible to everyone.
juancarlospaco/faster-than-requests
Faster requests on Python 3
sentinelsat/sentinelsat
Search and download Copernicus Sentinel satellite images
datasets/awesome-data
Curated list of quality open datasets
MobilityData/gbfs
Documentation for the General Bikeshare Feed Specification, a standardized data feed for shared mobility system availability. Maintained by MobilityData
kuwala-io/kuwala
Kuwala is the no-code data platform for BI analysts and engineers, enabling you to build powerful analytics workflows. We set out to bring the state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations, together in one intuitive interface built with React Flow. In addition, we feed third-party data into data science models and products, with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) high-resolution demographics data, b) points of interest from OpenStreetMap, c) Google Popular Times.
cernopendata/opendata.cern.ch
Source code for the CERN Open Data portal
yuhonas/free-exercise-db
Open public domain exercise dataset in JSON format, with over 800 exercises and a browsable, searchable public frontend
siznax/wptools
Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis
blaylockbk/Herbie
Download numerical weather prediction datasets (HRRR, RAP, GFS, IFS, etc.) from NOMADS, NODD partners (Amazon, Google, Microsoft), ECMWF open data, and the University of Utah Pando Archive System.
kr-stn/awesome-sentinel
Curated list of awesome tools, tutorials and APIs for Copernicus Sentinel satellite data
etalab/DVF-app
Exploration of DVF data
Fraud-Detection-Handbook/fraud-detection-handbook
Reproducible Machine Learning for Credit Card Fraud Detection - Practical Handbook
github/covid-19-repo-data
Data archive of identifiable COVID-19 related public projects on GitHub
magda-io/magda
A federated, open-source data catalog for all your big data and small data
catalyst-cooperative/pudl
The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
meteostat/meteostat-python
Access and analyze historical weather and climate data with Python.
rgllm/awesome-portugal-data
🇵🇹 List of open data repositories in Portugal
geonetwork/core-geonetwork
GeoNetwork is a catalog application to manage spatially referenced resources. It provides powerful metadata editing and search functions as well as an interactive web map viewer. It is currently used in numerous Spatial Data Infrastructure initiatives across the world.
github/innovationgraph
GitHub Innovation Graph
anahitasocial/anahita
Anahita is a platform and framework for developing open science and knowledge sharing applications on a social networking foundation.
openlists/ElectrophysiologyData
A list of openly available datasets in (mostly human) electrophysiology.
Chicago/food-inspections-evaluation
This repository contains the code to generate predictions of critical violations at food establishments in Chicago. It also contains the results of an evaluation of the effectiveness of those predictions.
basedosdados/sdk
⚙️ Maintenance code for the data lake (metadata and access packages) | 📖 Docs: https://basedosdados.github.io/sdk/
GetDKAN/dkan
DKAN Open Data Portal
earthobservations/wetterdienst
Open weather data for humans.