NOAA CDO -> CSV -> Parquet -> DuckDB
This repo is two things:
- A rust script for converting the special-formatted NOAA CDO data into CSVs
- A few node scripts for creating DuckDBs out of these data parquet files*
*This repo does not include CSV -> Parquet processing, but that's because you can grab this utility instead.
TODO
- Create a Makefile for downloading the NOAA data, for now you can grab it from the source (you want
ghcnd-stations.txt
andghcnd_gsn.tar.gz
). - Publish the built csv/parquet/duckdb artifacts to GitHub releases, maybe.