/dd2db

Discogs datasets 2 database

Primary LanguagePython

dd2db: Discogs datasets 2 database

A recasting of code from the discogs-xml2db repository to advance into the modern era of Python.

A CLI toolkit oriented around taking the Discogs Data dumps and ingesting them into various database systems, sqlite3 and Postgresql to start with. The data is real world recorded music data (supports discogs.com, quite longitudinal (spanning 16 years as of 2024), updated monthly, and of course a bit messy.

Quick start

$ python3 -m venv dd2db
$ source dd2db/bin/activate
$ git clone https://github.com/crossjam/dd2db.git
$ cd dd2db
$ pip3 install --upgrade -e .
$ dd2db --help