/PostgreSQL-DE1.3

Database set up in PostgreSQL for goodreads dataset. Modeling, cleaning and querying.

Primary LanguageJupyter Notebook

Database set up in Postgres DB for Goodreads Dataset

Project to create PostgreSQL database, model and normalize goodreads dataset, then load data to db and once set up to run some analytic queries.

Requirements: active Postgres server and Postgres client installed.
Dataset was downloaded from here.

After cloning this repo follow these steps:
Option 1 Step by step from CLI. In repo folder run:

  1. ./0_data-preparation-cleaning.ipynb
  2. ./1_create_db.sh
  3. ./2_create_load_tables.sh
  4. ./3_create_table_relations.sh
  5. ./4_queries_in_booksDB.ipynb

Option 2 Restore database from file. In CLI run.

  1. login to Postgres
  2. $ psql --host=localhost --port 5432 --username=zivile --password --dbname=books
    password to use zivile123

You can explore DB books from your favorite GUI Client like pgAdmin, DBeaver or other or directly from psql client.

Data Model

books