imdb_dataset_article: A Jupyter Notebook repository from fithisux

Welcome to your new dbt project!

Demo code for an article. Download the files of the IMDb Non-Commercial Datasets into an imdb_files folder in the parent directory of the repository.
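The download step could be scripted roughly as follows. This is a minimal sketch, not part of the repository: it assumes the seven files listed at https://datasets.imdbws.com/, and it assumes the script is started from the repository root. The helper names `plan_downloads` and `download_all` are hypothetical. The files are kept as downloaded (gzipped); whether the project expects them decompressed is not stated here.

```python
from pathlib import Path
from urllib.request import urlretrieve

# Assumption: base URL and file list as published on the IMDb
# Non-Commercial Datasets page.
BASE_URL = "https://datasets.imdbws.com/"
IMDB_FILES = [
    "name.basics.tsv.gz",
    "title.akas.tsv.gz",
    "title.basics.tsv.gz",
    "title.crew.tsv.gz",
    "title.episode.tsv.gz",
    "title.principals.tsv.gz",
    "title.ratings.tsv.gz",
]


def plan_downloads(repo_root: Path) -> list[tuple[str, Path]]:
    """Pair each source URL with its target path in ../imdb_files."""
    target_dir = repo_root.resolve().parent / "imdb_files"
    return [(BASE_URL + name, target_dir / name) for name in IMDB_FILES]


def download_all(repo_root: Path) -> None:
    """Download every file that is not already present (hypothetical helper)."""
    for url, target in plan_downloads(repo_root):
        target.parent.mkdir(parents=True, exist_ok=True)
        if not target.exists():
            urlretrieve(url, target)
```

Calling `download_all(Path.cwd())` from the repository root would then fill the sibling imdb_files folder.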

Also install the dependencies from requirements.txt in a virtual environment. In my case (Windows):

python.exe -m venv venv

venv\Scripts\activate.bat

pip3 install -r requirements.txt

Because of limited storage, the cleansed (or original) datasets are materialized as views.
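To illustrate why views save storage, here is a small sketch that uses Python's built-in sqlite3 as a stand-in database. The repository's actual warehouse and models are not assumed; the table name, view name, and rows are hypothetical. A view stores only its query, so the cleansed rows are computed on read rather than written to disk a second time.

```python
import sqlite3

# Stand-in database: this is NOT the project's actual warehouse.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE raw_titles (tconst TEXT, title TEXT)")
conn.executemany(
    "INSERT INTO raw_titles VALUES (?, ?)",
    # Hypothetical rows with stray whitespace to cleanse.
    [("tt_demo_1", "  Example Title  "), ("tt_demo_2", "Another Title ")],
)

# A view stores only the query text, not a second copy of the rows.
conn.execute(
    "CREATE VIEW cleansed_titles AS "
    "SELECT tconst, TRIM(title) AS title FROM raw_titles"
)

# The cleansing expression is evaluated each time the view is read.
rows = conn.execute("SELECT * FROM cleansed_titles ORDER BY tconst").fetchall()

# The catalog records the object as a view, not a table.
kind = conn.execute(
    "SELECT type FROM sqlite_master WHERE name = 'cleansed_titles'"
).fetchone()[0]
```

The trade-off is that every read repeats the cleansing work, which is the price paid for not storing the data twice.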


There is a Jupyter notebook for profiling with JupySQL.

To profile with soda-core, run the following (if the existing JSON output does not satisfy you):

soda scan -d imdb_dataset -c configuration.yaml checks.yaml -V -srf soda_scan.json
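Once the scan has written soda_scan.json, the results can be summarized with a few lines of Python. This is a hedged sketch: it assumes the file contains a top-level "checks" list whose entries carry an "outcome" field, which should be verified against the actual file. The function names are hypothetical.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_scan(results: dict) -> Counter:
    """Count check outcomes in a parsed scan-results document.

    Assumption: a top-level "checks" list whose items have an "outcome" key.
    Entries without an outcome are counted as "unknown".
    """
    return Counter(
        check.get("outcome") or "unknown" for check in results.get("checks", [])
    )


def summarize_scan_file(path: str = "soda_scan.json") -> Counter:
    """Load the results file written via -srf and summarize it."""
    return summarize_scan(json.loads(Path(path).read_text(encoding="utf-8")))
```

For example, `print(summarize_scan_file())` run next to soda_scan.json would print the number of checks per outcome.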
