/datascience

Data Science AI Artificial Intelligence Machine Deep Learning Classification Regression Python Keras TensorFlow TensorFlow2 TPOT XGBoost Matplotlib NumPy Pandas scikit-learn Folium Seaborn Jupyter Lab Notebook

Primary LanguageJupyter Notebook

DATA SCIENCE

This repository stores Jupyter Notebooks to demonstrate skills in Data Science, Artificial Intelligence, classification and regression problems with Python, Keras, scikit-learn, Matplotlib, NumPy, Pandas, TPOT, XGBoost, Folium, Seaborn among others.

DEPENDENCIES

The code has been tested using:

Virtual environment (<env_name>=.venv) can be generated with requirements.txt file found in main folder.

Command to configure virtual environment with venv:

~/datascience$ python3 -m venv .venv
~/datascience$ source .venv/bin/activate
(.venv)~/datascience$ python3 -m pip install pip==24.2
(.venv)~/datascience$ python3 -m pip install setuptools==75.2.0
(.venv)~/datascience$ python3 -m pip install -r requirements.txt
(.venv)~/datascience$ pre-commit install

HOW TO RUN NOTEBOOKS

A good way to play with the Jupyter Notebooks is through Jupyter Lab. To run any of them use the command shown below:

(.venv)~/datascience$ jupyter lab <notebook_name>.ipynb

It might be also necessary to install locally Graphviz for rendering graph images with the command:

~/datascience$ sudo apt-get install graphviz

Graph image example of a decision tree is shown below.

Graph image example of a decision tree

CREDITS

author: alvertogit copyright: 2018-2024