For people who struggle to start in data science with Python
This hands-on in-person workshop is based on Python for Data Science Course by IBM Cognitive Class
Learn how to create your first Python scripts and perform basic hands-on data analysis using Jupyter-based environment.
The workshop will cover core topics:
Types | Variables | Strings |
---|---|---|
- Hello World
- Comments
- Errors
- Types
- Expressions
- Variables
- Strings
Tuple | Set | Dictionary |
---|---|---|
- Tuples
- Lists
- Sets
- Dictionaries
Condition | Loop | Class |
---|---|---|
- Conditions
- Branching
- Loops
- Functions
- Objects
- Classes
Read file | pandas DataFrame | Specify columns |
---|---|---|
- Reading files with open
- Writing files with open
- Loading data with pandas
- Working with and Saving data with pandas
1D Array | 2D Array | Array slicing |
---|---|---|
- Creating and Manipulating 1D & 2D Arrays
- Array Operations
You will need a laptop that can access the internet
Install miniconda or install the (larger) Anaconda distribution
Install Python using Miniconda
OR Install Python using Ananconda
Clone the repository
git clone git@github.com:aymanibrahim/pyds.git
OR Download the repository as a .zip file
Change current directory to pyds directory
cd pyds
Install Python with the required packages into an environment named pyds as per environment.yml YAML file.
conda env create -f environment.yml
When conda asks if you want to proceed, type "y" and press Enter.
Change the current default environment (base) into pyds environment.
conda activate pyds
Use check_environment.py script to make sure everything was installed correctly, open a terminal, and change its directory (cd) so that your working directory is the workshop directory pyds you cloned or downloaded. Then enter the following:
python check_environment.py
If everything is OK, you will get the following message:
Your workshop environment is set up
Start JupyterLab using:
jupyter lab
JupyterLab will open automatically in your browser.
You may access JupyterLab by entering the notebook server’s URL into the browser.
Press CTRL + C in the terminal to stop JupyterLab.
Change the current environment (pyds) into the previous environment.
conda deactivate
- Python: Programming language
- Conda: Package and environment manager
- Anaconda: Python distribution
- Miniconda: Minimal installer for conda
- NumPy: Fundamental package for scientific computing with Python
- Matplotlib: Python 2D plotting library
- pandas: Python data analysis library
- Jupyter Notebook: Web application to create documents with code, equations, visualizations and text
- JupyterLab: Web-based development environment for Jupyter Notebooks
- Python for Data Science: Course by IBM Cognitive Class
Thanks for your interest in contributing! There are many ways to contribute to this project. Get started here.
Python for Data Science Workshop by Ayman Ibrahim is licensed under a Creative Commons Attribution 4.0 International License. Based on a work at IBM Cognitive Class Python for Data Science by Joseph Santarcangelo, PhD.