IncomeSim
IncomeSim is a time-series simulator based on the Adult dataset.
ProbAI 23 lecture
If you want to follow along in the notebook during the ProbAI lecture, you have two options:
- Clone this repository and open probai_lecture_github.ipynb in Jupyter/Jupyter lab
- Work in Colab from this notebook
The slides for the lecture can be found here.
Installing prerequisites
- IncomeSim is written in Python 3 and based on the Scikit-learn package and the Adult dataset.
- Start by installing python modules
pandas, numpy, scikit-learn, jupyter, requests, matplotlib
Preparing the data files
You don't need to do this if you use the ProbAI notebook, the notebook does this automatically!
- Download the Adult dataset
- Create a folder
data/income
in the IncomeSim root folder - Place the files
adult.data
,adult.names
andadult.test
indata/income
Generating data
- Run
python generate.py -n <number of samples> -T <length of horizon>
to fit the simulator and generate data