So It's Time To Analyse
Using packages as suggested from
https://blog.dominodatalab.com/video-huge-debate-r-vs-python-data-science/
- feather - fast reading/writing
- ibis - dataframes with database neurtral way
- paratext - fast csv reading
- bcolz - compressed columnar data storage. Like SFrames
- altair - matplotlib replacement, uses grammar of graphics
- bokeh - interactive visualizations
- geoplotlib - maps
- blaze - numpy pandas syntax with any backend, abstracts storage and compute. Eg spark
- xarray - high end data manipulations. n dimension arrays
- dask - parallel computation. Dynamic task scheduler (like celery, optimised for interactivity), parallel arrays, dataframes and lists.
- keras - deep learning
- pymc3 - high end algorithms for modelling