Jupyter notebook content for my OReilly book, the Python Data Science Handbook
See also the free companion project, A Whirlwind Tour of Python: a fast-paced introduction to the Python language aimed at researchers and scientists.
This repository will contain the full listing of IPython notebooks used to create the book, including all text and code. I am currently editing these, and will post them as I make my way through. See the content here:
- Help and Documentation in IPython
- Keyboard Shortcuts in the IPython Shell
- IPython Magic Commands
- Input and Output History
- IPython and Shell Commands
- Errors and Debugging
- Profiling and Timing Code
- More IPython Resources
- Understanding Data Types in Python
- The Basics of NumPy Arrays
- Computation on NumPy Arrays: Universal Functions
- Aggregations: Min, Max, and Everything In Between
- Computation on Arrays: Broadcasting
- Comparisons, Masks, and Boolean Logic
- Fancy Indexing
- Sorting Arrays
- Structured Data: NumPy's Structured Arrays
The notebooks above are still being edited. In the meantime, you can see all the code from the book in code_listings. The code is in IPython notebooks, organized by book chapter and section. All code from this book was tested with Python 3.4-3.5, though it should be near 100% compatible with Python 2.7 as well.
Nearly every figure used in the book was generated using Matplotlib. Most figures are generated by the code within the text; those few that are not can be found in the online-only Figure Appendix
The code here is released under the MIT license. Read more at the Open Source Initiative.
The text content of the book is released under the CC-BY-NC-ND license. Read more at Creative Commons.