SIGN IN SIGN UP

Python Data Science Handbook: full text in Jupyter Notebooks

47156 0 0 Jupyter Notebook
2016-11-17 22:12:38 -08:00
# Python Data Science Handbook
2016-08-11 15:49:18 -07:00
2017-11-13 13:31:26 -08:00
[![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jakevdp/PythonDataScienceHandbook/master?filepath=notebooks%2FIndex.ipynb)
2018-08-28 13:05:27 -07:00
[![Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/jakevdp/PythonDataScienceHandbook/blob/master/notebooks/Index.ipynb)
2017-11-13 11:08:47 -08:00
This repository contains the entire [Python Data Science Handbook](http://shop.oreilly.com/product/0636920034919.do), in the form of (free!) Jupyter notebooks.
2016-08-11 15:49:18 -07:00
2017-11-13 11:08:47 -08:00
![cover image](notebooks/figures/PDSH-cover.png)
2017-11-13 11:14:55 -08:00
## How to Use this Book
2017-08-14 14:54:22 -07:00
2017-11-13 11:14:55 -08:00
- Read the book in its entirety online at https://jakevdp.github.io/PythonDataScienceHandbook/
2017-11-13 11:08:47 -08:00
2017-11-13 11:14:55 -08:00
- Run the code using the Jupyter notebooks available in this repository's [notebooks](notebooks) directory.
2017-11-13 11:11:35 -08:00
2018-08-28 13:13:35 -07:00
- Launch executable versions of these notebooks using [Google Colab](http://colab.research.google.com): [![Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/jakevdp/PythonDataScienceHandbook/blob/master/notebooks/Index.ipynb)
2018-08-28 13:05:27 -07:00
2017-11-13 13:31:26 -08:00
- Launch a live notebook server with these notebooks using [binder](https://beta.mybinder.org/): [![Binder](https://mybinder.org/badge.svg)](https://mybinder.org/v2/gh/jakevdp/PythonDataScienceHandbook/master?filepath=notebooks%2FIndex.ipynb)
2017-11-13 11:08:47 -08:00
2017-11-13 11:14:55 -08:00
- Buy the printed book through [O'Reilly Media](http://shop.oreilly.com/product/0636920034919.do)
2017-11-13 11:11:35 -08:00
## About
2016-08-11 15:49:18 -07:00
2017-11-13 11:08:47 -08:00
The book was written and tested with Python 3.5, though other Python versions (including Python 2.7) should work in nearly all cases.
2016-11-21 11:31:25 -08:00
2017-01-10 07:20:04 -08:00
The book introduces the core libraries essential for working with data in Python: particularly [IPython](http://ipython.org), [NumPy](http://numpy.org), [Pandas](http://pandas.pydata.org), [Matplotlib](http://matplotlib.org), [Scikit-Learn](http://scikit-learn.org), and related packages.
Familiarity with Python as a language is assumed; if you need a quick introduction to the language itself, see the free companion project,
[A Whirlwind Tour of Python](https://github.com/jakevdp/WhirlwindTourOfPython): it's a fast-paced introduction to the Python language aimed at researchers and scientists.
2017-08-14 14:54:22 -07:00
See [Index.ipynb](http://nbviewer.jupyter.org/github/jakevdp/PythonDataScienceHandbook/blob/master/notebooks/Index.ipynb) for an index of the notebooks available to accompany the text.
2016-08-11 15:44:38 -07:00
2017-11-13 11:11:35 -08:00
## Software
2016-11-23 09:37:18 -08:00
The code in the book was tested with Python 3.5, though most (but not all) will also work correctly with Python 2.7 and other older Python versions.
2017-01-27 12:11:38 -08:00
The packages I used to run the code in the book are listed in [requirements.txt](requirements.txt) (Note that some of these exact version numbers may not be available on your platform: you may have to tweak them for your own use).
To install the requirements using [conda](http://conda.pydata.org), run the following at the command-line:
2016-11-23 09:37:18 -08:00
```
$ conda install --file requirements.txt
```
To create a stand-alone environment named ``PDSH`` with Python 3.5 and all the required package versions, run the following:
```
$ conda create -n PDSH python=3.5 --file requirements.txt
```
You can read more about using conda environments in the [Managing Environments](http://conda.pydata.org/docs/using/envs.html) section of the conda documentation.
2016-08-11 15:44:38 -07:00
## License
2016-11-23 09:42:16 -08:00
### Code
The code in this repository, including all code samples in the notebooks listed above, is released under the [MIT license](LICENSE-CODE). Read more at the [Open Source Initiative](https://opensource.org/licenses/MIT).
### Text
2016-11-17 22:12:38 -08:00
The text content of the book is released under the [CC-BY-NC-ND license](LICENSE-TEXT). Read more at [Creative Commons](https://creativecommons.org/licenses/by-nc-nd/3.0/us/legalcode).