For any data scientist Python is a must, but Python alone will not go very far on its own. Pandas is the data analytics library that allows Python to deliver the functionality which comes out of the box in R.
Setting up Python & Pandas is now made very easy with Anaconda, and the running of Python can be made very intuitive with Jupyter Notebook.
- Download Python & Install. – python 2.7 was used here
- Download Anaconda & Install
- Open Command Prompt after installation
- set PATH=%PATH%;c:\Python27;
- conda –version
- conda install pandas
- conda install ipython
- conda install pip
- jupyter notebook
For more details, please see the full tutorial to install Pandas here.
A simple, intuitive, and powerful introduction to Pandas can be found here.
The graphics matplotlib library is discussed here.
Data files useful to run analysis on: