Introduction

pandas is the Python Data Analysis package. It allows for data ingestion, transformation, and cleaning, and creates objects that can then be passed on to analytic packages like statsmodels and scikit-learn for modeling and packages like matplotlib, seaborn, and plotly for visualization.

pandas is built on top of numpy, so many numpy functions are commonly used in manipulating pandas objects.


Source: Abhijit Dasgupta, https://www.araastat.com/BIOF085/pandas.html#starting-pandas
Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 License.