Python Libraries
Throughout this book, we'll be using various Python libraries, including pandas, Matplotlib, Seaborn, and scikit-learn.
pandas
pandas is an open source package that provides many functions for loading and processing data in order to prepare it for machine learning tasks. It also has tools for analyzing and manipulating data. pandas can read data from many formats; we will mainly be using CSV data throughout this book. To read CSV data, you can use the read_csv() function, passing the path to the file (for example, filename.csv) as an argument. The function returns the data as a pandas DataFrame. An example of this is shown here:
>>> import pandas as pd
>>> pd.read_csv("data.csv")
In the preceding code, pd is an alias for pandas. Using an alias is not mandatory, but pd is the widely used convention. To preview a pandas DataFrame, you can use the head() method, which returns the first five rows by default. This will be demonstrated in one of the following exercises.
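As a minimal sketch of how read_csv() and head() fit together, the following example reads a small CSV from an in-memory string rather than from a file on disk, so that it runs without any external data. The column names and values are hypothetical and are not taken from this book's datasets; with a real file, you would pass its path to read_csv() instead.

```python
import io

import pandas as pd

# Hypothetical sample data standing in for a file such as data.csv
csv_text = """name,age,city
Ana,34,Lisbon
Ben,28,Leeds
Chloe,45,Lyon
Dev,39,Delhi
Eva,23,Oslo
Finn,31,Bergen
"""

# read_csv() accepts a file path or a file-like object and returns a DataFrame
df = pd.read_csv(io.StringIO(csv_text))

# head() returns the first five rows by default
first_five = df.head()

# Pass a number to head() to change how many rows are returned
first_two = df.head(2)

print(df.shape)
print(first_five)
```

Here, df holds all six rows, while first_five and first_two are smaller DataFrames containing only the leading rows. This makes head() a quick way to check that a file was parsed as expected before working with the full dataset.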
Note
Please visit the following link to learn more about pandas...