Packt+ | Advance your knowledge in tech

You're reading from Mastering Numerical Computing with NumPy Master scientific computing and perform complex operations with ease

Product type Paperback

Published in Jun 2018

Publisher Packt

ISBN-13 9781788993357

Length 248 pages

Edition 1st Edition

Languages

Python

Tools

NumPy

Concepts

Scientific Computing

Authors (3):

Mert Cakmak

Tiago Antao

Cuhadaroglu

View More author details

Table of Contents (16) Chapters

Title Page

Packt Upsell

Contributors

Preface

1. Working with NumPy Arrays

2. Linear Algebra with NumPy FREE CHAPTER

3. Exploratory Data Analysis of Boston Housing Data with NumPy Statistics

4. Predicting Housing Prices Using Linear Regression

5. Clustering Clients of a Wholesale Distributor Using NumPy

6. NumPy, SciPy, Pandas, and Scikit-Learn

7. Advanced Numpy

8. Overview of High-Performance Numerical Computing Libraries

9. Performance Benchmarks

1. Other Books You May Enjoy

Leave a review - let other readers know what you think

Index

NumPy and pandas

When you think about it, NumPy is a fairly low-level array-manipulation library, and the majority of other Python libraries are written on top of it.

One of these libraries is pandas, which is a high-level data-manipulation library. When you are exploring a dataset, you usually perform operations such as calculating descriptive statistics, grouping by a certain characteristic, and merging. The pandas library has many friendly functions to perform these various useful operations.

Let's use a diabetes dataset in this example. The diabetes dataset in sklearn.datasets is standardized with a zero mean and unit L2 norm.

The dataset contains 442 records with 10 features: age, sex, body mass index, average blood pressure, and six blood serum measurements.

The target represents the disease progression after these baseline measures are taken. You can look at the data description at https://www4.stat.ncsu.edu/~boos/var.select/diabetes.html and a related paper at http://web.stanford.edu...