You're reading from Functional Python Programming: Discover the power of functional programming, generator functions, lazy evaluation, the built-in itertools library, and monads

Product type: Paperback
Published in: Apr 2018
Publisher: Packt
ISBN-13: 9781788627061
Length: 408 pages
Edition: 2nd Edition
Languages: Python
Concepts: Functional Programming

Cleaning raw data with generator functions

One task that arises in exploratory data analysis is cleaning up raw source data. This is often done as a composite operation that applies several scalar functions to each piece of input data to create a usable dataset.
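As a minimal sketch of this idea, the following generator function applies a composite of two scalar functions to each raw item. The names `clean_text`, `to_float`, and `cleaned` are hypothetical, chosen only for illustration, and the sample values are assumptions rather than data from a real file.

```python
from typing import Iterable, Iterator


def clean_text(raw: str) -> str:
    """Scalar step 1 (hypothetical): strip surrounding whitespace."""
    return raw.strip()


def to_float(text: str) -> float:
    """Scalar step 2 (hypothetical): convert cleaned text to a number."""
    return float(text)


def cleaned(raw_values: Iterable[str]) -> Iterator[float]:
    """Lazily apply the composite of both scalar functions to each item."""
    for raw in raw_values:
        yield to_float(clean_text(raw))


# Illustrative raw input with stray whitespace around each value.
raw_column = [" 10.0", "8.04 ", "\t13.0"]
result = list(cleaned(raw_column))
```

Because `cleaned` is a generator, no value is converted until the consumer (here, `list`) asks for it, so the same function works on inputs too large to hold in memory.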

Let's look at a simplified set of data that is commonly used to demonstrate techniques in exploratory data analysis. It's called Anscombe's quartet, and it comes from the article "Graphs in Statistical Analysis" by F. J. Anscombe, which appeared in The American Statistician in 1973. The following are the first few rows of a downloaded file containing this dataset:

Anscombe's quartet
I               II              III             IV
x       y       x       y       x       y       x       y
10.0    8.04    10.0    9.14    10.0    7.46    8.0     6.58
8.0     6.95    8.0     8.14    8.0     6.77    8.0     5.76
13.0    7.58    13.0    8.74    13.0    12.74   8.0     7.71

Sadly, we can't trivially process this with the csv module. We have to do a little bit of parsing to extract the useful information from this file. Since the data is properly tab...
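The passage above is cut off, so what follows is only a sketch of the kind of parsing it hints at, not the book's own code. It assumes the file is tab-delimited and that the first three lines are headings. The function names `row_iter`, `data_rows`, and `series`, and the embedded `SAMPLE` text, are hypothetical stand-ins for a real downloaded file. `csv.reader` is used only to split each line on tabs; the heading rows and numeric conversion are handled by separate generator functions.

```python
import csv
import io
from typing import Iterable, Iterator, List, Tuple

# Assumed layout: three heading lines, then tab-delimited x/y pairs.
SAMPLE = (
    "Anscombe's quartet\n"
    "I\tII\tIII\tIV\n"
    "x\ty\tx\ty\tx\ty\tx\ty\n"
    "10.0\t8.04\t10.0\t9.14\t10.0\t7.46\t8.0\t6.58\n"
    "8.0\t6.95\t8.0\t8.14\t8.0\t6.77\t8.0\t5.76\n"
    "13.0\t7.58\t13.0\t8.74\t13.0\t12.74\t8.0\t7.71\n"
)


def row_iter(source: Iterable[str]) -> Iterator[List[str]]:
    """Split each line of the source on tab characters."""
    return csv.reader(source, delimiter="\t")


def data_rows(
    rows: Iterable[List[str]], heading_count: int = 3
) -> Iterator[List[str]]:
    """Skip the assumed heading rows and yield only the data rows."""
    for index, row in enumerate(rows):
        if index >= heading_count:
            yield row


def series(n: int, rows: Iterable[List[str]]) -> Iterator[Tuple[float, float]]:
    """Yield (x, y) pairs for series n (0-based) as floats."""
    for row in rows:
        yield float(row[2 * n]), float(row[2 * n + 1])


with io.StringIO(SAMPLE) as source:
    series_1 = list(series(0, data_rows(row_iter(source))))

with io.StringIO(SAMPLE) as source:
    series_4 = list(series(3, data_rows(row_iter(source))))
```

Each stage is a generator (or returns an iterator), so the stages compose without building intermediate lists; only the final `list` call materializes the cleaned pairs.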
