Chapter 2. Getting Started with Apache Hadoop and Apache Spark
In this chapter, we will understand the basics of Hadoop and Spark, how Spark is different from MapReduce, and get started with the installation of clusters and setting up the tools needed for analytics.
This chapter is divided into the following subtopics:
Introducing Apache Hadoop
Introducing Apache Spark
Discussing why we use Hadoop with Spark
Installing Hadoop and Spark clusters