Introducing YARN
Yet Another Resource Negotiator (YARN) separates resource management and scheduling from application processing. This separation helps the cluster use its resources more fully, although actual utilization depends on the workload and configuration. YARN allocates the cluster's CPU and memory according to the configured Hadoop scheduler policy. It supports many types of applications and is not restricted to MapReduce. Applications can be written in any language, provided the required binaries can be installed on the Hadoop cluster.
YARN architecture
Let's understand the YARN architecture in detail in the following sections.
Resource manager
The resource manager is responsible for tracking the resources in a cluster and scheduling applications. The resource manager has two main components: the scheduler and the applications manager.
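To make the two responsibilities concrete, here is a minimal Python sketch of a resource manager that tracks free memory and vcores per node and grants queued container requests in FIFO order. It is a toy model, not YARN's implementation or API: the names ToyResourceManager, Node, and ContainerRequest, the first-fit placement, and the strict FIFO policy are all assumptions made for illustration.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Node:
    """Free capacity the toy resource manager tracks for one node (hypothetical)."""
    name: str
    free_memory_mb: int
    free_vcores: int


@dataclass
class ContainerRequest:
    """A request for one container on behalf of an application (hypothetical)."""
    app_id: str
    memory_mb: int
    vcores: int


class ToyResourceManager:
    """Illustrative only: tracks node capacity and schedules requests FIFO."""

    def __init__(self, nodes):
        self.nodes = {n.name: n for n in nodes}
        self.pending = deque()   # requests accepted but not yet granted
        self.allocations = []    # (app_id, node_name) pairs granted so far

    def submit(self, request):
        """Accept a request and queue it for scheduling."""
        self.pending.append(request)

    def schedule(self):
        """Grant queued requests in order; stop at the first that does not fit."""
        while self.pending:
            req = self.pending[0]
            node = self._first_fit(req)
            if node is None:
                break  # strict FIFO: later requests wait behind this one
            node.free_memory_mb -= req.memory_mb
            node.free_vcores -= req.vcores
            self.allocations.append((req.app_id, node.name))
            self.pending.popleft()
        return self.allocations

    def _first_fit(self, req):
        """Return the first node with enough free memory and vcores, else None."""
        for node in self.nodes.values():
            if (node.free_memory_mb >= req.memory_mb
                    and node.free_vcores >= req.vcores):
                return node
        return None


rm = ToyResourceManager([Node("node1", 4096, 2), Node("node2", 8192, 4)])
rm.submit(ContainerRequest("app_1", 2048, 1))
rm.submit(ContainerRequest("app_2", 6144, 2))
rm.submit(ContainerRequest("app_3", 4096, 2))
print(rm.schedule())
```

In this run the first two requests are placed on node1 and node2, and the third stays queued because no single node has 4096 MB free. YARN's real schedulers apply richer policies (queues, capacities, fairness), so this sketch only illustrates the split between tracking resources and deciding who gets them.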
Node manager
The node manager is responsible for launching and managing containers on a node. Containers execute tasks as specified by the application master. The node manager acts as a slave to the resource manager. Each node manager...