Chapter 8. Data Processing using Apache Flink
By now, I am sure you have got the approach of each chapter in this part of the book. This chapter follows the same approach. It will introduce the Data Ingestion Layer initially and then it will make a technology mapping, in our case, Apache Flink.
Handling both stream and batch data and appropriately processing it is an important feature required for our Data Lake implementation, and Flink is the choice for us. In this chapter, we will give you just enough details that you need to know about Flink to execute the Data Lake use case in hand. Covering Flink in its full aspects is out of the scope of this book and would take a book in itself. We will initially dive into Flink’s core strengths and weaknesses, followed by its architecture and important components. We will then delve deep into an actual hand on coding session of Flink and the connection with our SCV use case.
Finally, we will explain some of the alternate technologies that you can think...