Azure Data Lake
Azure Data Lake is a big data storage and analytics service that can store an unlimited amount of structured, semi-structured, or unstructured data. It is based on the Hadoop Yes Another Resource Negotiator (YARN) cluster management platform, which can scale dynamically across Azure SQL Server instances or instances of Azure SQL Data Warehouse.
Note
For more information about Hadoop YARN, you can refer to the Hadoop website at https://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/YARN.html.
Hadoop YARN offers three types of solutions:
- Azure Data Lake Store
- Azure Data Lake Analytics
- Azure HDInsight
Azure Data Lake Store
Azure Data Lake Store is a storage repository for big data workloads, where you can store raw data. A data lake is a container where you can store all kinds of data, such as structured, semi-structured, and unstructured data. Data is still unprocessed when it is added to the data lake. This is different from a data warehouse, where you store structured...