Summary and Quiz

Get a refresher on the Analytics section, and take a short quiz to test your knowledge.

We'll cover the following

In this lesson, we’ll summarize what we have learned about the analytics services provided by AWS. Also, we’ll test our knowledge through a quiz.

Summary

Here is a summary of key takeaways from the Analytics section:

  • AWS EMR: Amazon Elastic Map Reduce is a fully managed service that helps us process and analyze large amounts of data. It simplifies running big data frameworks like Hadoop and Spark on AWS for data processing and analysis. Each EMR cluster has a primary, task, and core node.

  • Amazon Redshift: It is a fully managed data warehousing service. It supports a multi-query engine with a massively parallel architecture, which allows us to perform complex queries on large amounts of data in less time. Each Redshift cluster consists of a leader node responsible for managing the queries and results and multiple compute nodes to perform parallel queries on the data.

  • AWS Glue: AWS Glue is a serverless data integration service facilitating easy discovery, preparation, movement, and integration of data from multiple sources, aiding analytics, machine learning, and application development. AWS Glue automates data discovery through a data crawler, organizes data on a data catalog, and creates ETL jobs to extract, transform, and load data.

  • AWS Lake Formation: AWS Lake Formation helps organize the data for easier analysis. It gathers data from various sources like databases and file storage into the data lake. It is built on AWS Glue and utilizes features like data catalog, crawlers, and jobs.

Get hands-on with 1200+ tech skills courses.