Questions 7 to 9

Explanations for questions 7 to 9

We'll cover the following

Question 7

A company has an eCommerce application that runs from multiple AWS Regions. Each region has a separate database running on Amazon EC2 instances. The company plans to consolidate the data to a columnar database and run analytics queries. Which approach should the company take?

  1. Run an AWS Batch job to copy and process the data into a columnar Amazon RDS database. Use Amazon Athena to analyze the data.
  2. Use the COPY command to load data into an Amazon RedShift data warehouse, and run the analytics queries there.
  3. Launch Amazon Kinesis Data Streams producers to load data into a Kinesis Datastream. Use Kinesis Data Analytics to analyze the data.
  4. Create an AWS Lambda function that copies the data onto Amazon S3. Use Amazon S3 Select to query the data.

Correct Answer: 2

Explanation: Amazon Redshift is an enterprise-level, petabyte-scale, fully managed data warehousing service. It uses columnar storage to improve the performance of complex queries.

You can use the COPY command to load data in parallel from one or more remote hosts, such as Amazon EC2 instances or other computers. COPY connects to the remote hosts using SSH and executes commands on the remote hosts to generate text output.

INCORRECT: “Run an AWS Batch job to copy and process the data into a columnar Amazon RDS database. Use Amazon Athena to analyze the data.” is incorrect. AWS Batch is used for running batch computing jobs across a fleet of EC2 instances. You cannot create a “columnar Amazon RDS database” as RDS is optimized for transactional workloads. Athena is used to analyze data on S3.

CORRECT: “Use the COPY command to load data into an Amazon RedShift data warehouse, and run the analytics queries there.” is the correct answer.

INCORRECT: “Launch Amazon Kinesis Data Streams producers to load data into a Kinesis Data stream. Use Kinesis Data Analytics to analyze the data.” is incorrect. Kinesis is a real-time streaming data service. It is not a columnar database, so it is unsuitable for this use case.

INCORRECT: “Create an AWS Lambda function that copies the data onto Amazon S3. Use Amazon S3 Select to query the data.” is incorrect. S3 is not a columnar database, and S3 select does not run analytics queries; it simply selects data from an object to retrieve.

References: https://docs.aws.amazon.com/redshift/latest/dg/loading-data-from-remote-hosts.html

Level up your interview prep. Join Educative to access 70+ hands-on prep courses.