4.2. Data Analysis on AWS
š” First Principle: Data analysis is where pipeline investment pays off ā analysts and scientists extract insights from the data you've ingested, transformed, and stored. The choice of analysis tool depends on who the consumer is and how they work. SQL-fluent analysts prefer Athena or Redshift; visual explorers prefer QuickSight dashboards; data scientists prefer notebooks. Unlike building the pipeline (your job), analysis is the consumer's experience ā so picking the right tool means thinking about the end user, not just the data.
What happens when you give a business analyst a Jupyter notebook? Or hand a data scientist an Athena console? They'll waste time fighting the tool instead of analyzing data. Matching the analysis tool to the consumer's skill set is as important as matching the data store to the access pattern. The exam tests this through scenarios that describe the consumer's role and preferences.