Amazon Bedrock Foundations and Model Selection
Understand Amazon Bedrock's architecture and how it offers a unified API to access multiple foundation models. Learn to select the right model family based on task type, cost, latency, and output quality. Discover how the Converse API enables flexible deployment without rewriting code, and explore practical strategies to evaluate and benchmark generative AI models for real-world AWS applications.
We'll cover the following...
Amazon Bedrock is the AWS-managed gateway to foundation models, and it is one of the most heavily tested services on the AWS Certified Machine Learning Engineer – Associate exam. Candidates who understand how Bedrock provides a unified API for invoking models from different providers, how an application selects the target model using the modelId parameter, how the Converse API abstracts model-specific details, and how to select the right model based on cost, latency, and quality trade-offs will be well positioned to answer a significant portion of the generative AI questions. This lesson walks through the Bedrock architecture, the available model families, the unified API layer, and a practical decision framework that maps directly to exam scenarios.
The lesson covers three areas. First, you will learn which foundation model families are available and what each excels at. Second, you will understand how the Converse API decouples your application from any single provider. Third, you will develop a structured approach to model selection trade-offs.
Attention: A common exam mistake is choosing SageMaker-based custom training or self-hosted inference when the scenario describes a standard generative AI task. Bedrock is typically the simplest and most cost-effective answer for standard generative AI tasks unless the question explicitly requires custom model training, unsupported model types, or full infrastructure control.
Foundation models available in Bedrock
Amazon Bedrock provides access to
The key model families and their strengths break down as follows:
Amazon Nova (text, chat, and reasoning): AWS’s flagship multimodal models. They are tiered by capability (for example, Micro and Lite for fast, low-cost tasks; Pro and Premier for complex reasoning, math, and agentic workflows).
Amazon Nova (media and speech): Domain-specific models for rich media. This includes Nova Canvas (image generation), Nova Reel (video generation), and Nova Sonic (real-time speech recognition and generation).
Amazon Embeddings ...