Implementation and Integration II
Understand and evaluate deployment options for generative AI applications on AWS, focusing on cost efficiency, scaling, and model routing. Learn to implement reliable, scalable architectures using Amazon Bedrock, SageMaker endpoints, serverless inference, and custom container deployments to meet various business and technical requirements for generative AI workloads.
Question 26
A company is building a customer support chatbot using Amazon Bedrock. Traffic is highly variable, ranging from zero users overnight to sudden spikes of up to 1,000 concurrent users during promotions. The company wants to minimize cost when idle while maintaining compatibility with the Amazon Bedrock API.
Which deployment approach is most appropriate?
A. Amazon Bedrock provisioned throughput
B. AWS Lambda invoking Amazon Bedrock on demand
C. SageMaker AI real-time endpoint with auto scaling
D. SageMaker AI serverless inference endpoint
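To make the scenario concrete, here is a minimal sketch of what "invoking Amazon Bedrock on demand from AWS Lambda" (option B) could look like. The model ID, request schema, and `prompt` field in the event are illustrative assumptions, not part of the question; the `bedrock-runtime` client's `invoke_model` call is the standard boto3 API.

```python
# Hypothetical sketch: a Lambda handler that forwards a user prompt to
# Amazon Bedrock on demand. With this pattern you pay per invocation,
# so cost is near zero when the chatbot is idle.
import json


def build_request(prompt: str, max_tokens: int = 512) -> str:
    """Build an Anthropic-messages-style Bedrock request body (assumed schema)."""
    return json.dumps({
        "anthropic_version": "bedrock-2023-05-31",
        "max_tokens": max_tokens,
        "messages": [{"role": "user", "content": prompt}],
    })


def handler(event, context):
    # boto3 ships with the Lambda runtime; imported lazily here so the
    # module can be loaded and unit-tested without AWS dependencies.
    import boto3

    bedrock = boto3.client("bedrock-runtime")
    response = bedrock.invoke_model(
        modelId="anthropic.claude-3-haiku-20240307-v1:0",  # illustrative model ID
        body=build_request(event["prompt"]),
    )
    answer = json.loads(response["body"].read())
    return {"statusCode": 200, "body": json.dumps(answer)}
```

Because Lambda scales out automatically per request and bills nothing while idle, this pattern fits the "zero overnight, spiky during promotions" traffic profile described above.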
Question 27
A company has fine-tuned a large language model for legal document analysis. The application is used intermittently by internal analysts throughout the day, with periods of low usage and occasional bursts of activity. The company wants to minimize cost when the system is idle while still supporting near–real-time inference when requests arrive. Occasional cold starts are acceptable, but the solution must automatically scale without manual capacity management.
Which deployment option best ...