Built-In Algorithms and Custom Training

Explore how to choose the right SageMaker approach for machine learning tasks from built-in algorithms to custom Script Mode training. Understand the trade-offs between interpretability and performance, and learn how to integrate pretrained and external models to optimize ML workflows on AWS.

We'll cover the following...

SageMaker built-in algorithms
- Key algorithms mapped to problem types
Interpretability vs. performance
Fine-tuning pretrained models
- SageMaker JumpStart vs. Amazon Bedrock
Integrating external models and Script Mode
- Bring Your Own Model (BYOM)
- SageMaker Script Mode
  - Key components of a Script Mode setup
Conclusion

Selecting the right algorithm or model architecture for a given ML problem is one of the most consequential decisions in the machine learning life cycle, and it is heavily tested on the AWS Certified Machine Learning Engineer – Associate exam. Within Amazon SageMaker, this decision follows a structured workflow. SageMaker provides multiple pathways, ranging from fully managed built-in algorithms for standard tasks to pretrained model hubs like SageMaker JumpStart and Amazon Bedrock for transfer learning, to Script Mode for writing fully custom training logic in TensorFlow or PyTorch. The decision-making hierarchy follows a managed-first escalation principle: start with the simplest managed option that fits the problem, and escalate to custom training only when managed options fall short.

Attention: A common exam pitfall is selecting custom model development (Script Mode or BYOM) when a built-in algorithm or JumpStart model would solve the problem with far less operational overhead. Always evaluate managed options first.

This lesson walks through each pathway in order of increasing complexity, covers the interpretability vs. performance trade-off that governs algorithm selection, and concludes with how to integrate externally trained models into SageMaker for deployment. Once you make these decisions, the next lesson, Training Jobs and Data Access Patterns, covers how training data flows into these jobs and how compute is optimized.

SageMaker built-in algorithms

SageMaker built-in algorithms are prepackaged, optimized ML implementations that run inside managed containers. They require no custom training code. A data scientist specifies the algorithm, points to training data in Amazon S3, configures hyperparameters, and launches a training job. SageMaker handles container orchestration, distributed training, and hardware optimization automatically.

Key algorithms mapped to problem types

Each built-in algorithm targets a specific category of ML problem, and the exam expects candidates to match algorithms to business scenarios quickly.

Linear learner: Supports both regression and binary/multiclass classification on tabular data, producing models with high interpretability because of their linear decision boundaries.
XGBoost: A ...

1.Introduction and Exam Strategy

2.AWS Core Services for MLA-C01

Cloud Lab

Cloud Lab

Cloud Lab

3.Machine Learning Foundations for AWS Engineer

4.SageMaker and Secure ML Environments

5.Data Ingestion and Storage Architectures

Cloud Lab

Cloud Lab

6.Data Transformation and Feature Engineering

Cloud Lab

Cloud Lab

Cloud Lab

Cloud Lab

Cloud Lab

7.Data Quality, Labelling, and Governance

Cloud Lab

Cloud Lab

8.Managed AI and Generative AI Solutions

Cloud Lab

Cloud Lab

Cloud Lab

Cloud Lab

9.Model Development, Optimisation, and Management

Cloud Lab

10.Deployment, Inference, and Orchestration

Cloud Lab

Cloud Lab

Cloud Lab

Cloud Lab

11.Monitoring and Cost Optimisation

12.Conclusion

Assessment

13.Practice Exam Solution - AWS Certified Machine Learning Engineer

14.Free AWS Certified Machine Learning Engineer Associate Practice

Built-In Algorithms and Custom Training

SageMaker built-in algorithms

Key algorithms mapped to problem types