Copyright (c) 2026 MindMesh Academy. All rights reserved. This content is proprietary and may not be reproduced or distributed without permission.

2.1.2. The Bedrock Model Catalog: Claude, Titan, Llama, and Beyond

💡 First Principle: The Bedrock model catalog is organized by provider and capability tier — understanding which providers offer which capabilities, and which models within each provider suit which task types, is prerequisite to any selection decision.

Primary model families on Amazon Bedrock (AIP-C01 relevant):
ProviderModel FamilyStrengthsContext WindowBest For
AnthropicClaude 3 HaikuFast, low cost200K tokensHigh-volume simple tasks, classification
AnthropicClaude 3 SonnetBalanced200K tokensGeneral-purpose reasoning, code
AnthropicClaude 3 OpusHighest capability200K tokensComplex reasoning, long docs
AmazonTitan Text LiteCost-optimized4K tokensSimple tasks, tight budgets
AmazonTitan Text PremierBalanced32K tokensGeneral enterprise use
AmazonTitan Embeddings v21024-dim embeddingsRAG, semantic search
AmazonTitan MultimodalImage + textProduct catalog, image Q&A
MetaLlama 3 (various)Open weights, customizable8K–128KWhen open-source licensing needed
Mistral AIMistral/MixtralEfficient, multilingual32K tokensEuropean data residency requirements
Stability AIStable Diffusion XLImage generationCreative content, product images

Cross-Region Inference: When a model is not available in your required AWS region (common for new model releases), Bedrock's cross-region inference automatically routes your request to the nearest region where the model is available. This is transparent to your application — you use the same API call with a cross-region inference profile ARN.

# Cross-region inference profile — handles routing automatically
response = bedrock_runtime.invoke_model(
    modelId='arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-3-5-sonnet-20241022-v2:0',
    # If us-east-1 lacks capacity, Bedrock routes to us-west-2 or eu-west-1
    body=json.dumps({'messages': [...], 'max_tokens': 1000})
)

⚠️ Exam Trap: Cross-region inference means your data may leave your primary AWS region to be processed in another region. For workloads with strict data residency requirements (GDPR, HIPAA data that must stay in eu-west-1), cross-region inference must be disabled or configured with region constraints. The exam specifically tests this data residency conflict.

Reflection Question: A European healthcare company processes patient data and wants to use a foundation model available only in us-east-1. They have strict GDPR data residency requirements preventing patient data from leaving the EU. What is the correct architectural approach?

Alvin Varughese
Written byAlvin Varughese
Founder15 professional certifications