🔴 Domain 3 — AI Safety, Security & Governance

Subdomains 3.1–3.4 · You scored 0% on ALL subdomains in Set 1 — highest priority domain

3.1 — Input & Output Safety Controls (Guardrails)

Guardrails Component Map — Know What Each Part Does

| Component | What It Catches | Key Exam Trigger |
|---|---|---|
| Content filters | Harmful content: violence, hate speech, sexual content, insults — by category and severity level | "Block toxic or offensive AI responses" |
| Denied topics | Specific off-topic subjects you define (e.g., competitors, cryptocurrencies, politics) | "Prevent discussion of [specific topic]" |
| Word filters | Specific words or phrases you blocklist | "Block profanity" or custom forbidden terms |
| Sensitive info / PII | Credit cards, SSNs, phone numbers, emails — detect and/or redact in input and output | "Prevent PII exposure in AI responses" |
| Contextual grounding | Hallucinations — responses not supported by retrieved context | "Ensure RAG responses are grounded in retrieved docs" |
| Prompt attack filter | Prompt injection, jailbreaks, instruction override attempts | "Prevent users from overriding the system prompt" or "SQL injection via prompt" |
The most common trap: using "denied topics" to block injection attacks. Denied topics = content restrictions (what the model talks about). Prompt attack filter = security (preventing manipulation of the model). These are completely different things.
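
The component map above can be made concrete in a single guardrail definition. A minimal sketch, assuming the request shapes of boto3's `bedrock.create_guardrail`; the name, topic, words, and strengths are illustrative placeholders, not a known-good production config. Note that the prompt attack filter is configured as a content filter type (`PROMPT_ATTACK`), not as a denied topic:

```python
# Sketch: one create_guardrail request combining the policy types above.
# All names/values are illustrative assumptions.
guardrail_request = {
    "name": "demo-guardrail",
    "blockedInputMessaging": "Sorry, I can't help with that.",
    "blockedOutputsMessaging": "Sorry, I can't share that.",
    # Content filters: harmful categories AND the prompt-attack filter --
    # PROMPT_ATTACK lives here, not under denied topics.
    "contentPolicyConfig": {
        "filtersConfig": [
            {"type": "HATE", "inputStrength": "HIGH", "outputStrength": "HIGH"},
            {"type": "PROMPT_ATTACK", "inputStrength": "HIGH", "outputStrength": "NONE"},
        ]
    },
    # Denied topics: content restrictions, NOT injection defense.
    "topicPolicyConfig": {
        "topicsConfig": [
            {"name": "Crypto", "definition": "Advice about cryptocurrencies.", "type": "DENY"}
        ]
    },
    # Word filters: custom blocklist plus the managed profanity list.
    "wordPolicyConfig": {
        "wordsConfig": [{"text": "ProjectCodename"}],
        "managedWordListsConfig": [{"type": "PROFANITY"}],
    },
    # Sensitive info: redact emails rather than block the whole response.
    "sensitiveInformationPolicyConfig": {
        "piiEntitiesConfig": [{"type": "EMAIL", "action": "ANONYMIZE"}]
    },
}
# boto3.client("bedrock").create_guardrail(**guardrail_request)
```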

Guardrails Monitoring — The Right Metric Dimension

To monitor which guardrail policies are triggering, use the InvocationsIntervened metric filtered by the GuardrailPolicyType dimension.
GuardrailPolicyType dimension values: ContentPolicy · TopicPolicy · SensitiveInformationPolicy · WordPolicy · GroundingPolicy
GuardrailContentSource = filters by whether the intervention was on INPUT or OUTPUT — a different dimension. The exam will swap these two dimensions on you.
Always enable tracing first: set {"trace": "enabled"} in guardrailConfig on your API call. Without this, you get no per-policy intervention details.
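
A minimal sketch of the tracing setup, assuming the `guardrailConfig` shape of boto3's `bedrock-runtime` Converse API; the model ID, guardrail ID, and version are placeholders:

```python
# Sketch: a Converse call with guardrail tracing enabled so per-policy
# intervention details appear in the response trace.
converse_kwargs = {
    "modelId": "anthropic.claude-3-haiku-20240307-v1:0",
    "messages": [{"role": "user", "content": [{"text": "Hello"}]}],
    "guardrailConfig": {
        "guardrailIdentifier": "gr-EXAMPLE",   # placeholder guardrail ID
        "guardrailVersion": "1",
        "trace": "enabled",                     # without this: no per-policy details
    },
}
# resp = boto3.client("bedrock-runtime").converse(**converse_kwargs)
# resp["trace"]["guardrail"] then carries the per-policy assessment details.
```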

Enforcing Guardrails — IAM Condition Key (Not Lambda Proxy)

To enforce that ALL calls to Bedrock use a specific guardrail, add this IAM condition to the policies of every role that accesses Bedrock:
bedrock:GuardrailIdentifier condition key on InvokeModel and Converse API actions. This makes it impossible for any caller to skip the guardrail.
The Lambda proxy approach (routing all Bedrock calls through a validation Lambda) adds latency, creates a single point of failure, and still requires additional enforcement to prevent direct Bedrock access. IAM conditions enforce at the API level — no bypass is possible.
Scenario: "ensure ALL FM interactions are secured with guardrails, no exceptions" → IAM condition key bedrock:GuardrailIdentifier applied to all roles accessing Bedrock.
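
A hedged sketch of the deny-unless-guardrail pattern: an explicit Deny that fires whenever an invocation does not carry the expected guardrail. The account ID, region, and guardrail ID are placeholders; verify the condition-key semantics against the current Bedrock IAM documentation:

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "DenyInvokeWithoutGuardrail",
      "Effect": "Deny",
      "Action": [
        "bedrock:InvokeModel",
        "bedrock:InvokeModelWithResponseStream"
      ],
      "Resource": "*",
      "Condition": {
        "StringNotLike": {
          "bedrock:GuardrailIdentifier": "arn:aws:bedrock:us-east-1:111122223333:guardrail/gr-EXAMPLE*"
        }
      }
    }
  ]
}
```

Because the Deny is evaluated on every call, a caller cannot skip the guardrail even if some other policy grants bedrock:InvokeModel.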

Guardrails for Non-Bedrock Models (SageMaker)

Even if your FM is deployed on SageMaker (not Bedrock), you can still apply Bedrock Guardrails to its I/O by calling the standalone ApplyGuardrail API against the model's input and output.
Pattern: SageMaker inference → Lambda post-processor → EventBridge event → Lambda applies Bedrock Guardrails to the output.
SageMaker Clarify = bias detection and explainability for ML training. It is not a runtime content safety tool. Using SageMaker Clarify + Model Monitor to enforce usage policies is the wrong architecture.
Bedrock Guardrails = runtime content safety for any FM (Bedrock or SageMaker). SageMaker Clarify = training-time fairness analysis.
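
A minimal sketch of screening a SageMaker-hosted model's output, assuming the request shape of boto3's `bedrock-runtime` `apply_guardrail`; the guardrail ID and version are placeholders:

```python
# Sketch: standalone ApplyGuardrail on text that did NOT come from Bedrock.
def build_apply_guardrail_request(model_output: str) -> dict:
    return {
        "guardrailIdentifier": "gr-EXAMPLE",  # placeholder guardrail ID
        "guardrailVersion": "1",
        "source": "OUTPUT",                   # use "INPUT" to screen the prompt instead
        "content": [{"text": {"text": model_output}}],
    }

req = build_apply_guardrail_request("text returned by the SageMaker endpoint")
# resp = boto3.client("bedrock-runtime").apply_guardrail(**req)
# resp["action"] == "GUARDRAIL_INTERVENED" means the text was blocked/redacted.
```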

WAF vs Guardrails — Two Different Attack Surfaces

Bedrock Guardrails Handles

  • Harmful AI content (toxic, violent, offensive)
  • Prompt injection / jailbreaks
  • PII in AI input/output
  • Off-topic AI responses
  • RAG hallucinations (contextual grounding)
  • ✓ AI-layer safety and control

AWS WAF Handles

  • HTTP-layer attacks (SQLi, XSS, path traversal)
  • DDoS protection at the CloudFront/ALB layer
  • Bot protection, rate limiting
  • IP reputation blocking
  • ✗ Does NOT understand AI content
  • ✗ Does NOT prevent prompt injection

WAF protects your web application's HTTP layer. Guardrails protect your AI model's I/O. A prompt injection attack passes through WAF as valid HTTP — WAF has no concept of "this text is trying to jailbreak an LLM." Only Guardrails understands that.

Contextual Grounding for RAG Quality

Contextual grounding checks compare the model's response against the retrieved context. If the model says something that isn't supported by the retrieved documents, the grounding check intervenes.
Scenario: "RAG application, need to ensure responses are based only on retrieved documents, prevent hallucination" → enable contextual grounding checks in Bedrock Guardrails. Not SageMaker Clarify, not custom Lambda validation.
Two thresholds: groundedness (response supported by context) and relevance (response addresses the query). Both configurable.
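
The two thresholds can be sketched as the contextual grounding section of a guardrail definition. This assumes the `contextualGroundingPolicyConfig` shape of boto3's `bedrock.create_guardrail`; the 0.8/0.7 values are illustrative (higher = stricter):

```python
# Sketch: contextual grounding thresholds for a Bedrock guardrail.
contextual_grounding_config = {
    "filtersConfig": [
        {"type": "GROUNDING", "threshold": 0.8},   # response supported by retrieved context
        {"type": "RELEVANCE", "threshold": 0.7},   # response actually addresses the query
    ]
}
# Passed as contextualGroundingPolicyConfig=... to bedrock.create_guardrail.
```
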
3.2 — Data Security & Privacy Controls

IAM Identity Center — The Enterprise Identity Answer

AWS IAM Identity Center is the right answer whenever the scenario involves enterprise employees needing access to AWS services, especially when a corporate directory (Active Directory, Microsoft Entra ID, Okta) is already in use.
Pattern: Corporate IdP (Entra ID / AD) → IAM Identity Center → Permission Sets (define what access each role gets) → assigned to AWS accounts. Permission sets can include Bedrock model-specific access conditions.
Creating IAM users that mirror Active Directory usernames is an anti-pattern. You end up managing two identity systems, credentials can drift, and there's no single source of truth. IAM Identity Center federates to AD — users sign in with their AD credentials.
Scenario mentions: "corporate Active Directory," "Microsoft Entra ID," "employees need access," "consistent permissions across accounts" → IAM Identity Center + permission sets. Every time.

OIDC + Cognito for App Authentication (Not IAM Users)

For third-party apps or web/mobile apps that need to access AWS resources (including Bedrock), use: existing IdP → Amazon Cognito via OIDC → Cognito exchanges IdP tokens for temporary AWS credentials via STS.
Temporary credentials from STS expire automatically — no long-lived secrets to rotate, no Secrets Manager needed for this purpose. That's the security advantage.
Creating IAM users + Secrets Manager rotation is operationally heavy and still uses long-lived credentials. OIDC + Cognito = federated, temporary, auditable. IAM users = static credentials, rotation risk.
Cognito user pools = user directory for authentication. Cognito identity pools = exchange third-party tokens for AWS credentials. The latter is what you need for Bedrock access.
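
The identity-pool exchange can be sketched as two calls, assuming boto3's `cognito-identity` API (`get_id` then `get_credentials_for_identity`); the pool ID and provider name are placeholders for whatever OIDC provider is configured on the pool:

```python
# Sketch: trade an OIDC token for temporary AWS credentials via an
# identity pool. No long-lived keys are created anywhere in this flow.
def exchange_oidc_token(cognito, identity_pool_id: str, provider: str, id_token: str) -> dict:
    logins = {provider: id_token}
    # Resolve (or create) the Cognito identity for this federated user.
    identity = cognito.get_id(IdentityPoolId=identity_pool_id, Logins=logins)
    # Exchange the token for STS-backed temporary credentials.
    creds = cognito.get_credentials_for_identity(
        IdentityId=identity["IdentityId"], Logins=logins
    )
    return creds["Credentials"]  # access key, secret, session token; auto-expiring

# client = boto3.client("cognito-identity")
# creds = exchange_oidc_token(client, "us-east-1:pool-id", "oidc.example.com", token)
```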

VPC Endpoint Type for Amazon Bedrock

Amazon Bedrock uses a VPC Interface endpoint (PrivateLink). Traffic stays on the AWS private network — it never traverses the public internet.
VPC Gateway endpoints only work with Amazon S3 and Amazon DynamoDB. Everything else — Bedrock, SageMaker, SSM, Secrets Manager, Kinesis — uses Interface endpoints.
Scenario: "application in private subnet must call Bedrock without internet access" → VPC Interface endpoint for the Bedrock Runtime service, with private DNS enabled. (Route table updates are a Gateway-endpoint concept — Interface endpoints work via DNS and an ENI.)
An Interface endpoint creates an Elastic Network Interface (ENI) in your subnet. Your app resolves the Bedrock endpoint to this private IP, keeping all traffic inside AWS.
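
A minimal sketch of the endpoint creation, assuming boto3's `ec2.create_vpc_endpoint`; the region, VPC, subnet, and security group IDs are placeholders:

```python
# Sketch: Interface endpoint so private-subnet apps reach Bedrock Runtime
# over PrivateLink instead of the public internet.
endpoint_request = {
    "VpcEndpointType": "Interface",          # Gateway type is only for S3/DynamoDB
    "VpcId": "vpc-0123456789abcdef0",
    "ServiceName": "com.amazonaws.us-east-1.bedrock-runtime",
    "SubnetIds": ["subnet-0123456789abcdef0"],
    "SecurityGroupIds": ["sg-0123456789abcdef0"],
    "PrivateDnsEnabled": True,               # resolve the public endpoint name to the ENI's private IP
}
# boto3.client("ec2").create_vpc_endpoint(**endpoint_request)
```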

PII Detection Toolkit — Right Tool for Right Job

| Tool | Use For | How It Works |
|---|---|---|
| Amazon Comprehend | Detect + redact PII in text (emails, transcripts, documents) | NLP API — real-time or batch. Returns entity types with offsets. Can redact in place. |
| Amazon Macie | Discover sensitive data (PII) across S3 buckets at scale | Managed service — scans S3 continuously. Custom classifiers for specific data types. |
| Amazon Textract | Extract text from scanned documents, images, PDFs | OCR + form extraction. Extracts the text — does not analyze it for PII. |
| Bedrock Guardrails PII | Block/redact PII in LLM I/O at inference time | Intercepts prompts and responses, redacts configured entity types |

The exam uses Textract + Macie together as a distractor for email PII scenarios. Textract is OCR (for scanned images). Macie is for S3 bucket discovery, not active redaction. For text PII → Comprehend.
Real-time stream PII = Comprehend real-time API. Batch historical data = Comprehend batch. S3 bucket-level discovery = Macie with custom classifiers. Runtime AI I/O = Guardrails.
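
The Comprehend flow can be sketched as: detect entities (returning types with character offsets), then redact using those offsets. The detection call is commented out so the redaction helper runs on sample entities shaped like Comprehend's `detect_pii_entities` output:

```python
# Sketch: offset-based PII redaction over Comprehend entity results.
def redact(text: str, entities: list) -> str:
    # Replace spans from the end of the string so earlier offsets stay valid.
    for e in sorted(entities, key=lambda e: e["BeginOffset"], reverse=True):
        text = text[: e["BeginOffset"]] + f"[{e['Type']}]" + text[e["EndOffset"] :]
    return text

# entities = boto3.client("comprehend").detect_pii_entities(
#     Text=sample, LanguageCode="en")["Entities"]
sample = "Contact jane@example.com or 555-0100."
entities = [
    {"Type": "EMAIL", "BeginOffset": 8, "EndOffset": 24},
    {"Type": "PHONE", "BeginOffset": 28, "EndOffset": 36},
]
print(redact(sample, entities))  # Contact [EMAIL] or [PHONE].
```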
3.3 — AI Governance & Compliance

Bedrock Prompt Management — Governance Features

Prompt Management provides: versioning (track every change to a template), review workflows (approve before activating), parameterized templates (reusable with variables), and role definitions (system/user/assistant roles embedded in the template).
Auto-activating new versions as they save bypasses the review workflow entirely — that's the anti-pattern. The exam distinguishes between "versioning with review" and "versioning with auto-activation." Review = governance. Auto-activate = no governance.
Scenario: "governance requirement, all prompt changes must be approved before deployment" → Prompt Management with review workflow + versioning. Not auto-activation.

Compliance Stack — What Belongs Where

The correct governance + compliance stack:

Bedrock Prompt Management → manage parameterized templates, versioning, review workflows
Bedrock Guardrails → enforce content safety policies (not RBAC, not compliance logging)
AWS CloudTrail → audit log of API calls (who called what Bedrock API, with which template, when) — this is your compliance record
IAM + permission sets → role-based access control (who can invoke which models)
CloudWatch Logs = operational monitoring (errors, latency). CloudTrail = compliance audit trail (API activity). For "compliance" and "audit trail" questions → CloudTrail. Never CloudWatch.
Avoid: "use Guardrails for RBAC." Guardrails filter content — they don't control who can invoke which model. RBAC = IAM.
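
Querying the compliance record can be sketched with CloudTrail's event lookup, assuming boto3's `cloudtrail.lookup_events` and the `bedrock.amazonaws.com` event source; treat the filter value as an assumption to verify:

```python
# Sketch: pull recent Bedrock API activity from the CloudTrail audit trail.
lookup_request = {
    "LookupAttributes": [
        {"AttributeKey": "EventSource", "AttributeValue": "bedrock.amazonaws.com"}
    ],
}
# events = boto3.client("cloudtrail").lookup_events(**lookup_request)["Events"]
# Each event records the caller identity, API name, and timestamp --
# the "who called what, when" compliance record described above.
```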

Prompt Lineage — Tracking Template History

Prompt lineage = tracking which version of a template was used for which invocation, and what changed between versions.
Implementation: Bedrock Prompt Management stores template versions → AWS CloudTrail records the specific template version ARN used in each Bedrock API call → you can reconstruct exactly what prompt was sent and when.
CloudWatch Logs stores the text of invocations but not the template version management. S3 object tags are a manual approach with no native versioning integration. Prompt Management + CloudTrail is the AWS-native lineage solution.

Output Source Traceability

When you want to know "which document from the knowledge base influenced this response?", the pattern is: tag FM outputs with metadata from the data source at generation time — include the source document ID, title, and chunk reference in the response metadata.
Model invocation logs tell you what went in and came out of the LLM, but not which retrieved KB chunk was the source of truth. Tagging the output explicitly at generation time is how you get end-to-end traceability.
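
The tagging pattern can be sketched against knowledge-base citations. The response shape below mirrors what I'd expect from `bedrock-agent-runtime` `retrieve_and_generate` (an answer plus `citations` with `retrievedReferences`); the exact field names are assumptions to verify against current docs:

```python
# Sketch: attach source-document metadata to a generated answer.
def tag_with_sources(response: dict) -> dict:
    sources = []
    for citation in response.get("citations", []):
        for ref in citation.get("retrievedReferences", []):
            sources.append({
                # Assumed location shape: S3-backed knowledge base chunks.
                "uri": ref.get("location", {}).get("s3Location", {}).get("uri"),
                "metadata": ref.get("metadata", {}),
            })
    # The answer now carries its provenance end to end.
    return {"text": response["output"]["text"], "sources": sources}
```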

IAM Identity Center — Permission Sets vs IAM Roles Directly

Permission sets in IAM Identity Center define what access a role has across multiple AWS accounts. They're defined once and assigned to accounts — no need to replicate roles manually across accounts.
For Bedrock model access control: permission sets can restrict which models a department can invoke — for example, by scoping the Resource element of bedrock:InvokeModel to specific foundation-model ARNs.
IAM Identity Center + permission sets = single definition, multi-account consistency. IAM roles created per-account = drift, inconsistency, management overhead. The exam always prefers Identity Center for enterprise multi-account scenarios.
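
A hedged sketch of the inline policy such a permission set might carry; the region and model-ID pattern are placeholders (foundation-model ARNs have no account ID):

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "MarketingModelsOnly",
      "Effect": "Allow",
      "Action": [
        "bedrock:InvokeModel",
        "bedrock:InvokeModelWithResponseStream"
      ],
      "Resource": "arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-3-haiku-*"
    }
  ]
}
```
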
3.4 — Responsible AI Principles

Bias Detection in Text Generation — BOLD Dataset

For detecting demographic bias in text generation models (loan decisions, customer communications), the correct approach is Amazon Bedrock model evaluation jobs using the BOLD dataset (Bias in Open-Ended Language Generation).
BOLD = designed specifically for demographic bias assessment in language generation. It tests whether the model generates biased text about different demographic groups.
SageMaker Clarify uses the RealToxicityPrompts dataset, which focuses on toxicity, not demographic bias. It also lacks a secondary model validation methodology. Wrong tool for demographic bias in text gen.
Scenario: "text generation model for loan processing, must evaluate for demographic bias" → Bedrock model evaluation + BOLD dataset + secondary evaluator model. Not SageMaker Clarify.

Responsible AI Tools Summary

| Need | Right Tool | Wrong Tool |
|---|---|---|
| Demographic bias in text generation | Bedrock eval + BOLD dataset | SageMaker Clarify |
| Toxicity in model outputs | Bedrock Guardrails content filters | SageMaker Clarify |
| ML model fairness (classification, regression) | SageMaker Clarify | Bedrock eval (wrong use) |
| Explainability for ML predictions | SageMaker Clarify | Bedrock Guardrails (wrong use) |
| Grounding / hallucination in RAG | Bedrock Guardrails contextual grounding | SageMaker Clarify |

SageMaker Clarify = ML model fairness + explainability (training time, feature importance). Bedrock = LLM evaluation, content safety, bias in text generation. They operate at different layers.