What makes an AI an "agent"? How do LLMs plan, use tools, maintain memory, and reason their way to goals? This week digs into the cognitive architecture of intelligent agents.
A hammer is a tool: it does exactly what you make it do, nothing more. A calculator is a tool too. But imagine an intern you give a task to: "Research our competitors and write a report." They break it down: search Google, take notes, organize findings, write a draft, review it, fix errors, deliver the final product. They make decisions, use tools, handle unexpected obstacles, and persist toward a goal across many steps. That's an agent. The key insight: agents pursue goals autonomously over extended horizons, using tools and environment feedback to adapt.
| Dimension | Traditional LLM (chatbot) | AI Agent |
|---|---|---|
| Task duration | Single turn: query → response | Multi-step: goal → plan → execute → reflect → deliver |
| Tool use | None (just text generation) | Actively calls APIs, runs code, searches web, reads files |
| Memory | Only current context window | Short-term context + long-term external memory |
| Decision making | One-shot: generate best response | Iterative: act → observe → reason → act again |
| Error handling | None; output is final | Detects failures, self-corrects via reflection |
| Goal persistence | Answers one question at a time | Maintains goal state across many steps/sessions |
The context window of an LLM is finite (even 128K tokens is limited for long tasks). Agents need different types of memory to operate over extended horizons. Cognitive science inspired the design: human memory has episodic, semantic, procedural, and working memory, and agent memory systems mirror this.
Working memory: what currently sits in the LLM's context window: the prompt, conversation history, recent tool outputs, and partial results.
Analogy: RAM, fast and immediately accessible, but limited and lost when the session ends.
Episodic memory: records of past agent runs, interactions, and experiences, retrieved when relevant to a new task. Enables the agent to learn from past successes and failures.
Implementation: a vector database of past task logs, retrieved via semantic similarity.
Semantic memory: general world knowledge, domain facts, product documentation, not tied to a specific episode. Queried via RAG to provide factual grounding.
Implementation: a vector DB indexed with domain documents (the same as a RAG knowledge base).
Procedural memory: learned skills, workflows, and action sequences that have worked before, encoded in the agent's system prompt, fine-tuned weights, or tool definitions.
Implementation: a system prompt with successful workflow patterns, or fine-tuned model weights.
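A minimal sketch of how episodic and semantic retrieval could be wired together. The `MemoryStore` class and `embed` function are illustrative stand-ins, not from any specific framework; a real system would use a sentence-embedding model and a proper vector database:

```python
import numpy as np

def embed(text: str) -> np.ndarray:
    """Placeholder embedding; a real agent would call a sentence-embedding model."""
    rng = np.random.default_rng(abs(hash(text)) % 2**32)
    return rng.standard_normal(384)

class MemoryStore:
    """Toy vector store: cosine-similarity retrieval over stored entries."""
    def __init__(self):
        self.entries: list[tuple[str, np.ndarray]] = []

    def add(self, text: str) -> None:
        self.entries.append((text, embed(text)))

    def retrieve(self, query: str, k: int = 3) -> list[str]:
        q = embed(query)
        scored = [(float(q @ v) / (np.linalg.norm(q) * np.linalg.norm(v)), t)
                  for t, v in self.entries]
        return [t for _, t in sorted(scored, reverse=True)[:k]]

episodic = MemoryStore()   # records of past runs
semantic = MemoryStore()   # domain documents (the RAG knowledge base)
episodic.add("Competitor-report task: searching the web before drafting worked well.")
relevant = episodic.retrieve("write a competitor report") \
         + semantic.retrieve("write a competitor report")
```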
Without tools, an LLM can only manipulate text: it's trapped in language. Tools extend the agent into the real world: reading files, writing code, calling databases, fetching URLs, executing calculations. The agent selects tools via function calling, where the LLM outputs structured JSON specifying the tool and its arguments.
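A sketch of that function-calling handshake. The JSON-schema tool definition follows the common OpenAI-style convention, but the `dispatch` helper and the weather stub are illustrative:

```python
import json

# Tool definition in the JSON-schema style most function-calling APIs expect
TOOLS = [{
    "name": "get_weather",
    "description": "Get the current weather for a city",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}]

def get_weather(city: str) -> str:
    return f"18°C and cloudy in {city}"        # stub for a real weather API

REGISTRY = {"get_weather": get_weather}

def dispatch(tool_call_json: str) -> str:
    """Validate and execute a structured tool call emitted by the LLM."""
    call = json.loads(tool_call_json)
    fn = REGISTRY.get(call["name"])
    if fn is None:                              # guard against hallucinated tool names
        return f"Error: unknown tool {call['name']!r}"
    return fn(**call["arguments"])

# Instead of free text, the LLM emits structured JSON like this:
print(dispatch('{"name": "get_weather", "arguments": {"city": "Berlin"}}'))
```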
A capable agent doesn't tackle a complex goal in one step; it breaks it down into a plan: a sequence of sub-tasks, each with defined inputs, outputs, and tool requirements. This mirrors how humans solve complex problems by decomposing them into manageable steps.
Plan-then-execute: generate the full plan upfront, then execute each step. Efficient but brittle; if early steps fail, the rest of the plan may be invalid.
Interleaved planning: generate only the next action at each step, using current observations. More adaptive, since the plan evolves based on what the agent discovers.
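The two strategies differ mainly in where the LLM call sits relative to the execution loop. A minimal sketch, assuming hypothetical `llm_plan` and `llm_next_action` callables that wrap model calls:

```python
def plan_then_execute(goal, llm_plan, execute):
    """Strategy 1: commit to a full plan upfront, then run it step by step."""
    plan = llm_plan(goal)                      # one LLM call -> ordered list of steps
    return [execute(step) for step in plan]    # brittle: no replanning on failure

def interleaved(goal, llm_next_action, execute, max_steps=10):
    """Strategy 2: decide one action at a time, conditioned on observations."""
    history = []
    for _ in range(max_steps):
        action = llm_next_action(goal, history)     # LLM sees everything so far
        if action == "DONE":
            break
        history.append((action, execute(action)))   # observation feeds next decision
    return history
```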
Perception = what information the agent has access to at each step. This includes: the original goal, conversation history, tool outputs, retrieved memories, and environmental state. Managing this context is crucial: context windows are finite, and irrelevant information degrades reasoning quality.
LLMs make errors. Agents that can reflect on their outputs, compare them against success criteria, and generate corrective actions dramatically outperform those that don't. Reflection is a meta-cognitive skill: reasoning about one's own reasoning.
ReAct (Yao et al., 2022) is the foundational agent reasoning pattern. It interleaves Thought (an LLM reasoning trace), Action (a tool call), and Observation (the tool output). This tight coupling between reasoning and acting reduces hallucination because every factual claim is immediately grounded by a tool result.
ReAct: interleaves thought traces with tool actions in a linear chain: T→A→O→T→A→O... Best for: factual Q&A, web research, API tasks where you need grounded, verifiable answers at each step.
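The loop itself is mostly plumbing around the T→A→O cycle. A bare-bones sketch, assuming an `llm` callable that emits Thought/Action/Final Answer lines and a `dispatch` tool executor like the one sketched earlier:

```python
def react_loop(question, llm, dispatch, max_steps=8):
    """Run Thought -> Action -> Observation until a final answer appears."""
    transcript = f"Question: {question}\n"
    for _ in range(max_steps):
        step = llm(transcript)                     # emits Thought/Action or Final Answer
        transcript += step + "\n"
        if "Final Answer:" in step:
            return step.split("Final Answer:")[-1].strip()
        if "Action:" in step:
            action_json = step.split("Action:")[-1].strip()
            observation = dispatch(action_json)    # ground the next thought in a tool result
            transcript += f"Observation: {observation}\n"
    return "Stopped: step budget exhausted"
```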
Reflexion: adds a reflection loop. After each failed attempt, the agent writes a verbal self-critique that is stored in episodic memory; the next attempt uses this reflection. Best for: coding challenges, multi-step reasoning where initial attempts often fail.
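A Reflexion-style retry loop is a thin wrapper around any attempt function. A sketch, with `attempt`, `check`, and `llm_critique` as assumed callables (e.g. a coding agent, its unit tests, and a self-critique prompt):

```python
def reflexion(task, attempt, check, llm_critique, max_trials=3):
    """Retry loop with verbal self-critique carried across attempts."""
    reflections = []                                # episodic memory of failures
    result = None
    for _ in range(max_trials):
        result = attempt(task, reflections)         # attempt conditions on past critiques
        if check(result):                           # success criterion, e.g. unit tests
            return result
        reflections.append(llm_critique(task, result))  # "what went wrong and why"
    return result                                   # best effort after max_trials
```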
Tree of Thoughts (ToT): generates multiple candidate reasoning steps at each point, evaluates them, and explores the best via BFS/DFS/beam search. Best for: creative problem-solving, mathematical proofs, strategic planning, and other tasks with a combinatorial search space.
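A compact beam-search rendition of ToT, with `propose` and `evaluate` as assumed LLM-backed callables (generate candidate next thoughts; score a partial solution in [0, 1]):

```python
def tree_of_thoughts(problem, propose, evaluate, beam_width=3, depth=4):
    """Beam search over partial reasoning paths instead of one linear chain."""
    frontier = [("", 0.0)]                           # (partial solution, score)
    for _ in range(depth):
        candidates = []
        for state, _ in frontier:
            for step in propose(problem, state):     # branch: candidate next thoughts
                new_state = f"{state}\n{step}"
                candidates.append((new_state, evaluate(problem, new_state)))
        if not candidates:
            break
        candidates.sort(key=lambda c: c[1], reverse=True)
        frontier = candidates[:beam_width]           # prune: keep the best paths
    return frontier[0][0]
```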
Monte Carlo Tree Search (MCTS): expand a tree of possible action sequences, simulate outcomes, backpropagate scores, repeat. MCTS famously enabled superhuman performance in games like Go, and search-based test-time reasoning in this spirit is often cited in connection with OpenAI's "thinking" models (o1, o3, o4-mini), though their internals are undisclosed.
LLM agents didn't emerge in a vacuum; they were preceded by decades of AI research on cognitive architectures. Understanding the classical foundations helps you reason about why LLM agents work the way they do, and where they still fall short.
Soar: a symbolic cognitive architecture. Uses production rules, working memory, and "chunking" to learn new rules from problem-solving. A foundation of rule-based AI agents.
ACT-R: a hybrid symbolic/neural architecture with modules for declarative memory, procedural memory, and perception/motor control. The most empirically grounded cognitive architecture in psychology.
BDI (Belief-Desire-Intention): agents have beliefs (world state), desires (goals), and intentions (committed plans). Still used in multi-agent systems and robotics.
LLM-as-cognitive-architecture: the LLM implicitly serves as all cognitive modules. Context window = working memory, tool calls = perception/motor, system prompt = procedural knowledge. Emergent rather than designed.
Neuro-symbolic hybrids: combine LLM "intuition" with formal planners, theorem provers, or logic engines. Best for safety-critical domains where the neural component must be verifiably constrained.
Embodied agents: agents grounded in physical or virtual environments (robotics, game agents), with perception as visual/sensor input. The agent must ground language in physical reality, one of the hardest open problems in agentic AI.
| Failure Mode | Description | Mitigation |
|---|---|---|
| Hallucinated tool calls | LLM invents tool arguments or tool names that don't exist | Strict function schemas, input validation, constrained generation |
| Infinite loops | Agent gets stuck in a loop (same action repeatedly) | Max step limits, loop detection, action deduplication |
| Goal drift | Agent pursues sub-goal instead of original goal | Persistent goal reminder in every prompt, goal-checking step |
| Cascading errors | Wrong result in step 2 propagates and corrupts all later steps | Error detection at each step, checkpointing, rollback |
| Prompt injection | Malicious content in environment overrides agent instructions | Input sanitization, privilege separation, human-in-the-loop |
| Over-trust of tools | Agent blindly trusts incorrect tool output | Cross-verification with multiple sources, confidence thresholds |
| Context overflow | Conversation grows beyond context window, losing key info | Context compression, sliding window, memory summarization |
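Several of the mitigations above are only a few lines of code. A sketch of a step budget plus loop detection via action deduplication (function names are illustrative):

```python
from collections import Counter

def guarded_run(next_action, execute, goal, max_steps=25, max_repeats=3):
    """Wrap an agent loop in two cheap rails from the table: step budget + loop detection."""
    seen, history = Counter(), []
    for _ in range(max_steps):                  # rail 1: hard step budget
        action = next_action(goal, history)
        if action is None:                      # agent signals completion
            return history
        seen[str(action)] += 1
        if seen[str(action)] > max_repeats:     # rail 2: same action repeated
            raise RuntimeError(f"Loop detected: {action!r} repeated")
        history.append((action, execute(action)))
    raise RuntimeError("Step budget exhausted before the goal was reached")
```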
1. What is the fundamental property that distinguishes an AI "agent" from a standard LLM chatbot?
2. In the ReAct framework, why does interleaving Thought-Action-Observation reduce hallucination compared to standard Chain-of-Thought?
3. An agent working on a 30-step task starts "forgetting" its original goal by step 20 and begins pursuing a tangential sub-goal. This is called:
4. Which type of agent memory is analogous to a human writing notes to themselves after completing a task so they do it better next time?
Deploying an LLM without understanding alignment is like shipping a self-driving car without safety testing. This section covers the technical reality of AI safety, from WEAT bias measurement and DPO alignment math to EU AI Act compliance and LLM watermarking. Every AI/ML professional in 2026 needs this.
WEAT (Word Embedding Association Test, Caliskan et al. 2017) measures implicit bias in word embeddings by testing whether target concepts associate more strongly with one attribute set than another, mirroring the human IAT (Implicit Association Test).
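Concretely, for each word w the association s(w, A, B) is its mean cosine similarity to attribute set A minus that to B, and the effect size is the standardized difference of mean association between the two target sets X and Y. A numpy sketch (word vectors assumed given):

```python
import numpy as np

def cos(u, v):
    return (u @ v) / (np.linalg.norm(u) * np.linalg.norm(v))

def assoc(w, A, B):
    """s(w, A, B): association of word vector w with attributes A vs. B."""
    return np.mean([cos(w, a) for a in A]) - np.mean([cos(w, b) for b in B])

def weat_effect_size(X, Y, A, B):
    """Cohen's-d-style effect size from Caliskan et al. (2017)."""
    sX = [assoc(x, A, B) for x in X]      # X, Y: target sets (e.g. flowers, insects)
    sY = [assoc(y, A, B) for y in Y]      # A, B: attribute sets (pleasant, unpleasant)
    return (np.mean(sX) - np.mean(sY)) / np.std(sX + sY, ddof=1)
```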
RLHF with PPO requires four models simultaneously: the reference policy π_ref, the current policy π_θ, a reward model r_φ, and a value function V_ψ. It's unstable, slow, and expensive. DPO (Rafailov et al., NeurIPS 2023) shows that the optimal policy can be derived analytically, so no RL is needed.
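The resulting objective is a simple classification-style loss over preference pairs. A PyTorch sketch of the DPO loss, with per-sequence log-probabilities assumed precomputed:

```python
import torch
import torch.nn.functional as F

def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
    """pi_*/ref_*: summed log-probs of the chosen/rejected responses under
    the trainable policy and the frozen reference model (1-D tensors)."""
    # implicit rewards are beta-scaled log-ratios against the reference
    chosen_reward = beta * (pi_chosen - ref_chosen)
    rejected_reward = beta * (pi_rejected - ref_rejected)
    # maximize the probability that the chosen response out-scores the rejected one
    return -F.logsigmoid(chosen_reward - rejected_reward).mean()
```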
| Aspect | PPO-RLHF | DPO |
|---|---|---|
| Models needed | 4 (policy, ref, reward, value) | 2 (policy, ref) |
| Stability | Often unstable | Stable (SFT-like) |
| Compute cost | Very high | Moderate |
| Reward hacking | Yes (explicit RM) | Less (implicit) |
| Used by | GPT-4, Claude, LLaMA-2 | LLaMA-3, Mistral, Zephyr |
KTO (Kahneman-Tversky Optimization, 2024) extends DPO using prospect theory: it doesn't require paired preferences and works with individual (x, y, label) examples.
Unacceptable risk: social scoring systems, real-time biometric surveillance in public spaces, manipulation of vulnerable groups, exploitation of unconscious behaviors. Banned outright.
High risk: LLMs used in hiring, credit scoring, education assessment, critical infrastructure, law enforcement, and medical devices. Requires conformity assessment, transparency, human oversight, and logging.
General-purpose AI with systemic risk: models like GPT-4, LLaMA, and Claude (systemic risk presumed above 10²⁵ FLOPs of training compute) face additional obligations: red-teaming, adversarial testing, cybersecurity measures, and incident reporting to the EU AI Office.
Transparency obligations: AI-generated content must be labeled. Technical approaches include token-level watermarking (Kirchenbauer et al. 2023: green/red token lists), metadata embedding, and provenance certificates. The EU Code of Practice requires a multi-layered approach, since no single method is sufficient.
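A minimal sketch of the green-list scheme: hash the previous token to seed a pseudo-random partition of the vocabulary, bias sampling toward the "green" half, and detect by counting green tokens. Illustrative constants, not the paper's exact implementation:

```python
import numpy as np

VOCAB_SIZE = 50_000
GAMMA, DELTA = 0.5, 2.0          # green-list fraction, logit boost

def green_list(prev_token: int) -> np.ndarray:
    """Pseudo-random vocab partition, deterministically seeded by the previous token."""
    rng = np.random.default_rng(prev_token)
    return rng.permutation(VOCAB_SIZE)[: int(GAMMA * VOCAB_SIZE)]

def watermarked_sample(logits: np.ndarray, prev_token: int) -> int:
    biased = logits.copy()
    biased[green_list(prev_token)] += DELTA      # softly favor green tokens
    p = np.exp(biased - biased.max())
    return int(np.random.choice(VOCAB_SIZE, p=p / p.sum()))

def detect(tokens: list[int]) -> float:
    """z-score of the green-token count; a large z suggests watermarked text."""
    hits = sum(t in set(green_list(prev)) for prev, t in zip(tokens, tokens[1:]))
    n = len(tokens) - 1
    return (hits - GAMMA * n) / np.sqrt(n * GAMMA * (1 - GAMMA))
```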
Three widely used fairness criteria are mathematically incompatible (Chouldechova 2017; Kleinberg et al. 2017): satisfying one often violates the others. Understanding this is critical for any responsible AI deployment.
| Criterion | Definition | Formula | Tradeoff |
|---|---|---|---|
| Demographic Parity | Equal positive prediction rates across groups | P(Ŷ=1 \| A=0) = P(Ŷ=1 \| A=1) | Ignores base rate differences |
| Equalized Odds | Equal TPR and FPR across groups | P(Ŷ=1 \| Y=y, A=a) equal for all a, y | May force accuracy loss to equalize error rates |
| Calibration | Predicted probabilities match true frequencies in each group | P(Y=1 \| score=s, A=a) = s for all a | Conflicts with equalized odds when base rates differ |
Chouldechova (2017) and Kleinberg et al. (2017) proved that when base rates differ across groups, no imperfect classifier can simultaneously achieve demographic parity, equalized odds, AND calibration. This means choosing a fairness criterion is a policy decision, not a technical one. NLP researchers must make this explicit when building and deploying models.
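Auditing the first two criteria on real predictions takes a few lines. A numpy sketch that reports per-group positive rates (demographic parity) and TPR/FPR (equalized odds), with `y`, `y_hat`, and `group` as assumed 0/1 arrays:

```python
import numpy as np

def fairness_report(y, y_hat, group):
    """y: true labels, y_hat: binary predictions, group: protected attribute (0/1)."""
    for g in (0, 1):
        m = group == g
        pos_rate = y_hat[m].mean()                   # demographic parity check
        pos, neg = m & (y == 1), m & (y == 0)
        tpr = y_hat[pos].mean() if pos.any() else float("nan")  # equalized odds checks
        fpr = y_hat[neg].mean() if neg.any() else float("nan")
        print(f"group {g}: P(Yhat=1)={pos_rate:.2f}  TPR={tpr:.2f}  FPR={fpr:.2f}")
```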