Agents, Tools, and Context

Understand the three core components of every Claude interaction: model calls, tool calls, and context. Learn to analyze API requests and responses, interpret tool usage, and manage conversation history to build reliable AI agent loops. This lesson helps you develop a clear mental model essential for architecting Claude-powered systems.

We'll cover the following...

The three things in every Claude interaction
What’s next?

In the last lesson, we reasoned about why code-level checks around tool calls provide a reliable enforcement boundary and when a task requires an agent. Now we’ll make that concrete: we’ll inspect what the API sends and receives, label the key parts, and build the vocabulary you’ll use for the rest of the course. By the end of this lesson, you’ll be able to:

Build a mental model for the three moving parts in every Claude-powered system.
Review annotated examples of a request, a tool-use response, and a tool result.
Read a full conversation transcript labeled from the first message to the final reply.
Complete a short exercise to test whether the labels have landed.

The three things in every Claude interaction

Every interaction with Claude, no matter how complex, is built from three concepts:

Model call: One round trip to the API. We send a request, and Claude returns a response. An agent loop is just many of these in sequence.
Tool call: A request Claude makes in its response, asking us to run a function and report back. Claude does not run the function. We do.
Context: The messages array we send on every model call. It is the only memory Claude has. Whatever is not in that array does not exist as far as Claude is concerned.

import anthropic
client = anthropic.Anthropic()
response = client.messages.create(
    model="claude-opus-4-8",
    max_tokens=1024,
    system="You are a customer support agent. Process refunds within policy only.",
    tools=[
        {
            "name": "process_refund",
            "description": "Process a refund for a customer order.",
            "input_schema": {
                "type": "object",
                "properties": {
                    "order_id": {"type": "string"},
                    "amount":   {"type": "number"},
                    "reason":   {"type": "string"}
                },
                "required": ["order_id", "amount", "reason"]
            }
        }
    ],
    messages=[
        {"role": "user", "content": "I need a refund for order #12345. It arrived damaged."}
    ]
)

A minimal API request with a system prompt, one tool definition, and a single user message

Let’s walk through each part:

Line 1: We import the official Anthropic Python SDK. This is the only dependency needed to make API calls.
Line 3: We create a client instance. By default, it reads the ANTHROPIC_API_KEY environment variable for authentication. We never hardcode the key.
Line 5: We open the messages.create() call, which sends a request to the API and returns a response object.
Line 6: We specify which Claude model to use. The model is set per request, not on the client, so different calls in the same agent loop can target different models if needed.
Line 7: We set a hard cap on how many tokens Claude can generate in this response. This is a budget control, not a quality setting. The response stops at this limit even if Claude is mid-sentence.
Line 8: We provide the system prompt. Claude reads this on every turn of the conversation. It shapes behavior for the whole session, which is why business rules should live here and not in a user message.
Lines 9–22: We define the tools list, the functions Claude is allowed to request. Each tool has a name, a description that Claude reads to decide when to use it, and an input_schema that defines the required arguments as a JSON Schema.
Lines 13–21: The input_schema tells Claude which arguments to populate, what types they must be, and which are required. Claude uses this schema to construct the input object it sends back in a tool_use block.
Lines 24–26: We provide the messages array, the conversation history. Right now, it holds one user message. After every round trip, we append to this array and send the full history back. This array is the entire state of the conversation.

When Claude responds directly

If Claude has enough information to reply without calling a tool, stop_reason is "end_turn", and the response looks like this:

Here’s what each field tells us:

Line 2: "id" is a unique identifier for this response. Use it when logging or correlating requests to responses.
Line 3: "type" is the top-level type of the response object. It is always "message" for text and tool interactions.
Line 4: "role": "assistant" confirms this is Claude’s output. When we append this response to the messages array to build context, we use this role value.
Lines 5–10: "content" is an array of content blocks. Here, it holds a single text block. The same array can hold multiple blocks of different types, which we’ll see in the next section.
Line 7: "type": "text" inside the content block identifies this as a plain text response. The other content block type we care about is "tool_use", which signals a tool call request.
Line 12: "stop_reason": "end_turn" is the most important field for loop control. It means Claude finished its response and is waiting for input. There is no tool to run. The agent loop should surface this response to the user or advance to the next step.
Line 13: "usage" provides the token counts for this round trip. Tracking these across turns tells us how fast the context window is filling up.

When Claude wants to use a tool

Now, the same scenario with more context provided. Claude decides it has enough information to act. The response shape changes:

{
    "id": "msg_01DEF456",
    "type": "message",
    "role": "assistant",
    "content": [
        {
            "type": "text",
            "text": "I'll process that refund for order #12345 right away."
        },
        {
            "type": "tool_use",
            "id": "toolu_01XYZ789",
            "name": "process_refund",
            "input": {
                "order_id": "12345",
                "amount": 49.99,
                "reason": "Item arrived damaged"
            }
        }
    ],
    "model": "claude-opus-4-8",
    "stop_reason": "tool_use",
    "usage": {"input_tokens": 97, "output_tokens": 52}
}

A response where Claude requested a tool call. The content array holds both a text block and a tool_use block. The stop_reason is "tool_use".

Let’s go through the new and changed fields:

Lines 5–20: The content array now holds two blocks: a text block and a tool_use block. Claude narrated its intention and made the tool call in the same turn. Both blocks must be preserved when we add this response to the messages array.
Line 11: "type": "tool_use" identifies this content block as a tool call request. This is Claude’s way of saying “run this function for me and send me the result.”
Line 12: "id": "toolu_01XYZ789" is the unique identifier for this specific tool call. This is separate from the message id on line 2. We must echo this exact value back as tool_use_id when we send the result. Without it, Claude cannot match the result to its request.
Line 13: "name": "process_refund" is the tool Claude wants to run. This must exactly match a tool name in the tools list we sent in the request.
Lines 14–18: "input" contains the arguments Claude chose for this call, populated according to the input_schema we defined. These are the values our code will use when running the actual function.
Line 22: "stop_reason": "tool_use" signals that Claude is pausing and waiting for a tool result. The agent loop must not surface this to the user. Instead, it runs the requested tool and sends the result back.

messages = [
    {
        "role": "user",
        "content": "I need a refund for order #12345. It arrived damaged."
    },
    {
        "role": "assistant",
        "content": response.content
    },
    {
        "role": "user",
        "content": [
            {
                "type": "tool_result",
                "tool_use_id": "toolu_01XYZ789",
                "content": "Refund of $49.99 processed. Confirmation number: REF-7890."
            }
        ]
    }
]
final_response = client.messages.create(
    model="claude-opus-4-8",
    max_tokens=1024,
    system="You are a customer support agent. Process refunds within policy only.",
    tools=[...],
    messages=messages
)

Building the second API call: The original user message, Claude’s previous response, and the tool result are all appended to the messages array before sending

Here’s what each new part does:

Lines 1–20: We construct the full messages array to pass to the next API call. The API is stateless; it has no memory between calls, so we send the complete history every time.
Lines 6–9: We append Claudes previous response as an assistant turn using response.content, which gives us the full content list, including both the text block and the tool_use block. Claude needs to see its own tool call request in context to understand what the result refers to. Stripping the tool_use block would break the conversation.
Lines 10–19: We add the tool result as a user-role message. This surprises people at first. The role field describes who produced the content. Our application produced this result, so it belongs under user. The assistant role is reserved exclusively for Claudes own responses.
Line 14: "type": "tool_result" identifies this content block as a function output. Claude looks for this type to know the requested function has been executed.
Line 15: "tool_use_id": "toolu_01XYZ789" links this result to the specific tool call that requested it. This must match the id field from the tool_use block exactly.
Line 16: "content" holds the output our function returned. This can be a plain string or a list of content blocks. Claude reads this to determine its next action.
Lines 22–28: We make the second API call, passing the complete updated messages array. This call will produce Turn 4, Claudes final response to the user.

The complete annotated transcript

Here is the full four-turn conversation after both API calls complete. Read it top to bottom as if you are the agent loop watching the context grow:

messages = [
    # Turn 1 — User opens the conversation
    # This is the only input to API call 1.
    {
        "role": "user",
        "content": "I need a refund for order #12345. It arrived damaged."
    },
    # Turn 2 — Claude requests a tool call (API call 1 response)
    # stop_reason was "tool_use". We append the full content list unchanged.
    {
        "role": "assistant",
        "content": [
            {"type": "text", "text": "I'll process that refund for order #12345 right away."},
            {"type": "tool_use", "id": "toolu_01XYZ789", "name": "process_refund",
             "input": {"order_id": "12345", "amount": 49.99, "reason": "Item arrived damaged"}}
        ]
    },
    # Turn 3 — We return the tool result
    # Turns 1, 2, and 3 together are the input to API call 2.
    {
        "role": "user",
        "content": [
            {"type": "tool_result", "tool_use_id": "toolu_01XYZ789",
             "content": "Refund of $49.99 processed. Confirmation number: REF-7890."}
        ]
    },
    # Turn 4 — Claude delivers the final response (API call 2 response)
    # stop_reason is "end_turn". The loop surfaces this to the user.
    {
        "role": "assistant",
        "content": [
            {"type": "text",
             "text": "Done! Your refund of $49.99 has been processed. Confirmation: REF-7890."}
        ]
    }
]

The complete four-turn messages array after both API calls. Turns 1–3 are the input to API call 2. Turn 4 is its output.

Lines 3–8: Turn 1 is the users opening message. This is the only input to the first API call.
Lines 10–19: Turn 2 is Claudes response to the first API call. The stop_reason was "tool_use", so the loop does not surface this to the user. Instead, it appends the full content list (both the text block and the tool_use block) unchanged and moves to the next step.
Lines 21–28: Turn 3 is the tool result we send back. Turns 1, 2, and 3 together make up the messages array for the second API call. This is the context Claude sees when generating Turn 4.
Lines 30–38: Turn 4 is Claudes final response. The stop_reason is "end_turn", so the loop surfaces the text to the user and stops iterating.

Complete code

We bring together the request anatomy, both response shapes, and the tool result handshake from this lesson into one runnable file. The script makes two API calls: the first returns a tool_use stop reason with both a text block and a tool_use block in the content array, and the second returns end_turn after the tool result is delivered. The annotated transcript at the end labels each turn exactly as the lesson diagrams show.

Python 3.14.0

import json
import anthropic
anthropic_api_key = "{{ANTHROPIC_API_KEY}}"
client = anthropic.Anthropic(api_key=anthropic_api_key)
TOOLS = [
    {
        "name": "process_refund",
        "description": "Process a refund for a customer order.",
        "input_schema": {
            "type": "object",
            "properties": {
                "order_id": {"type": "string"},
                "amount":   {"type": "number"},
                "reason":   {"type": "string"}
            },
            "required": ["order_id", "amount", "reason"]
        }
    }
]
SYSTEM = (
    "You are a customer support agent. Process refunds within policy only.\n\n"
    "Order database:\n"
    "  #12345 — Widget Set x3, $49.99, shipped 2025-03-10"
)
def process_refund(order_id: str, amount: float, reason: str) -> str:
    confirmation = f"REF-{abs(hash(order_id)) % 10000:04d}"
    return f"Refund of ${amount:.2f} processed. Confirmation number: {confirmation}."
def run():
    print("=== API call 1: user asks for a refund ===\n")
    USER_MSG = "I need a refund for order #12345. It arrived damaged."
    messages = [
        {"role": "user", "content": USER_MSG}
    ]
    response = client.messages.create(
        model="claude-opus-4-8",
        max_tokens=1024,
        system=SYSTEM,
        tools=TOOLS,
        messages=messages
    )
    print(f"stop_reason  : {response.stop_reason}")
    for i, block in enumerate(response.content):
        if block.type == "text":
            print(f"content[{i}]   : text      — {block.text}")
        elif block.type == "tool_use":
            print(f"content[{i}]   : tool_use  — {block.name}({json.dumps(block.input)})")
            print(f"tool_use_id  : {block.id}")
    if response.stop_reason != "tool_use":
        print("\nClaude replied directly (end_turn). No tool cycle needed.")
        return
    tool_block  = next(b for b in response.content if b.type == "tool_use")
    tool_result = process_refund(**tool_block.input)
    print(f"\nTool result  : {tool_result}")
    print("\n=== API call 2: send tool result, receive final reply ===\n")
    messages = [
        {"role": "user",      "content": USER_MSG},
        {"role": "assistant", "content": response.content},
        {"role": "user",      "content": [
            {"type": "tool_result", "tool_use_id": tool_block.id, "content": tool_result}
        ]}
    ]
    final = client.messages.create(
        model="claude-opus-4-8",
        max_tokens=1024,
        system=SYSTEM,
        tools=TOOLS,
        messages=messages
    )
    print(f"stop_reason  : {final.stop_reason}")
    for i, block in enumerate(final.content):
        if block.type == "text":
            print(f"content[{i}]   : text      — {block.text}")
    print("\n=== Annotated 4-turn transcript ===\n")
    print(f"Turn 1 [user]      : {messages[0]['content']}")
    for block in messages[1]["content"]:
        if block.type == "text":
            print(f"Turn 2 [assistant] : text      — {block.text}")
        elif block.type == "tool_use":
            print(f"Turn 2 [assistant] : tool_use  — {block.name}({json.dumps(block.input)})")
    tr = messages[2]["content"][0]
    print(f"Turn 3 [user]      : tool_result({tr['tool_use_id'][:24]}...) = {tr['content']}")
    for block in final.content:
        if block.type == "text":
            print(f"Turn 4 [assistant] : text      — {block.text}")
if __name__ == "__main__":
    run()

=== API call 1: user asks for a refund ===
stop_reason  : tool_use
content[0]   : text      — I'll process that refund for order #12345 right away.
content[1]   : tool_use  — process_refund({"order_id": "12345", "amount": 49.99, "reason": "Item arrived damaged"})
tool_use_id  : toolu_01AbCdEfGhIjKlMnOpQ...
Tool result  : Refund of $49.99 processed. Confirmation number: REF-5840.
=== API call 2: send tool result, receive final reply ===
stop_reason  : end_turn
content[0]   : text      — Your refund of $49.99 for order #12345 has been processed. Confirmation number: REF-5840.
=== Annotated 4-turn transcript ===
Turn 1 [user]      : I need a refund for order #12345. It arrived damaged.
Turn 2 [assistant] : text      — I'll process the refund for order #12345 right away.
Turn 2 [assistant] : tool_use  — process_refund({"order_id": "12345", "amount": 49.99, "reason": "Item arrived damaged"})
Turn 3 [user]      : tool_result(toolu_01AbCdEfGhIjKlMnOpQ...) = Refund of $49.99 processed. Confirmation number: REF-5840.
Turn 4 [assistant] : text      — Your refund of $49.99 for order #12345 has been processed. Confirmation number: REF-5840.

messages = [
    {"role": "user",      "content": "What's the status of my order #99001?"},
    {"role": "assistant", "content": [
        {"type": "tool_use", "id": "toolu_ABCdef", "name": "get_order_status",
         "input": {"order_id": "99001"}}
    ]},
    {"role": "user",      "content": [
        {"type": "tool_result", "tool_use_id": "toolu_ABCdef",
         "content": "Order #99001 is in transit. Expected delivery: June 12."}
    ]},
    {"role": "assistant", "content": [
        {"type": "text",
         "text": "Your order #99001 is on its way and should arrive by June 12."}
    ]}
]

A complete two-call transcript

1.Claude AI Systems Foundations

2.Building Agents with the Claude Client SDK

3.Architecting Agentic Systems

4.Orchestrating Multi-Agent Systems

5.Designing Tools and MCP Integrations

6.Prompting and Schema Design

7.Claude Code Configuration and Project Workflows

8.Validation, Retry Loops, and Metrics

9.Context Management Techniques

10.Making Reliable Claude Systems

Agents, Tools, and Context

The three things in every Claude interaction

Anatomy of a request

When Claude responds directly

When Claude wants to use a tool

Closing the loop: The tool result

The complete annotated transcript

Complete code

Exercise: Label the transcript

What’s next?