Tracing and Debugging AI Systems in LlamaIndex
Explore how to trace and debug AI systems built with LlamaIndex. Learn to implement basic logging, use callback handlers for structured tracing, and integrate advanced observability tools like LlamaTrace. Understand workflow execution, identify performance bottlenecks, and improve the reliability of your LLM-based applications through step-by-step tracing and monitoring.
When we build AI systems with LlamaIndex or any other framework, it’s easy to focus on inputs and outputs—a question goes in, and an answer emerges. But much happens beneath the surface: documents are embedded and retrieved, prompts are dynamically constructed, and language models generate results.
In more complex setups, agents may call tools, and workflows may branch based on logic or user interaction. When something goes wrong—a poor answer, a missed tool call, or a slow response—we must understand what happened.
Tracing and debugging help us make the invisible visible. They let us follow how data flows through the system and diagnose exactly where things break down. With LlamaIndex, we can inspect each step—from retrieval and prompt construction to final response generation.
Enabling basic logging
Basic logging refers to printing key events or information to the console during runtime, like the queries we send, the responses we get back, or the progress of certain steps in our application.
This is often done using Python’s built-in logging module. It gives us immediate visibility into high-level events without special tooling or tracing setup.
For example:
Line 1: We import Python’s built-in logging module.
Line 4: We configure the logging system to show messages at the INFO level or higher.
Line 5: logger = logging.getLogger(__name__) creates a logger instance. Using __name__ ensures the logger is properly scoped to the current module. ...