Scoping and Planning a Production RAG Application

Discover how to apply the 4D LLMOps lifecycle to build a reliable, scalable Retrieval-Augmented Generation (RAG) HR assistant. Learn to scope requirements, select simple tools like FastAPI and PostgreSQL for vector storage, and map engineering tasks to quality gates. Understand how to transform raw markdown documentation into structured data, implement ingestion and inference pipelines, and prepare for containerized deployment. This lesson sets the foundation for building and testing a production RAG system.

We'll cover the following...

An HR support assistant
The application architecture
Executing the 4D Framework
Phase 1: Discover
Conclusion

The RAG application we will build is an HR support assistant for a fictional company. We will build upon this throughout the remainder of the course. We also map the 4D framework directly to the engineering work we will execute.

This will provide a consistent framework for determining what to build next, measuring progress, and identifying when a phase is complete.

An HR support assistant

Assume we are the engineering team at a fictional company called Halluli that does not have a dedicated HR department. All company policies and processes are documented. New hires currently have to search through hundreds of pages of Markdown documentation ...

1.The Evolution of Modern AI Systems

2.LLMOps Core Concepts

3.Phase 1: Discover and Data Engineering

4.Phase 2: Distill and The Core Engine

5.Phase 3: Deploy and Hardening

6.Phase 4: Deliver and Evolution

Scoping and Planning a Production RAG Application

An HR support assistant