Retrieval-augmented generation designed for multi-agent AI is fundamentally different from chatbot RAG. This live workshop teaches the production RAG architecture built for agent systems: shared knowledge access, citation propagation across agent handoffs, and memory-augmented retrieval via MCP.
By Packt Publishing · Refunds up to 10 days before
Single-agent RAG retrieves and generates in one step. Multi-agent RAG must handle concurrent retrieval from multiple agents, maintain citation chains through agent handoffs, share episodic memory across the agent system, and expose retrieval as a composable MCP service. Each of these requirements changes the RAG architecture significantly.
Context engineering is the discipline of designing systems that give AI the right information, in the right format, to reason and act reliably. It goes beyond prompt engineering — building structured, deterministic systems that scale in production.
A multi-agent system uses multiple specialised AI agents working together — each with a defined role, context, and tools — to complete complex tasks no single agent could handle reliably. Context engineering makes them predictable.
MCP is Anthropic's open standard for connecting AI models to tools, data sources, and other agents. It provides structured agent orchestration with clear context boundaries — making systems transparent and debuggable.
Context engineering requires hands-on practice to truly understand. This live workshop lets you build a working system with a world-class instructor answering your questions in real time.
Six modules. Six hours. A production-ready context-engineered AI system by the time you finish.
Understand why prompts fail at scale and how semantic blueprints give AI structured, goal-driven contextual awareness.
Design and orchestrate multi-agent workflows using the Model Context Protocol. Build transparent, traceable agent systems.
Build RAG pipelines that deliver accurate, cited responses. Engineer memory systems that persist context reliably across agents.
Architect a transparent, explainable context engine where every decision is traceable and debuggable in production.
Implement safeguards against prompt injection and data poisoning. Enforce trust boundaries in multi-agent environments.
Deploy your context-engineered system to production. Apply patterns for scaling, monitoring, and reliability.
Concrete working deliverables — not just theory and slides.
A working Glass-Box Context Engine with transparent, traceable reasoning
Multi-agent workflow orchestrated with the Model Context Protocol
High-fidelity RAG pipeline with memory and citations
Safeguards against prompt injection and data poisoning
Reusable architecture patterns for production AI systems
Certificate of completion from Packt Publishing
Denis Rothman brings decades of production AI engineering experience to this live workshop.
Denis Rothman is a bestselling AI author with over 30 years of experience in artificial intelligence, agent systems, and optimization. He has authored multiple cutting-edge AI books published by Packt and is renowned for making complex AI architecture concepts practical and immediately applicable. He guides you step by step through building production-ready context-engineered multi-agent systems — answering your questions live throughout the 6-hour session.
Intermediate to advanced workshop. Solid Python and basic LLM experience required.
Everything you need to know before registering.
In a multi-agent system, retrieval-augmented generation must handle: concurrent queries from multiple agents without conflicts, citation metadata that propagates through agent-to-agent handoffs preserving source attribution, shared episodic memory that multiple agents can read and contribute to, retrieval confidence information that informs downstream agent routing decisions, and exposure through a typed MCP interface so retrieval can be invoked by any agent in the system.
Citation chains in multi-agent RAG work by attaching structured citation objects to every retrieved fact as it moves through the agent system. When agent A retrieves a document and uses a fact in its output, the citation metadata (document ID, section, confidence score) is attached to that fact. When agent B processes agent A's output, the citation metadata travels with the fact. The final synthesis agent can then verify that every claim in the system's output traces to an original retrieved source.
RAG serves as the knowledge grounding layer of the Glass-Box Context Engine. The context engine constructs semantic blueprints for each agent that include a knowledge section populated by RAG retrievals. The Glass-Box logging layer captures every retrieval operation, which makes the knowledge basis of every agent decision observable and auditable. Without RAG, agent decisions cannot be traced to specific knowledge sources.
Preventing RAG from becoming a bottleneck requires three optimizations: connection pooling so multiple agents can query the vector store concurrently without blocking each other, a semantic cache that serves results for similar queries without re-executing the embedding search, and async retrieval that allows agents to begin processing while retrieval is still running for the portions of the response that do not require knowledge grounding.
Yes. Once the knowledge base is embedded and stored in the vector store, retrieval-augmented generation works completely offline. The embedding computations happen once during indexing (which requires access to an embedding model). Retrieval at query time is purely a vector similarity search against the pre-built index. The workshop covers setting up a fully offline RAG pipeline that requires no external API calls during operation.
Knowledge base synchronization uses a document change monitoring pipeline that detects when source documents are modified, re-computes embeddings for changed sections, updates the vector store index incrementally, and invalidates any episodic memory entries that referenced the changed content. The workshop covers implementing this synchronization pipeline so your RAG knowledge base stays current without requiring full re-indexing when source documents change.
6 hours. Bestselling AI author. Production context-engineered multi-agent system by the end. Seats are limited.
Register Now →Saturday April 25 · 9am to 3pm EDT · Online · Packt Publishing · Cohort 2