Learning Objectives
- Explain what RAG (Retrieval-Augmented Generation) is and why it matters for PMs
- Design AI-ready documentation that retrieves accurately
- Apply chunking and tagging principles to product knowledge bases
- Evaluate when RAG is the right architecture vs. fine-tuning
- Build a knowledge base audit checklist for your product
Lesson 10.1 — What Is RAG and Why Should PMs Care?
Retrieval-Augmented Generation (RAG) is the architecture behind most enterprise AI systems that answer questions about internal documents. Rather than relying on training data alone, a RAG system retrieves relevant chunks from a knowledge base and injects them into the prompt before generating a response.
RAG vs. Fine-Tuning
| Dimension | RAG | Fine-Tuning |
|---|---|---|
| When to use | Knowledge changes frequently | Style or behaviour needs to change |
| Update cost | Low — update documents, not the model | High — retrain required |
| Transparency | High — can cite sources | Low — knowledge is baked into weights |
| PM control | Full — PMs own the document layer | Limited — requires ML engineering |
| Best for | Internal wikis, product specs, support docs | Tone, domain vocabulary, task specialisation |
How RAG Works — The PM-Relevant Steps
- Ingest: Documents are split into chunks and stored with embeddings (vector representations)
- Retrieve: User query is converted to an embedding; most similar chunks are retrieved
- Augment: Retrieved chunks are injected into the prompt as context
- Generate: LLM generates a response grounded in the retrieved chunks
The PM-controllable layer is step 1 — the quality, structure, and completeness of the documents.
Lesson 10.2 — Designing AI-Ready Documentation
AI-ready documentation is structured for retrieval, not just human reading. The principles differ from traditional documentation.
Chunking Principles
| Principle | Why it matters | Practice |
|---|---|---|
| One idea per chunk | Retrieval returns the most relevant chunk; mixed topics dilute relevance | Each section header = one concept |
| Self-contained chunks | Retrieved chunks are shown without surrounding context | Each chunk should make sense in isolation |
| Consistent length | Very short or very long chunks reduce retrieval quality | Aim for 200–600 words per chunk |
| Descriptive headings | Headings are often included in the embedding signal | Use precise, keyword-rich headings |
| Metadata tagging | Enables filtered retrieval (e.g., "only search compliance docs") | Add front matter: product, team, date, category |
AI-Ready Document Template
---
title: [Precise, descriptive title]
product: [Product name]
category: [spec | policy | guide | reference]
team: [Product | Engineering | Support]
date: YYYY-MM-DD
status: [draft | review | approved | deprecated]
---
## [Clear section heading]
[Self-contained content, 200-600 words, one concept only]
## [Next concept heading]
[Self-contained content...]
Common AI-Readiness Failures
- Implicit knowledge: "As discussed in Q3 planning..." — the AI has no access to that meeting
- Nested conditionals: Long if/then/else structures become ambiguous when chunked
- Acronyms without definition: First use must spell it out, even in internal docs
- Version stacking: Old and new versions of the same policy in the same file create contradictions
- Image-only content: Diagrams with no text description are invisible to RAG
status: deprecated and moved to an archive folder — not deleted. Deletion breaks retrieval for historical queries.
Lesson 10.3 — Knowledge Base Governance
A knowledge base without governance degrades fast. As a PM, you are often responsible for the product knowledge that engineering, support, and AI systems rely on.
Knowledge Base Health Metrics
| Metric | What to measure | Target |
|---|---|---|
| Freshness | % of docs updated in last 90 days | >80% |
| Coverage | % of product features with a spec | >90% |
| Orphan rate | % of docs with no owner assigned | <10% |
| Deprecation hygiene | % of deprecated docs properly tagged | 100% |
| Retrieval accuracy | % of AI answers citing the correct doc | >85% |
Lab 10 — Knowledge Base Audit & AI-Readiness Rewrite
You will audit a set of FlowScale documents for AI-readiness and rewrite one to meet the standard.
Take any product document you have (a spec, policy, or wiki page). Paste it into your AI agent with this prompt:
Take the lowest-scoring section from your audit and rewrite it using the AI-ready document template from Lesson 10.2. Use this prompt to help:
Using the metrics table from Lesson 10.3, create a simple health snapshot for your product's knowledge base (or use FlowScale as a hypothetical). For each metric, estimate the current value and identify the single biggest improvement action.
Deliverables
- Audit report: scored document with flagged issues and recommended fixes
- Rewritten section: AI-ready version with YAML front matter and chunked structure
- Health snapshot: 5-metric table with current estimate and top action per metric
How to Verify Completion
- Audit report identifies at least 3 specific AI-readiness issues with actionable fixes
- Rewritten section has valid YAML front matter with all 6 fields populated
- Rewritten section contains no external references ("as discussed", "per Q3 planning", etc.)
- Each section in the rewrite covers exactly one concept and is 200–400 words
- Health snapshot covers all 5 metrics with realistic current estimates
Module 10 Quiz
6 questions · click an option to answer · review all before checking your score
1. What does RAG stand for and what is its core advantage over fine-tuning for PMs?
2. What is the recommended word count per document chunk for optimal RAG retrieval?
3. Which of these is an example of an AI-readiness failure in documentation?
4. Why should deprecated documents be tagged rather than deleted?
status: deprecated and archiving is the correct approach.5. What does the PM-controllable layer in a RAG pipeline primarily consist of?
6. When is fine-tuning preferable to RAG?