From Similarity to Structure
Most production RAG pipelines index enterprise knowledge as flat text and retrieve it by vector similarity alone — which works for lookup questions but fails quietly when the answer lives in the relationships between entities. This paper offers practitioners a workload-first framework: a taxonomy of five question shapes flat retrieval can't answer reliably, a five-level maturity ladder for graph-aware retrieval, a reference architecture, an evaluation approach, and a phased adoption roadmap. The central recommendation is deliberately conservative — audit your real query workload first, then adopt the lowest rung that makes your critical questions answerable.
