Google gives enterprise RAG a guard who knows when not to answer | Radar

Google introduced an agentic RAG system for Gemini Enterprise Agent Platform that checks whether it has enough context before answering. For companies, that brake matters more than another polished retrieval layer.

Agentic RAG is supposed to stop before it starts guessing

Google Research and Google Cloud describe a Cross-Corpus Retrieval framework hosted in Gemini Enterprise Agent Platform. The system splits complex questions across agents, plans follow-up steps, rewrites queries and searches across multiple corpora. The key component is the Sufficient Context Agent. It reviews the prompt, retrieved text chunks and a draft answer before the model is allowed to produce the final response.

Google says the system improves accuracy on factuality datasets by up to 34 % compared with standard RAG. On FramesQA it cites 824 queries and 2,676 PDF documents. In the cross-corpus setting, where the planner selected from four corpora, Google reports 90.1 % correct answers and latency within 3 % on average of the single-corpus version. The feature is available as a public preview in Gemini Enterprise Agent Platform.

The value is auditability, not the number of agents

Enterprise RAG usually fails in quiet ways. It misses a second document, treats an ID as a fact, or answers from only part of the evidence. Google is aiming at that specific failure mode: the agent should not be just a smarter search box, but a control mechanism that can say what is still missing.

For enterprise buyers, that is the practical shift. If an answer touches legal, finance or healthcare operations, polished prose is not enough. The team needs a trail: where the system searched, what it found, why it searched again and why it judged the context sufficient.

The context checker still depends on the quality of the corpus

The Sufficient Context Agent does not fix bad permissions, stale documents or inconsistent enterprise data. If truth is scattered across systems that do not share entities, the agentic loop may only hit the same wall more carefully.

The second risk is evals. Google shows FramesQA and internal proprietary datasets, but the real test is corporate data where documents contradict each other and users ask imprecise questions. That is where we will learn whether the context checker reduces hallucinations or just adds another confident process layer.

An audit trail will determine whether RAG stops being a marketing phrase

The signals to watch are specific: how clearly Google exposes the Sufficient Context Agent's reasons to customers and whether decisions can be audited later without manual archaeology through logs.

If enterprise teams can see not just the answer but the path to it, agentic RAG may finally become more than a marketing phrase. If not, it remains another pipeline that looks sensible until the first incident.

Lilith's verdict

The value of the system does not rest on the number of agents in the architecture. It rests on whether an answer has a readable trail back to the source, or ends up as confident text with no address.