Why does RAG need separate compliance treatment from standard GPAI integration?

The knowledge base is a governed data asset, the retrieval pipeline is a decision-making component, and the interaction between retrieved context and model behaviour creates emergent compliance properties not covered by base GPAI guidance.

What is the regulatory status of a RAG knowledge base under the EU AI Act?

Retrieved documents are inference-time context, not training data under Article 10. However, they shape outputs as profoundly as training data and must be governed with the same rigour.

How does grounding verification reduce hallucination risk in RAG systems?

A verification layer checks whether each output claim is supported by retrieved documents, using citation verification, entailment checking, or fact extraction. Outputs below the grounding threshold are flagged, held, or suppressed.

What are the prompt injection risks specific to RAG systems?

Indirect prompt injection through the knowledge base allows adversaries to embed instructions in documents. Defence requires input sanitisation, content isolation, output validation, and retrieval filtering across four layers.

When does a knowledge base update trigger a substantial modification assessment?

Organisations define a materiality threshold. Routine updates are operational maintenance; changes exceeding the threshold (new domains, demographic coverage shifts, source distribution changes) trigger the risk gate and change classification.

Does updating a RAG knowledge base require a new conformity assessment?

Not necessarily. Organisations define a materiality threshold. Routine updates (document additions within existing domains, factual corrections, outdated document removal) are operational maintenance. Changes above the threshold (new domains, demographic coverage shifts, source distribution changes exceeding defined divergence metrics) trigger the risk gate and change classification logic.

What is grounding verification and why is it a compliance requirement?

Grounding verification checks whether each claim in a RAG system's output is supported by the retrieved documents. Three approaches exist in increasing sophistication: citation verification, entailment checking via NLI models, and fact extraction and matching. For high-risk systems, ungrounded outputs are a safety and fundamental rights risk, not merely an accuracy problem.

How should organisations defend RAG systems against indirect prompt injection?

Four defence layers are required: input sanitisation screening documents for injection patterns, content isolation protecting the system prompt from override, output validation checking against expected behaviour boundaries, and retrieval filtering excluding flagged documents. Red-teaming should test all document ingestion pathways.

What bias risks are specific to RAG knowledge bases?

Knowledge base bias is distinct from model parametric bias. A model with no detectable bias can still produce biased outputs if the knowledge base underrepresents certain perspectives, demographics, geographic regions, or languages. The Technical SME must conduct a representativeness analysis documented in AISDP Module 4.

Does updating a RAG knowledge base require a new conformity assessment?

Not necessarily. Organisations define a materiality threshold. Routine updates (document additions within existing domains, factual corrections, outdated document removal) are operational maintenance. Changes above the threshold (new domains, demographic coverage shifts, source distribution changes exceeding defined divergence metrics) trigger the risk gate and change classification logic.

What is grounding verification and why is it a compliance requirement?

Grounding verification checks whether each claim in a RAG system's output is supported by the retrieved documents. Three approaches exist in increasing sophistication: citation verification, entailment checking via NLI models, and fact extraction and matching. For high-risk systems, ungrounded outputs are a safety and fundamental rights risk, not merely an accuracy problem.

How should organisations defend RAG systems against indirect prompt injection?

Four defence layers are required: input sanitisation screening documents for injection patterns, content isolation protecting the system prompt from override, output validation checking against expected behaviour boundaries, and retrieval filtering excluding flagged documents. Red-teaming should test all document ingestion pathways.

What bias risks are specific to RAG knowledge bases?

Knowledge base bias is distinct from model parametric bias. A model with no detectable bias can still produce biased outputs if the knowledge base underrepresents certain perspectives, demographics, geographic regions, or languages. The Technical SME must conduct a representativeness analysis documented in AISDP Module 4.

RAG-Specific Compliance: Knowledge Base and Retrieval Governance

Written by

Michael Clark

Chief Executive Officer, Standard Intelligence

Founder and CEO of Standard Intelligence. Author of the Practitioners Implementation Guide series for EU AI Act compliance.

Martin Dean

Chief Technology Officer, Standard Intelligence

CTO of Standard Intelligence. Leads platform engineering and contributes to the PIG series technical content.

Retrieval-augmented generation introduces compliance challenges spanning data governance, version control, cybersecurity, and post-market monitoring that the base GPAI integration guidance does not fully address. The knowledge base is a compliance-critical data asset under Article 10 principles, and the retrieval pipeline is a decision-making component requiring dedicated documentation in the AISDP.

Abstract

Read abstract

RAG is the dominant architecture for deploying GPAI models in enterprise contexts, yet it introduces compliance challenges that span multiple regulatory requirements simultaneously. The knowledge base occupies an interpretive grey area under Article 10: it is not training data, but it shapes outputs as profoundly as training data does. Organisations must govern the knowledge base with documented provenance, copyright screening, completeness assessment, currency monitoring, bias analysis, and versioning. The retrieval pipeline requires documentation in AISDP Module 3 covering embedding models, similarity thresholds, chunking strategies, and cross-lingual quality. Embedding model governance addresses bias evaluation, multilingual consistency, and version alignment between indexed and query-time embeddings. Grounding verification is the primary compensating control for hallucination risk, operating through citation verification, entailment checking, or fact extraction, with results logged for every inference. Indirect prompt injection through the knowledge base demands a four-layer defence architecture spanning input sanitisation, content isolation, output validation, and retrieval filtering. Knowledge base changes may constitute substantial modifications under Article 3, requiring organisations to define and document materiality thresholds reviewed quarterly against quantitative indicators.

Why does RAG require dedicated compliance treatment?

Regulatory Requirement

Retrieval-augmented generation introduces compliance challenges that standard GPAI integration guidance does not fully address.

Retrieval-augmented generation introduces compliance challenges that standard GPAI integration guidance does not fully address. A RAG system is not simply a model with a search function attached. The knowledge base is a governed data asset with its own lifecycle. The retrieval pipeline is a decision-making component that determines which information the model sees. The interaction between retrieved context and model behaviour creates emergent properties that neither the knowledge base nor the model exhibits in isolation.

The fundamental regulatory question is the status of the knowledge base under the EU AI Act. Article 10 governs training, validation, and testing data. Retrieved documents are none of these in the technical sense; they are inference-time context. Yet they shape the model's outputs as profoundly as training data does. A GPAI model that produces accurate, unbiased outputs on its own may produce inaccurate or biased outputs when provided with a knowledge base containing outdated, incomplete, or skewed information.

The knowledge base is, functionally, a compliance-critical data asset that must be governed with the same rigour as training data, even if the Article 10 label does not apply by strict textual interpretation. The compliance implications span data governance, version control, cybersecurity, human oversight, and post-market monitoring simultaneously. The outputs feed into aisdp Modules 3, 4, 5, 6, 8, 9, 10, and 12.

What is the regulatory status of a RAG knowledge base?

Regulatory Requirement

Article 10 applies to training, validation, and testing data, but retrieved documents occupy an interpretive grey area as inference-time context.

Article 10 applies to training, validation, and testing data, but retrieved documents occupy an interpretive grey area as inference-time context. Each characteristic of a RAG system has compliance implications that the base GPAI integration guidance does not fully address.

The knowledge base shapes outputs as decisively as training data. A model that performs well in isolation may produce harmful results when paired with a biased or outdated document collection. Regulators are likely to treat the knowledge base as a compliance-critical asset regardless of whether Article 10 formally applies by strict textual reading.

Organisations should govern the knowledge base with the same rigour applied to training data. This means documented provenance, version control, bias assessment, and ongoing currency monitoring. The practical consequence is that knowledge base governance must be woven into every stage of the compliance lifecycle, not treated as a peripheral concern.

How should knowledge base provenance and copyright be managed?

Engineering Approach

Every document in the knowledge base must have documented provenance covering the source, acquisition date, licence or permission, and conditions attached to use.

Every document in the knowledge base must have documented provenance covering the source, acquisition date, licence or permission, and conditions attached to use. Copyright exposure is significant: a RAG system that retrieves and reproduces copyrighted content in its outputs may expose the deploying organisation to infringement claims distinct from the GPAI model provider's own obligations under Article 53.

The Technical SME maintains a knowledge base catalogue recording, for each document or document source: the original author or publisher; the acquisition date; the licence type (proprietary, Creative Commons, public domain, contractual licence); any usage restrictions; the expiry date if the licence is time-limited; and the last review date.

Automated copyright screening. For knowledge bases assembled from web-scraped or publicly available sources, automated screening reduces the risk of inadvertent infringement. The screening checks each document against known copyright databases, robots.txt restrictions, and the organisation's internal list of prohibited sources. Documents that fail are quarantined pending manual review by the Legal and Regulatory Advisor.

Completeness assessment. The Technical SME defines, before deployment, the domains and topics the knowledge base must cover. The completeness assessment maps each required domain to the documents that address it, identifies gaps, and documents the remediation plan. The assessment is recorded in AISDP Module 4.

How is bias in the knowledge base identified and addressed?

Engineering Approach

A knowledge base assembled from sources that underrepresent certain perspectives, demographics, or viewpoints will produce outputs reflecting those gaps.

A knowledge base assembled from sources that underrepresent certain perspectives, demographics, or viewpoints will produce outputs reflecting those gaps. The fairness assessment for a RAG system must address not only the GPAI model's parametric biases but also the representational biases in the knowledge base.

The Technical SME conducts a representativeness analysis assessing whether the document collection adequately covers: all geographic regions relevant to the system's deployment; all demographic groups within the affected population; all perspectives relevant to the domain, including dissenting or minority viewpoints where outputs may influence decisions affecting diverse groups; and all languages in which the system operates.

The analysis is documented in AISDP Module 4. Where gaps are identified, the remediation approach is documented with a timeline. Bias in the knowledge base is distinct from bias in the model itself, and both must be assessed independently. A model with no detectable parametric bias can still produce biased outputs if the knowledge base systematically excludes relevant perspectives.

What governance applies to the retrieval pipeline?

Engineering Approach

The retrieval pipeline determines which documents the GPAI model sees, making it a decision-making component with direct impact on outputs.

The retrieval pipeline determines which documents the GPAI model sees, making it a decision-making component with direct impact on outputs. Its behaviour must be documented in AISDP Module 3 with the same rigour as the model inference layer.

Documentation covers: the embedding model used for semantic search (version, provider, known limitations); the similarity metric and threshold; the number of documents retrieved (top-k); the re-ranking strategy if any; the chunking strategy for splitting documents into retrievable segments; and the metadata filtering logic if retrieval is constrained by document attributes.

Embedding model governance. The embedding model that converts queries and documents into vectors is itself an AI component that introduces bias, accuracy, and version control considerations. Three requirements apply specifically to RAG systems.

First, embedding bias evaluation. The Technical SME evaluates whether the embedding model produces systematically different retrieval results for semantically equivalent queries phrased in different ways. A query about "maternity leave entitlements" should retrieve the same documents as one about "parental leave for mothers." If the embedding model's semantic space encodes demographic biases, the retrieval pipeline may systematically underserve certain query formulations.

Second, cross-lingual retrieval quality. For multilingual RAG systems, the Technical SME evaluates whether retrieval quality is consistent across languages. A system that retrieves comprehensive, relevant documents for English queries but sparse, tangential documents for Polish or Romanian queries produces systematically different output quality across language communities.

How does grounding verification work as a compliance control?

Compensating Controls

Hallucination is the defining compliance risk for RAG systems: outputs not supported by retrieved context may be incorrect, misleading, or harmful.

Hallucination is the defining compliance risk for RAG systems: outputs not supported by retrieved context may be incorrect, misleading, or harmful. For high-risk applications, hallucination is not merely an accuracy problem but a safety and fundamental rights risk. The Technical SME implements a grounding verification layer that checks whether each claim in the model's output is supported by the retrieved documents.

Implementation approaches. Three approaches are available in increasing order of sophistication. Citation verification requires the model to cite specific passages from retrieved documents, and the verification layer checks that cited passages exist and support the claim. This is the simplest approach and the most auditable. Entailment checking uses a separate NLI (natural language inference) model to evaluate whether each claim is entailed by the retrieved context. Fact extraction and matching extracts factual claims from the output and matches them against facts extracted from the retrieved documents.

The verification operates at inference time and produces a grounding score for each output. Outputs below the grounding threshold are flagged, held for human review, or suppressed, depending on risk profile and severity of potential harm. Results are logged for every inference and documented in AISDP Module 5.

Grounding failure handling. The system's response to a grounding failure must be defined in the AISDP and implemented in the inference pipeline. Options range from appending a disclaimer ("This response could not be fully verified against the knowledge base") to suppressing the output entirely and returning a fallback response. For high-risk systems where ungrounded outputs could cause harm, suppression is the safer default. The AI Governance Lead defines the grounding failure policy; the Technical SME implements it.

How are RAG systems defended against prompt injection?

Compensating Controls

RAG systems face a specific attack vector distinct from direct prompt injection: indirect prompt injection through the knowledge base.

RAG systems face a specific attack vector distinct from direct prompt injection: indirect prompt injection through the knowledge base. An adversary who can insert or modify documents in the knowledge base can embed instructions that the GPAI model may follow when the document is retrieved as context.

Attack surface. The knowledge base is the attack surface. Any pathway through which documents enter is a potential injection vector: user-uploaded documents, web-scraped content, partner data feeds, and automated document ingestion pipelines. A malicious document containing hidden instructions may be retrieved by the RAG pipeline and processed by the model as context, causing outputs that violate the system's intended behaviour.

Defence architecture. The Technical SME implements four layers of defence. Input sanitisation screens documents for injection patterns before they enter the knowledge base. Content isolation ensures the model's system prompt is protected from override by retrieved content, using the provider's recommended separation mechanisms. Output validation checks the model's output against expected behaviour boundaries regardless of retrieved context. Retrieval filtering excludes documents flagged as potentially adversarial from retrieval results.

The cybersecurity testing programme should include specific test cases for indirect prompt injection through the knowledge base. Red-teaming exercises should attempt to inject malicious documents through each ingestion pathway and verify that defence layers detect and block the injection. Results are documented in AISDP Module 9.

When does a knowledge base change trigger a substantial modification assessment?

Regulatory Requirement

Whether a change to the knowledge base alone constitutes a substantial modification under Article 3 does not have a definitive regulatory answer.

Whether a change to the knowledge base alone constitutes a substantial modification under Article 3 does not have a definitive regulatory answer. The analysis requires balancing two positions.

The case for substantial modification. A material change to the knowledge base changes the information available to the model, which changes outputs, which may change compliance with the Chapter 2 requirements. If the knowledge base is updated to include documents introducing new bias, containing incorrect information about a protected group, or removing previously available information, the system's fairness, accuracy, and safety profiles may all be affected.

The case against. Knowledge base updates are normal operational activity for any information system. Treating every document addition or removal as a substantial modification would make RAG systems operationally impractical for high-risk applications, because every update would trigger a new conformity assessment.

The practical approach. The AI Governance Lead defines a materiality threshold for knowledge base changes. Changes below the threshold (routine document additions within the existing domain, corrections to factual errors, removal of outdated documents) are treated as normal operational maintenance documented in AISDP Module 12. Changes above the threshold (introduction of a new domain or topic area, material change to demographic coverage, significant change to the proportion of sources from a particular perspective or origin) trigger the governance pipeline's risk gate and change classification logic.

RAG-Specific Compliance: Knowledge Base and Retrieval Governance

Written by

Why does RAG require dedicated compliance treatment?

What is the regulatory status of a RAG knowledge base?

How should knowledge base provenance and copyright be managed?

How is bias in the knowledge base identified and addressed?

What governance applies to the retrieval pipeline?

How does grounding verification work as a compliance control?

How are RAG systems defended against prompt injection?

When does a knowledge base change trigger a substantial modification assessment?

Frequently Asked Questions

Related Pages

Start your compliance journey