Technical AI
What Is RAG?
Retrieval-augmented generation — the technique that gives AI access to real-time facts, company documents, and up-to-date information without retraining the entire model.
The problem
Standard LLMs have two fundamental limitations: their knowledge is frozen at training cutoff, and they can't access private data. Ask ChatGPT what happened in the news yesterday and it'll either admit it doesn't know or hallucinate a plausible-sounding answer. Ask it to summarise your company's internal documentation and it's completely blind.
RAG solves this by splitting the problem in two: a retrieval system finds relevant information at query time, and a generation model synthesises the retrieved information into a response. The model doesn't need to memorise everything — it just needs to be good at using information you hand it in context.
How it works
RAG adds a retrieval step before generation. Here's the full pipeline from query to response.
RAG vs fine-tuning
Both RAG and fine-tuning can help models work better with domain-specific information. They solve different problems.
| Factor | RAG | Fine-tuning |
|---|---|---|
| Best for | Accessing up-to-date or private documents | Changing model behaviour, tone, or format |
| Knowledge updates | Instant — update the document store | Requires full retraining cycle |
| Cost | Low — infrastructure + inference costs | High — GPU compute for training |
| Hallucination risk | Lower — facts grounded in retrieved docs | Medium — model can still confabulate |
| Source attribution | Built-in — you know where facts came from | Not available — knowledge is implicit |
| Style/behaviour change | Limited — prompt engineering required | Strong — model learns new patterns |
Most production AI systems use both: fine-tuning to establish the right behaviour and response style, RAG to provide the knowledge. Neither approach alone is sufficient for enterprise use cases.
Use cases
FAQ
Sources
Get AI insights every week
The AI Briefing covers what actually matters in AI — no hype, no jargon, just what you need to stay ahead.
Subscribe free