LLM Hallucinations — Why AI Makes Things Up
Hallucinations are the most significant reliability problem with Large Language Models. Understanding why they happen and how to mitigate them is essential for anyone building or using AI applications.
What Are Hallucinations?
A LLM hallucination occurs when a language model generates information that sounds confident and plausible but is factually incorrect, fabricated, or not supported by its training data. The term "hallucination" is borrowed from psychology, but it is somewhat misleading — the model is not perceiving things that are not there. It is generating text that follows the patterns of its training data without verifying whether the output is true.
Hallucinations are not bugs in the traditional sense. They are a fundamental consequence of how LLMs work. A language model is trained to predict the most likely next token given a sequence of text. It optimizes for fluency and plausibility, not for truth. When the model does not have the correct answer, it does not say "I don't know" — it generates something that looks like a correct answer.
A Real Example
Ask an LLM: "Tell me about the 2019 paper 'Attention Is All You Need' by Smith et al."
The model might respond with a detailed summary of a paper, including methodology, results, and conclusions — all completely fabricated. The real paper is by Vaswani et al., but the model fills in the gap with plausible-sounding nonsense because it has learned the pattern of "summarize a research paper" and generates text that fits that pattern.
What makes hallucinations dangerous is that they are indistinguishable from correct responses in terms of confidence, detail, and formatting. A hallucinated answer reads exactly like a correct one.
Why LLMs Hallucinate
Hallucinations arise from several fundamental properties of how LLMs are built and trained:
1. Next-Token Prediction, Not Fact-Checking
LLMs are trained to predict the next token in a sequence. This objective rewards fluency and coherence, not accuracy. The model learns that certain token sequences are more likely than others, but it has no mechanism to verify whether those sequences correspond to real-world facts. When the model is uncertain, it produces the most statistically likely continuation, which may be wrong.
2. Training Data Limitations
LLMs are trained on large but finite datasets. If a fact appears rarely, is absent from the training data, or is contradicted by different sources, the model may generate incorrect information. The training data itself contains errors, outdated information, and contradictions — and the model absorbs all of it.
3. No World Model
LLMs do not have a true understanding of the world. They have learned statistical patterns in text. They can produce text that describes causal relationships, physical laws, and logical arguments, but they do not truly understand these concepts. This means they can generate text that violates common sense or physical reality.
4. Sycophancy and Confidence Bias
During training (especially RLHF), models are rewarded for being helpful and providing answers. This creates a bias toward giving confident, complete responses even when the model should say "I don't know" or "I'm not sure." The model learns that providing an answer is rewarded more than expressing uncertainty.
5. Long-Context Degradation
When processing long documents or conversations, LLMs can lose track of details from earlier in the context. This leads to contradictions, fabrications, or mixing up information from different parts of the input. The problem is worse with smaller models and longer contexts.
Types of Hallucinations
Not all hallucinations are the same. Understanding the types helps you identify and address them:
Factual Fabrication
The model invents facts that do not exist: fake citations, non-existent people, made-up statistics, fictional events presented as real. This is the most common and dangerous type.
- Example: "According to a 2023 study by Stanford (Johnson et al.), 73% of AI models exhibit..." — the study, authors, and statistic are all fabricated.
Factual Errors
The model states something that is verifiably wrong, but the error is a distortion of real information rather than a pure fabrication.
- Example: "The Eiffel Tower was built in 1889 and is 324 meters tall" — correct. "The Eiffel Tower was built in 1887" — wrong by 2 years.
Reasoning Errors
The model makes logical mistakes, mathematical errors, or flawed deductions. These are especially common in multi-step reasoning tasks.
- Example: Correctly identifying that a shirt costs $20 more than a $10 pair of pants, then concluding the pants cost $30.
Instruction Contradiction
The model ignores or contradicts explicit instructions in the prompt or system message.
- Example: Being told "Only use information from the provided document" and then adding information not in the document.
Contextual Confabulation
The model mixes up information from different parts of the conversation or document, creating coherent but incorrect connections.
- Example: In a long conversation about two different people, the model attributes one person's achievements to the other.
How to Detect Hallucinations
Detecting hallucinations is challenging because, by definition, they sound correct. Here are practical strategies:
Ask for Sources
Ask the model to cite its sources. If it provides a citation, verify it. Fabricated citations are one of the easiest hallucinations to catch — the paper, book, or URL simply does not exist. Be aware that more advanced models are getting better at generating plausible-looking citations.
Cross-Reference with Reliable Sources
For any factual claim that matters, verify it against a trusted source. Do not rely solely on LLM output for medical, legal, financial, or safety-critical information.
Use Multiple Models
Ask the same question to different LLMs. If they disagree, at least one is hallucinating. This is not foolproof (they can all hallucinate the same way if the false information is common in training data), but disagreements are a strong signal.
Check for Specificity
Hallucinations often include suspiciously specific details: exact dates, precise statistics, specific names. Real knowledge tends to come with uncertainty ("approximately," "around," "I believe"). Overly specific claims without hedging can be a red flag.
Look for Consistency
Ask the same question in different ways. If the model gives contradictory answers, at least one is a hallucination. Consistent answers across different phrasings increase confidence (but do not guarantee correctness).
Use RAG with Citations
When using RAG, require the model to cite specific passages from the retrieved documents. This makes it possible to verify whether the model's answer is actually supported by the source material.
Strategies to Reduce Hallucinations
While hallucinations cannot be completely eliminated, several techniques can significantly reduce their frequency:
1. Retrieval-Augmented Generation (RAG)
The single most effective technique. RAG retrieves relevant documents at query time and passes them as context to the LLM. The model generates answers grounded in real, verified information rather than relying on its training data. This is the gold standard for reducing hallucinations in production applications. See our RAG guide for implementation details.
2. System Prompts with Guardrails
Instruct the model to say "I don't know" when uncertain:
"If you are not confident in your answer or cannot find the information in the provided context, say 'I don't have enough information to answer this question.' Do not guess or fabricate information."
This reduces sycophancy but does not eliminate hallucinations — the model may still be confidently wrong.
3. Lower Temperature
Temperature controls randomness. Lower values (0-0.3) make the model more deterministic and conservative, reducing creative but incorrect outputs. Use temperature=0 for factual queries.
4. Chain-of-Thought Prompting
Ask the model to "think step by step" or "show your reasoning." This forces the model to work through a problem methodically rather than jumping to a conclusion. Chain-of-thought reduces reasoning errors and makes it easier to spot where the logic goes wrong.
5. Grounding with Context
Always provide relevant context in your prompts. Instead of asking "What is X?", ask "Based on the following text, what is X?" and include the relevant text. The model is far less likely to hallucinate when it has the correct information right in front of it.
6. Self-Consistency Checking
Generate multiple responses to the same question (using higher temperature for diversity) and take the majority answer. If the responses disagree, the model is uncertain and may be hallucinating. This is computationally expensive but effective for critical queries.
7. Fine-Tuning on Domain Data
Fine-tuning a model on high-quality, domain-specific data reduces hallucinations in that domain. The model learns the correct facts and patterns for your specific use case, reducing reliance on potentially incorrect training data. However, fine-tuning is expensive and requires careful data curation.
RAG as the Best Defense
Of all the techniques for reducing hallucinations, Retrieval-Augmented Generation (RAG) is the most effective and widely adopted. It addresses the root cause: the model generates false information because it does not have access to the correct information.
How RAG Reduces Hallucinations
In a RAG pipeline, the user's question is used to retrieve relevant documents from a trusted knowledge base. These documents are inserted into the prompt as context. The model then generates its answer based on the provided context rather than its internal training data.
This approach has several advantages for hallucination reduction:
- Grounded in sources: The model's answer is based on specific, verifiable documents rather than fuzzy training data.
- Citable: You can require the model to cite which document (and which passage) each claim comes from.
- Updatable: When your knowledge base changes, the model's answers change too — no retraining needed.
- Auditable: You can inspect the retrieved documents to understand why the model gave a particular answer.
RAG Is Not Perfect
RAG reduces hallucinations significantly but does not eliminate them. The model can still:
- Ignore the provided context and generate from training data
- Misinterpret or misrepresent what the documents say
- Hallucinate connections between different retrieved passages
- Generate plausible-sounding additions to the context
The best practice is to combine RAG with other techniques: low temperature, chain-of-thought reasoning, source citation requirements, and output validation. For a complete implementation guide, see our RAG Explained article and LangChain tutorial.
Frequently Asked Questions
What is an LLM hallucination?
An LLM hallucination is when a language model generates confident-sounding information that is factually incorrect, fabricated, or not supported by its training data. The model presents false information as if it were true, often with specific details like names, dates, or citations that do not exist.
Why do LLMs hallucinate?
LLMs hallucinate because they are trained to predict the next likely token, not to verify facts. They optimize for fluency and plausibility, not accuracy. They also lack a true understanding of the world, cannot distinguish between what they know and do not know, and were trained on data that may contain errors or biases.
How can I reduce LLM hallucinations?
The most effective strategies are: RAG (Retrieval-Augmented Generation) to ground responses in real documents, asking the model to cite sources, using temperature=0 for less creative output, breaking complex questions into smaller steps, and always verifying critical information from LLM outputs.
Can hallucinations be completely eliminated?
No, hallucinations cannot be completely eliminated with current LLM technology. They are a fundamental consequence of how these models work — predicting plausible text rather than verifying facts. However, techniques like RAG, grounding, and chain-of-thought reasoning can significantly reduce their frequency.