LLM Hallucinations — Why AI Makes Things Up

Last updated: June 23, 2026 · 8 min read

Hallucinations are the most significant reliability problem with Large Language Models. Understanding why they happen and how to mitigate them is essential for anyone building or using AI applications.

What Are Hallucinations?

A LLM hallucination occurs when a language model generates information that sounds confident and plausible but is factually incorrect, fabricated, or not supported by its training data. The term "hallucination" is borrowed from psychology, but it is somewhat misleading — the model is not perceiving things that are not there. It is generating text that follows the patterns of its training data without verifying whether the output is true.

Hallucinations are not bugs in the traditional sense. They are a fundamental consequence of how LLMs work. A language model is trained to predict the most likely next token given a sequence of text. It optimizes for fluency and plausibility, not for truth. When the model does not have the correct answer, it does not say "I don't know" — it generates something that looks like a correct answer.

A Real Example

Ask an LLM: "Tell me about the 2019 paper 'Attention Is All You Need' by Smith et al."

The model might respond with a detailed summary of a paper, including methodology, results, and conclusions — all completely fabricated. The real paper is by Vaswani et al., but the model fills in the gap with plausible-sounding nonsense because it has learned the pattern of "summarize a research paper" and generates text that fits that pattern.

What makes hallucinations dangerous is that they are indistinguishable from correct responses in terms of confidence, detail, and formatting. A hallucinated answer reads exactly like a correct one.

Why LLMs Hallucinate

Hallucinations arise from several fundamental properties of how LLMs are built and trained:

1. Next-Token Prediction, Not Fact-Checking

LLMs are trained to predict the next token in a sequence. This objective rewards fluency and coherence, not accuracy. The model learns that certain token sequences are more likely than others, but it has no mechanism to verify whether those sequences correspond to real-world facts. When the model is uncertain, it produces the most statistically likely continuation, which may be wrong.

2. Training Data Limitations

LLMs are trained on large but finite datasets. If a fact appears rarely, is absent from the training data, or is contradicted by different sources, the model may generate incorrect information. The training data itself contains errors, outdated information, and contradictions — and the model absorbs all of it.

3. No World Model

LLMs do not have a true understanding of the world. They have learned statistical patterns in text. They can produce text that describes causal relationships, physical laws, and logical arguments, but they do not truly understand these concepts. This means they can generate text that violates common sense or physical reality.

4. Sycophancy and Confidence Bias

During training (especially RLHF), models are rewarded for being helpful and providing answers. This creates a bias toward giving confident, complete responses even when the model should say "I don't know" or "I'm not sure." The model learns that providing an answer is rewarded more than expressing uncertainty.

5. Long-Context Degradation

When processing long documents or conversations, LLMs can lose track of details from earlier in the context. This leads to contradictions, fabrications, or mixing up information from different parts of the input. The problem is worse with smaller models and longer contexts.

Types of Hallucinations

Not all hallucinations are the same. Understanding the types helps you identify and address them:

Factual Fabrication

The model invents facts that do not exist: fake citations, non-existent people, made-up statistics, fictional events presented as real. This is the most common and dangerous type.

Example: "According to a 2023 study by Stanford (Johnson et al.), 73% of AI models exhibit..." — the study, authors, and statistic are all fabricated.

Factual Errors

The model states something that is verifiably wrong, but the error is a distortion of real information rather than a pure fabrication.

Example: "The Eiffel Tower was built in 1889 and is 324 meters tall" — correct. "The Eiffel Tower was built in 1887" — wrong by 2 years.

Reasoning Errors

The model makes logical mistakes, mathematical errors, or flawed deductions. These are especially common in multi-step reasoning tasks.

Example: Correctly identifying that a shirt costs $20 more than a $10 pair of pants, then concluding the pants cost $30.

Instruction Contradiction

The model ignores or contradicts explicit instructions in the prompt or system message.

Example: Being told "Only use information from the provided document" and then adding information not in the document.

Contextual Confabulation

The model mixes up information from different parts of the conversation or document, creating coherent but incorrect connections.

Example: In a long conversation about two different people, the model attributes one person's achievements to the other.

How to Detect Hallucinations

Detecting hallucinations is challenging because, by definition, they sound correct. Here are practical strategies:

Ask for Sources

Ask the model to cite its sources. If it provides a citation, verify it. Fabricated citations are one of the easiest hallucinations to catch — the paper, book, or URL simply does not exist. Be aware that more advanced models are getting better at generating plausible-looking citations.

Cross-Reference with Reliable Sources

For any factual claim that matters, verify it against a trusted source. Do not rely solely on LLM output for medical, legal, financial, or safety-critical information.

Use Multiple Models

Ask the same question to different LLMs. If they disagree, at least one is hallucinating. This is not foolproof (they can all hallucinate the same way if the false information is common in training data), but disagreements are a strong signal.

Check for Specificity

Hallucinations often include suspiciously specific details: exact dates, precise statistics, specific names. Real knowledge tends to come with uncertainty ("approximately," "around," "I believe"). Overly specific claims without hedging can be a red flag.

Look for Consistency

Ask the same question in different ways. If the model gives contradictory answers, at least one is a hallucination. Consistent answers across different phrasings increase confidence (but do not guarantee correctness).

Use RAG with Citations

When using RAG, require the model to cite specific passages from the retrieved documents. This makes it possible to verify whether the model's answer is actually supported by the source material.

Strategies to Reduce Hallucinations

While hallucinations cannot be completely eliminated, several techniques can significantly reduce their frequency:

1. Retrieval-Augmented Generation (RAG)

The single most effective technique. RAG retrieves relevant documents at query time and passes them as context to the LLM. The model generates answers grounded in real, verified information rather than relying on its training data. This is the gold standard for reducing hallucinations in production applications. See our RAG guide for implementation details.

2. System Prompts with Guardrails

Instruct the model to say "I don't know" when uncertain:

"If you are not confident in your answer or cannot find the information in the provided context, say 'I don't have enough information to answer this question.' Do not guess or fabricate information."

This reduces sycophancy but does not eliminate hallucinations — the model may still be confidently wrong.

3. Lower Temperature

Temperature controls randomness. Lower values (0-0.3) make the model more deterministic and conservative, reducing creative but incorrect outputs. Use temperature=0 for factual queries.

4. Chain-of-Thought Prompting

Ask the model to "think step by step" or "show your reasoning." This forces the model to work through a problem methodically rather than jumping to a conclusion. Chain-of-thought reduces reasoning errors and makes it easier to spot where the logic goes wrong.

5. Grounding with Context

Always provide relevant context in your prompts. Instead of asking "What is X?", ask "Based on the following text, what is X?" and include the relevant text. The model is far less likely to hallucinate when it has the correct information right in front of it.

6. Self-Consistency Checking

Generate multiple responses to the same question (using higher temperature for diversity) and take the majority answer. If the responses disagree, the model is uncertain and may be hallucinating. This is computationally expensive but effective for critical queries.

7. Fine-Tuning on Domain Data

Fine-tuning a model on high-quality, domain-specific data reduces hallucinations in that domain. The model learns the correct facts and patterns for your specific use case, reducing reliance on potentially incorrect training data. However, fine-tuning is expensive and requires careful data curation.

RAG as the Best Defense

Of all the techniques for reducing hallucinations, Retrieval-Augmented Generation (RAG) is the most effective and widely adopted. It addresses the root cause: the model generates false information because it does not have access to the correct information.

How RAG Reduces Hallucinations

In a RAG pipeline, the user's question is used to retrieve relevant documents from a trusted knowledge base. These documents are inserted into the prompt as context. The model then generates its answer based on the provided context rather than its internal training data.

This approach has several advantages for hallucination reduction:

Grounded in sources: The model's answer is based on specific, verifiable documents rather than fuzzy training data.
Citable: You can require the model to cite which document (and which passage) each claim comes from.
Updatable: When your knowledge base changes, the model's answers change too — no retraining needed.
Auditable: You can inspect the retrieved documents to understand why the model gave a particular answer.

RAG Is Not Perfect

RAG reduces hallucinations significantly but does not eliminate them. The model can still:

Ignore the provided context and generate from training data
Misinterpret or misrepresent what the documents say
Hallucinate connections between different retrieved passages
Generate plausible-sounding additions to the context

The best practice is to combine RAG with other techniques: low temperature, chain-of-thought reasoning, source citation requirements, and output validation. For a complete implementation guide, see our RAG Explained article and LangChain tutorial.

Frequently Asked Questions

What is an LLM hallucination?

An LLM hallucination is when a language model generates confident-sounding information that is factually incorrect, fabricated, or not supported by its training data. The model presents false information as if it were true, often with specific details like names, dates, or citations that do not exist.

Why do LLMs hallucinate?

LLMs hallucinate because they are trained to predict the next likely token, not to verify facts. They optimize for fluency and plausibility, not accuracy. They also lack a true understanding of the world, cannot distinguish between what they know and do not know, and were trained on data that may contain errors or biases.

How can I reduce LLM hallucinations?

The most effective strategies are: RAG (Retrieval-Augmented Generation) to ground responses in real documents, asking the model to cite sources, using temperature=0 for less creative output, breaking complex questions into smaller steps, and always verifying critical information from LLM outputs.

Can hallucinations be completely eliminated?

No, hallucinations cannot be completely eliminated with current LLM technology. They are a fundamental consequence of how these models work — predicting plausible text rather than verifying facts. However, techniques like RAG, grounding, and chain-of-thought reasoning can significantly reduce their frequency.