Truth and Factuality Failures
Hallucination
The AI gives an answer that sounds confident but is partly or completely made up.
Why this severity?
High · Weighted rubric score: 3.1 / 4.0
High because hallucinations are common, can look credible, and may affect research, security, legal, medical, financial, or professional decisions when users do not verify the output.
Top contributing factors
Likelihood
4/4
Hallucination is a common failure mode across many LLM workflows, especially when the model lacks grounding or current context.
Detectability
3/4
False information can be difficult to notice when it is fluent, specific, and presented confidently.
Human overtrust
3/4
Users may trust polished answers without checking sources, especially when the answer sounds complete.
Impact
3/4
Impact can become serious when hallucinated claims affect academic, professional, security, legal, medical, or financial decisions.
Context that can raise severity
- The answer affects professional, legal, medical, financial, security, or safety decisions.
- The model provides specific names, dates, citations, links, or numbers without verification.
- Users are likely to act on the answer without checking external sources.
Context that can lower severity
- The task is casual, creative, or low-stakes.
- The user verifies key claims against reliable sources.
- The model is grounded in trusted source material and uncertainty is clearly stated.
Example
An AI assistant gives a detailed explanation of a court case, academic paper, API function, or CVE that does not actually exist.
Warning signs
- The answer sounds very specific but does not cite checkable sources.
- The model gives names, dates, links, or numbers without evidence.
- The response changes when asked the same question again.
Mitigations
- Verify important claims against reliable external sources.
- Ask the model to separate confirmed facts from assumptions.
- Request exact source names, dates, and quoted evidence.