Skip to Main Content

Artificial Intelligence Faculty Guide

Fact-checking AI

Things to consider when fact-checking AI content:

 

AI "hallucination"

The official term in the field of AI is "hallucination." This refers to the fact that it sometimes "makes stuff up." This is because these systems are probabilistic, not deterministic.


Which models are less prone to this?

GPT-4 (the more capable model behind ChatGPT Plus and Bing Chat) has improved and is less prone to hallucination. According to OpenAI, it's "40% more likely to produce factual responses than GPT-3.5 on our internal evaluations." But it's still not perfect. So verification of the output is still needed.


ChatGPT often makes up fictional sources

One area where ChatGPT usually gives fictional answers is when asked to create a list of sources. See the Twitter thread, "Why does chatGPT make up fake academic papers?" for a useful explanation of why this happens.

 
There is progress in making these models more truthful

However, there is progress in making these systems more truthful by grounding them in external sources of knowledge. Some examples are Bing Chat and Perplexity AI, which use internet search results to ground answers. However, the Internet sources used, could also contain misinformation or disinformation. But at least with Bing Chat and Perplexity you can link to the sources used to begin verification.

 

“University of Arizona Libraries, licensed under a Creative Commons Attribution 4.0 International License.”