Understanding Knowledge Drift in LLMs Through Misinformation

Alina Fastowski, Gjergji Kasneci

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Large Language Models (LLMs) have revolutionized numerous applications, making them an integral part of our digital ecosystem. However, their reliability becomes critical, especially when these models are exposed to misinformation. This paper primarily analyzes the susceptibility of state-of-the-art LLMs to factual inaccuracies when they encounter false information in a Q&A scenario, an issue that can lead to a phenomenon we refer to as knowledge drift, which significantly undermines the trustworthiness of these models. We evaluate the factuality and the uncertainty of the models’ responses relying on Entropy, Perplexity, and Token Probability metrics. Our experiments reveal that an LLM’s uncertainty can increase up to 56.6% when the question is answered incorrectly due to the exposure to false information. At the same time, repeated exposure to the same false information can decrease the models’ uncertainty again (-52.8% w.r.t. the answers on the untainted prompts), potentially manipulating the underlying model’s beliefs and introducing a drift from its original knowledge. These findings provide insights into LLMs’ robustness and vulnerability to adversarial inputs, paving the way for developing more reliable LLM applications across various domains. The code is available at https://github.com/afastowski/knowledge_drift.

Original languageEnglish
Title of host publicationDiscovering Drift Phenomena in Evolving Landscapes - 1st International Workshop, DELTA 2024, Proceedings
EditorsMarco Piangerelli, Bardh Prenkaj, Ylenia Rotalinti, Ananya Joshi, Giovanni Stilo
PublisherSpringer Science and Business Media Deutschland GmbH
Pages74-85
Number of pages12
ISBN (Print)9783031823459
DOIs
StatePublished - 2025
Event1st International Workshop on Discovering Drift Phenomena in Evolving Landscapes, DELTA 2024 - Barcelona, Spain
Duration: 26 Aug 202426 Aug 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15013 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference1st International Workshop on Discovering Drift Phenomena in Evolving Landscapes, DELTA 2024
Country/TerritorySpain
CityBarcelona
Period26/08/2426/08/24

Keywords

  • Knowledge Drift
  • Large Language Models
  • Uncertainty

Fingerprint

Dive into the research topics of 'Understanding Knowledge Drift in LLMs Through Misinformation'. Together they form a unique fingerprint.

Cite this