Simpler becomes Harder: Do LLMs Exhibit a Coherent Behavior on Simplified Corpora?

Miriam Anschütz, Edoardo Mosca, Georg Groh

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Text simplification seeks to improve readability while retaining the original content and meaning. Our study investigates whether pre-trained classifiers also maintain such coherence by comparing their predictions on both original and simplified inputs. We conduct experiments using 11 pre-trained models, including BERT and OpenAI’s GPT 3.5, across six datasets spanning three languages. Additionally, we conduct a detailed analysis of the correlation between prediction change rates and simplification types/strengths. Our findings reveal alarming inconsistencies across all languages and models. If not promptly addressed, simplified inputs can be easily exploited to craft zero-iteration model-agnostic adversarial attacks with success rates of up to 50%.

Original languageEnglish
Title of host publicationDeTermIt! Evaluating Text Difficulty in a Multilingual Context, DeTermIt! 2024 at LREC-COLING 2024 - Workshop Proceedings
EditorsGiorgio Maria Di Nunzio, Federica Vezzani, Liana Ermakova, Hosein Azarbonyad, Jaap Kamps
PublisherEuropean Language Resources Association (ELRA)
Pages185-195
Number of pages11
ISBN (Electronic)9782493814159
StatePublished - 2024
Event1st DeTermIt! Evaluating Text Difficulty in a Multilingual Context, DeTermIt! 2024 - Torino, Italy
Duration: 21 May 2024 → …

Publication series

NameDeTermIt! Evaluating Text Difficulty in a Multilingual Context, DeTermIt! 2024 at LREC-COLING 2024 - Workshop Proceedings

Conference

Conference1st DeTermIt! Evaluating Text Difficulty in a Multilingual Context, DeTermIt! 2024
Country/TerritoryItaly
CityTorino
Period21/05/24 → …

Keywords

  • model consistency
  • model robustness
  • text simplification

Fingerprint

Dive into the research topics of 'Simpler becomes Harder: Do LLMs Exhibit a Coherent Behavior on Simplified Corpora?'. Together they form a unique fingerprint.

Cite this