Lexical Substitution is not Synonym Substitution: On the Importance of Producing Contextually Relevant Word Substitutes

Research output: Contribution to journal › Conference article › peer-review

Abstract

Lexical Substitution is the task of replacing a single word in a sentence with a similar one. Ideally, the substitute is not merely synonymous with the target word but also fits its surrounding context while preserving the sentence’s grammatical structure. Recent advances in Lexical Substitution have leveraged the masked token prediction task of Pre-trained Language Models to generate replacements for a given word in a sentence. Building on this technique, we introduce CONCAT, a simple augmented approach that uses the original sentence to bolster the contextual information sent to the model. Compared to existing approaches, it proves very effective in guiding the model to make contextually relevant predictions for the target word. Our study includes a quantitative evaluation, measured via sentence similarity and task performance. In addition, we conduct a qualitative human analysis to validate that users prefer the substitutions proposed by our method over those of previous methods. Finally, we test our approach on the prevailing benchmark for Lexical Substitution, CoInCo, revealing potential pitfalls of the benchmark. These insights serve as the foundation for a critical discussion of how Lexical Substitution is evaluated.
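
The abstract does not spell out how CONCAT builds its model input. One plausible reading, suggested by the name, is that the original sentence is concatenated with a copy in which the target word is masked, so the model still sees the target while predicting the masked slot. The sketch below illustrates that reading only. The function names, the `[MASK]` and `[SEP]` tokens, and the candidate filtering step are all assumptions made for illustration, not details taken from the paper.

```python
def build_concat_input(tokens, target_index, mask_token="[MASK]", sep_token="[SEP]"):
    """Join the original sentence with a copy whose target word is masked.

    Hypothetical helper: the token names and the ordering (original first,
    masked copy second) are assumptions, not the paper's specification.
    """
    if not 0 <= target_index < len(tokens):
        raise IndexError("target_index is outside the sentence")
    original = " ".join(tokens)
    masked = " ".join(
        mask_token if i == target_index else tok
        for i, tok in enumerate(tokens)
    )
    return f"{original} {sep_token} {masked}"


def filter_candidates(predictions, target_word, top_k=3):
    """Drop the target word itself and keep the top_k highest-scoring words.

    `predictions` is a list of (word, score) pairs, such as a masked
    language model might return for the masked position (assumed format).
    """
    kept = [(w, s) for w, s in predictions if w.lower() != target_word.lower()]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return [w for w, _ in kept[:top_k]]


tokens = ["The", "bank", "approved", "the", "loan"]
model_input = build_concat_input(tokens, target_index=1)

# Made-up scores, standing in for real model output.
fake_predictions = [("bank", 0.61), ("lender", 0.17), ("committee", 0.05), ("board", 0.08)]
substitutes = filter_candidates(fake_predictions, target_word="bank")
```

In practice the string in `model_input` would be passed to a masked language model, and the scores would come from its prediction for the masked position rather than from hand-written values.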

Original language: English
Pages (from-to): 1481-1488
Number of pages: 8
Journal: International Conference on Agents and Artificial Intelligence
Volume: 3
State: Published - 2025
Event: 17th International Conference on Agents and Artificial Intelligence, ICAART 2025 - Porto, Portugal
Duration: 23 Feb 2025 - 25 Feb 2025

Keywords

  • Language Models
  • Lexical Semantics
  • Lexical Substitution
  • Natural Language Processing
