Skip to main navigation Skip to search Skip to main content

AspectCSE: Sentence Embeddings for Aspect-based Semantic Textual Similarity Using Contrastive Learning and Structured Knowledge

  • Technical University of Munich
  • DFKI

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Generic sentence embeddings provide a coarsegrained approximation of semantic textual similarity but ignore specific aspects that make texts similar. Conversely, aspect-based sentence embeddings provide similarities between texts based on certain predefined aspects. Thus, similarity predictions of texts are more targeted to specific requirements and more easily explainable. In this paper, we present AspectCSE, an approach for aspect-based contrastive learning of sentence embeddings. Results indicate that AspectCSE achieves an average improvement of 3.97% on information retrieval tasks across multiple aspects compared to the previous best results. We also propose using Wikidata knowledge graph properties to train models of multiaspect sentence embeddings in which multiple specific aspects are simultaneously considered during similarity predictions. We demonstrate that multi-aspect embeddings outperform single-aspect embeddings on aspect-specific information retrieval tasks. Finally, we examine the aspect-based sentence embedding space and demonstrate that embeddings of semantically similar aspect labels are often close, even without explicit similarity training between different aspect labels.

Original languageEnglish
Title of host publicationInternational Conference Recent Advances in Natural Language Processing, RANLP 2023
Subtitle of host publicationLarge Language Models for Natural Language Processing - Proceedings
EditorsGalia Angelova, Maria Kunilovskaya, Ruslan Mitkov
PublisherIncoma Ltd
Pages1054-1065
Number of pages12
ISBN (Electronic)9789544520922
DOIs
StatePublished - 2023
Event14th International Conference on Recent Advances in Natural Language Processing: Large Language Models for Natural Language Processing, RANLP 2023 - Varna, Bulgaria
Duration: 4 Sep 20236 Sep 2023

Publication series

NameInternational Conference Recent Advances in Natural Language Processing, RANLP
ISSN (Print)1313-8502

Conference

Conference14th International Conference on Recent Advances in Natural Language Processing: Large Language Models for Natural Language Processing, RANLP 2023
Country/TerritoryBulgaria
CityVarna
Period4/09/236/09/23

Fingerprint

Dive into the research topics of 'AspectCSE: Sentence Embeddings for Aspect-based Semantic Textual Similarity Using Contrastive Learning and Structured Knowledge'. Together they form a unique fingerprint.

Cite this