Structured Extraction of Terms and Conditions from German and English Online Shops

Tobias Schamel, Daniel Braun, Florian Matthes

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citation

Abstract

The automated analysis of Terms and Conditions has gained attention in recent years, mainly due to its relevance to consumer protection. Well-structured data sets are the basis of every such analysis. While content extraction in general is a well-researched field and many open-source libraries are available, our evaluation shows that existing solutions cannot extract Terms and Conditions with sufficient quality, mainly because of their special structure. In this paper, we present an approach to extract the content and hierarchy of Terms and Conditions from German and English online shops. Our evaluation shows that the approach outperforms the current state of the art. A Python implementation of the approach is made available under an open license.

Original language: English
Title of host publication: ECNLP 2022 - 5th Workshop on e-Commerce and NLP, Proceedings of the Workshop
Editors: Shervin Malmasi, Oleg Rokhlenko, Nicola Ueffing, Ido Guy, Eugene Agichtein, Surya Kallumadi
Publisher: Association for Computational Linguistics (ACL)
Pages: 181-190
Number of pages: 10
ISBN (Electronic): 9781955917353
State: Published - 2022
Event: 5th Workshop on e-Commerce and NLP, ECNLP 2022 - Dublin, Ireland
Duration: 26 May 2022 → …

Publication series

Name: ECNLP 2022 - 5th Workshop on e-Commerce and NLP, Proceedings of the Workshop

Conference

Conference: 5th Workshop on e-Commerce and NLP, ECNLP 2022
Country/Territory: Ireland
City: Dublin
Period: 26/05/22 → …
