TY - GEN
T1 - Structured Extraction of Terms and Conditions from German and English Online Shops
AU - Schamel, Tobias
AU - Braun, Daniel
AU - Matthes, Florian
N1 - Publisher Copyright:
© 2022 Association for Computational Linguistics.
PY - 2022
Y1 - 2022
N2 - The automated analysis of Terms and Conditions has gained attention in recent years, mainly due to its relevance to consumer protection. Well-structured data sets are the base for every analysis. While content extraction, in general, is a well-researched field and many open source libraries are available, our evaluation shows, that existing solutions cannot extract Terms and Conditions in sufficient quality, mainly because of their special structure. In this paper, we present an approach to extract the content and hierarchy of Terms and Conditions from German and English online shops. Our evaluation shows, that the approach outperforms the current state of the art. A python implementation of the approach is made available under an open license.
AB - The automated analysis of Terms and Conditions has gained attention in recent years, mainly due to its relevance to consumer protection. Well-structured data sets are the base for every analysis. While content extraction, in general, is a well-researched field and many open source libraries are available, our evaluation shows, that existing solutions cannot extract Terms and Conditions in sufficient quality, mainly because of their special structure. In this paper, we present an approach to extract the content and hierarchy of Terms and Conditions from German and English online shops. Our evaluation shows, that the approach outperforms the current state of the art. A python implementation of the approach is made available under an open license.
UR - http://www.scopus.com/inward/record.url?scp=85137728665&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85137728665
T3 - ECNLP 2022 - 5th Workshop on e-Commerce and NLP, Proceedings of the Workshop
SP - 181
EP - 190
BT - ECNLP 2022 - 5th Workshop on e-Commerce and NLP, Proceedings of the Workshop
A2 - Malmasi, Shervin
A2 - Rokhlenko, Oleg
A2 - Ueffing, Nicola
A2 - Guy, Ido
A2 - Agichtein, Eugene
A2 - Kallumadi, Surya
PB - Association for Computational Linguistics (ACL)
T2 - 5th Workshop on e-Commerce and NLP, ECNLP 2022
Y2 - 26 May 2022
ER -