TY - GEN
T1 - Enhancing Answer Attribution for Faithful Text Generation with Large Language Models
AU - Vladika, Juraj
AU - Mülln, Luca
AU - Matthes, Florian
N1 - Publisher Copyright:
© 2024 by SCITEPRESS – Science and Technology Publications, Lda.
PY - 2024
Y1 - 2024
N2 - The increasing popularity of Large Language Models (LLMs) in recent years has changed the way users interact with and pose questions to AI-based conversational systems. An essential aspect for increasing the trustworthiness of generated LLM answers is the ability to trace the individual claims from responses back to relevant sources that support them, the process known as answer attribution. While recent work has started exploring the task of answer attribution in LLMs, some challenges still remain. In this work, we first perform a case study analyzing the effectiveness of existing answer attribution methods, with a focus on subtasks of answer segmentation and evidence retrieval. Based on the observed shortcomings, we propose new methods for producing more independent and contextualized claims for better retrieval and attribution. The new methods are evaluated and shown to improve the performance of answer attribution components. We end with a discussion and outline of future directions for the task.
AB - The increasing popularity of Large Language Models (LLMs) in recent years has changed the way users interact with and pose questions to AI-based conversational systems. An essential aspect for increasing the trustworthiness of generated LLM answers is the ability to trace the individual claims from responses back to relevant sources that support them, the process known as answer attribution. While recent work has started exploring the task of answer attribution in LLMs, some challenges still remain. In this work, we first perform a case study analyzing the effectiveness of existing answer attribution methods, with a focus on subtasks of answer segmentation and evidence retrieval. Based on the observed shortcomings, we propose new methods for producing more independent and contextualized claims for better retrieval and attribution. The new methods are evaluated and shown to improve the performance of answer attribution components. We end with a discussion and outline of future directions for the task.
KW - Answer Attribution
KW - Information Retrieval
KW - Interpretability
KW - Large Language Models
KW - Natural Language Processing
KW - Question Answering
KW - Text Generation
UR - http://www.scopus.com/inward/record.url?scp=85215303129&partnerID=8YFLogxK
U2 - 10.5220/0013066600003838
DO - 10.5220/0013066600003838
M3 - Conference contribution
AN - SCOPUS:85215303129
T3 - International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K - Proceedings
SP - 147
EP - 158
BT - 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of IC3K 2024 - Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management
A2 - Coenen, Frans
A2 - Fred, Ana
A2 - Bernardino, Jorge
PB - Science and Technology Publications, Lda
T2 - 16th International Conference on Knowledge Discovery and Information Retrieval, KDIR 2024 as part of 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, IC3K 2024
Y2 - 17 November 2024 through 19 November 2024
ER -