TY - GEN
T1 - Large Language Models for the Analysis of Project Proposals
AU - Tsangko, Iosif
AU - Triantafyllopoulos, Andreas
AU - Kyriakidis, Evangelos
AU - Margetis, George
AU - Schuller, Björn W.
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025
Y1 - 2025
N2 - We introduce a framework that integrates traditional topic modeling methods-Latent Dirichlet Allocation (LDA) and BERTopic- with Large Language Models (LLMs) to automatically identify topics featured in project proposals for the cultural heritage (CH) domain. Applied to a dataset of 1, 757 English project proposals aimed at protecting and promoting CH in Africa, our approach begins by extracting initial topics using LDA and BERTopic. These topics are further refined by LLaMA3, generating precise and semantically meaningful categories that incorporate domain expert-curated labels to ensure contextual relevance. The consistency of assigned labels is evaluated using automatic classification. Additionally, we explore the role of linguistic features, such as sentence complexity, sentiment analysis, and gendered language, as predictors of proposal success. Results highlight the potential of combining traditional topic modeling with LLMs to uncover hidden insights into funding allocation patterns, aiming to enhance the equitable distribution of resources in CH projects.
AB - We introduce a framework that integrates traditional topic modeling methods-Latent Dirichlet Allocation (LDA) and BERTopic- with Large Language Models (LLMs) to automatically identify topics featured in project proposals for the cultural heritage (CH) domain. Applied to a dataset of 1, 757 English project proposals aimed at protecting and promoting CH in Africa, our approach begins by extracting initial topics using LDA and BERTopic. These topics are further refined by LLaMA3, generating precise and semantically meaningful categories that incorporate domain expert-curated labels to ensure contextual relevance. The consistency of assigned labels is evaluated using automatic classification. Additionally, we explore the role of linguistic features, such as sentence complexity, sentiment analysis, and gendered language, as predictors of proposal success. Results highlight the potential of combining traditional topic modeling with LLMs to uncover hidden insights into funding allocation patterns, aiming to enhance the equitable distribution of resources in CH projects.
KW - Cultural Heritage
KW - Funding Proposals
KW - Large Language Models
KW - Linguistic Analysis
KW - Topic Modeling
UR - https://www.scopus.com/pages/publications/105007669009
U2 - 10.1007/978-3-031-93415-5_24
DO - 10.1007/978-3-031-93415-5_24
M3 - Conference contribution
AN - SCOPUS:105007669009
SN - 9783031934148
T3 - Lecture Notes in Computer Science
SP - 408
EP - 420
BT - Artificial Intelligence in HCI - 6th International Conference, AI-HCI 2025, Held as Part of the 27th HCI International Conference, HCII 2025, Proceedings
A2 - Degen, Helmut
A2 - Ntoa, Stavroula
PB - Springer Science and Business Media Deutschland GmbH
T2 - 6th International Conference on Artificial Intelligence in HCI, AI-HCI 2025, held as part of the 27th HCI International Conference, HCII 2025
Y2 - 22 June 2025 through 27 June 2025
ER -