TY - GEN
T1 - Explicit and latent topic representations of information spaces in social information retrieval
AU - Fuchs, Christoph
AU - Voigt, Cordt
AU - Baldizan, Oriana
AU - Groh, Georg
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016
Y1 - 2016
N2 - We evaluate the suitability of latent and explicit semantic spaces of documents for Information Retrieval (IR) tasks using a dataset obtained from the Q&A community Stackexchange. In addition, the ability of the latent semantic spaces to reconstruct human relevance judgments is explored. The latent semantic spaces are generated with Latent Dirichlet Allocation (LDA), while explicit semantic spaces are modeled using Explicit Semantic Analyis (ESA). In the first part of the experiment, a series of ad-hoc information retrieval tasks is performed, interpreting closeness in the semantic and explicit spaces as a criterion for relevance. In the second part, it is investigated whether the latent semantic representation allows to infer user defined quality assessments of answers. The findings suggest that the semantic spaces show a correlation between query and relevant information items, however, both algorithms are outperformed by a simple Vector Space Model using TF-IDF. In addition, no significant correlation between the user defined order of relevant answers to a question and the similarity-based order (using closeness in the latent semantic space as similarity function) could be demonstrated.
AB - We evaluate the suitability of latent and explicit semantic spaces of documents for Information Retrieval (IR) tasks using a dataset obtained from the Q&A community Stackexchange. In addition, the ability of the latent semantic spaces to reconstruct human relevance judgments is explored. The latent semantic spaces are generated with Latent Dirichlet Allocation (LDA), while explicit semantic spaces are modeled using Explicit Semantic Analyis (ESA). In the first part of the experiment, a series of ad-hoc information retrieval tasks is performed, interpreting closeness in the semantic and explicit spaces as a criterion for relevance. In the second part, it is investigated whether the latent semantic representation allows to infer user defined quality assessments of answers. The findings suggest that the semantic spaces show a correlation between query and relevant information items, however, both algorithms are outperformed by a simple Vector Space Model using TF-IDF. In addition, no significant correlation between the user defined order of relevant answers to a question and the similarity-based order (using closeness in the latent semantic space as similarity function) could be demonstrated.
KW - Social Information Retrieval
UR - http://www.scopus.com/inward/record.url?scp=85015160307&partnerID=8YFLogxK
U2 - 10.1109/ENIC.2016.023
DO - 10.1109/ENIC.2016.023
M3 - Conference contribution
AN - SCOPUS:85015160307
T3 - Proceedings - 2016 3rd European Network Intelligence Conference, ENIC 2016
SP - 106
EP - 112
BT - Proceedings - 2016 3rd European Network Intelligence Conference, ENIC 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd European Network Intelligence Conference, ENIC 2016
Y2 - 5 September 2016 through 7 September 2016
ER -