TY - GEN
T1 - Text and Team
T2 - 26th ACM International Conference on Evaluation and Assessment in Software Engineering, EASE 2022
AU - Graf-Vlachy, Lorenz
AU - Graziotin, Daniel
AU - Wagner, Stefan
N1 - Publisher Copyright:
© 2022 ACM.
PY - 2022/6/13
Y1 - 2022/6/13
N2 - Context: Citations are a key measure of scientific performance in most fields, including software engineering. However, there is limited research that studies which characteristics of articles' metadata (title, abstract, keywords, and author list) are driving citations in this field. Objective: In this study, we propose a simple theoretical model for how citations come to be with respect to article metadata, we hypothesize theoretical linkages between metadata characteristics and citations of articles, and we empirically test these hypotheses. Method: We use multiple regression analyses to examine a data set comprising the titles, abstracts, keywords, and authors of 16,131 software engineering articles published between 1990 and 2020 in 20 highly influential software engineering venues. Results: We find that number of authors, number of keywords, number of question marks and dividers in the title, number of acronyms, abstract length, abstract propositional idea density, and corresponding authors in the core Anglosphere are significantly related to citations. Conclusion: Various characteristics of articles' metadata are linked to the frequency with which the corresponding articles are cited. These results partially confirm and partially go counter to prior findings in software engineering and other disciplines.
AB - Context: Citations are a key measure of scientific performance in most fields, including software engineering. However, there is limited research that studies which characteristics of articles' metadata (title, abstract, keywords, and author list) are driving citations in this field. Objective: In this study, we propose a simple theoretical model for how citations come to be with respect to article metadata, we hypothesize theoretical linkages between metadata characteristics and citations of articles, and we empirically test these hypotheses. Method: We use multiple regression analyses to examine a data set comprising the titles, abstracts, keywords, and authors of 16,131 software engineering articles published between 1990 and 2020 in 20 highly influential software engineering venues. Results: We find that number of authors, number of keywords, number of question marks and dividers in the title, number of acronyms, abstract length, abstract propositional idea density, and corresponding authors in the core Anglosphere are significantly related to citations. Conclusion: Various characteristics of articles' metadata are linked to the frequency with which the corresponding articles are cited. These results partially confirm and partially go counter to prior findings in software engineering and other disciplines.
KW - abstract
KW - author
KW - citations
KW - keyword
KW - metadata
KW - title
UR - http://www.scopus.com/inward/record.url?scp=85132440997&partnerID=8YFLogxK
U2 - 10.1145/3530019.3530022
DO - 10.1145/3530019.3530022
M3 - Conference contribution
AN - SCOPUS:85132440997
T3 - ACM International Conference Proceeding Series
SP - 20
EP - 29
BT - Proceedings of the ACM International Conference on Evaluation and Assessment in Software Engineering, EASE 2022
PB - Association for Computing Machinery
Y2 - 13 June 2022 through 15 June 2022
ER -