TY - GEN
T1 - Information mining from public mailing lists
T2 - 4th International Conference on Internet Science, INSCI 2017
AU - Niedermayer, Heiko
AU - Schwellnus, Nikolai
AU - Raumer, Daniel
AU - Cordeiro, Edwin
AU - Carle, Georg
N1 - Publisher Copyright: © 2017, Springer International Publishing AG.
PY - 2017
Y1 - 2017
AB - Public mailing lists, such as those used by the IETF for Internet standardization, can serve as a large real-world data set for analyzing social interactions. However, volatile participation and the use of mail addresses as changeable pseudonyms pose a challenge for mining these data. We conducted a case study of mailing list analysis in which we address the consistent identification of a person with all of her contributions, so that the result can be used as panel data. Based on the postings of individuals on different mailing lists, correlations between standardization areas in the IETF groups can be computed, and isolated and meshed standardization areas can be identified.
KW - Clustering
KW - Identity deduplication
KW - Mailing lists
KW - Standardization
UR - http://www.scopus.com/inward/record.url?scp=85033597192&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-70284-1_23
DO - 10.1007/978-3-319-70284-1_23
M3 - Conference contribution
AN - SCOPUS:85033597192
SN - 9783319702834
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 301
EP - 309
BT - Internet Science - 4th International Conference, INSCI 2017, Proceedings
A2 - McMillan, Donald
A2 - Carle, Georg
A2 - Passani, Antonella
A2 - Cave, Jonathan
A2 - Kompatsiaris, Ioannis
A2 - Satsiou, Anna
A2 - Kontopoulos, Efstratios
A2 - Diplaris, Sotiris
PB - Springer Verlag
Y2 - 22 November 2017 through 24 November 2017
ER -