TY - GEN
T1 - Subspace clustering for complex data
AU - Günnemann, Stephan
N1 - Publisher Copyright:
© Gesellschaft für Informatik, Bonn 2013.
PY - 2013
Y1 - 2013
N2 - Clustering is an established data mining technique for grouping objects based on their mutual similarity. Since in today's applications, however, usually many characteristics for each object are recorded, one cannot expect to find similar objects by considering all attributes together. In contrast, valuable clusters are hidden in subspace projections of the data. As a general solution to this problem, the paradigm of subspace clustering has been introduced, which aims at automatically determining for each group of objects a set of relevant attributes these objects are similar in. In this work, we introduce novel methods for effective subspace clustering on various types of complex data: vector data, imperfect data, and graph data. Our methods tackle major open challenges for clustering in subspace projections. We study the problem of redundancy in subspace clustering results and propose models whose solutions contain only non-redundant and, thus, valuable clusters. Since different subspace projections represent different views on the data, often several groupings of the objects are reasonable. Thus, we propose techniques that are not restricted to a single partitioning of the objects but that enable the detection of multiple clustering solutions.
AB - Clustering is an established data mining technique for grouping objects based on their mutual similarity. Since in today's applications, however, usually many characteristics for each object are recorded, one cannot expect to find similar objects by considering all attributes together. In contrast, valuable clusters are hidden in subspace projections of the data. As a general solution to this problem, the paradigm of subspace clustering has been introduced, which aims at automatically determining for each group of objects a set of relevant attributes these objects are similar in. In this work, we introduce novel methods for effective subspace clustering on various types of complex data: vector data, imperfect data, and graph data. Our methods tackle major open challenges for clustering in subspace projections. We study the problem of redundancy in subspace clustering results and propose models whose solutions contain only non-redundant and, thus, valuable clusters. Since different subspace projections represent different views on the data, often several groupings of the objects are reasonable. Thus, we propose techniques that are not restricted to a single partitioning of the objects but that enable the detection of multiple clustering solutions.
UR - https://www.scopus.com/pages/publications/84922759357
M3 - Conference contribution
AN - SCOPUS:84922759357
T3 - Lecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft fur Informatik (GI)
SP - 343
EP - 362
BT - Datenbanksysteme fur Business, Technologie und Web (BTW) 2013 - Proceedings
A2 - Markl, Volker
A2 - Saake, Gunter
A2 - Sattler, Kai-Uwe
A2 - Hackenbroich, Gregor
A2 - Mitschang, Bernhard
A2 - Harder, Theo
A2 - Koppen, Veit
PB - Gesellschaft fur Informatik (GI)
T2 - 15. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2013 - 15th Conference of the GI Special Interest Group on Databases and Information Systems, DBIS 2013
Y2 - 13 March 2013 through 15 March 2013
ER -