TY - GEN
T1 - A framework for evaluation and exploration of clustering algorithms in subspaces of high dimensional databases
AU - Müller, Emmanuel
AU - Assent, Ira
AU - Günnemann, Stephan
AU - Gerwert, Patrick
AU - Hannen, Matthias
AU - Jansen, Timm
AU - Seidl, Thomas
N1 - Publisher Copyright: © 2011 Gesellschaft für Informatik (GI). All rights reserved.
PY - 2011
Y1 - 2011
N2 - In high dimensional databases, traditional full space clustering methods are known to fail due to the curse of dimensionality. Thus, in recent years, subspace clustering and projected clustering approaches were proposed for clustering in high dimensional spaces. As the area is rather young, few comparative studies on the advantages and disadvantages of the different algorithms exist. Part of the underlying problem is the lack of available open source implementations that could be used by researchers to understand, compare, and extend subspace and projected clustering algorithms. In this work, we discuss the requirements for open source evaluation software and propose the OpenSubspace framework that meets these requirements. OpenSubspace integrates state-of-the-art performance measures and visualization techniques to foster clustering research in high dimensional databases.
UR - http://www.scopus.com/inward/record.url?scp=80052985157&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:80052985157
T3 - Lecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft für Informatik (GI)
SP - 347
EP - 366
BT - Datenbanksysteme für Business, Technologie und Web, BTW 2011 - 14. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2011 - Proceedings
A2 - Härder, Theo
A2 - Lehner, Wolfgang
A2 - Mitschang, Bernhard
A2 - Schöning, Harald
A2 - Schwarz, Holger
PB - Gesellschaft für Informatik (GI)
T2 - 2011 Datenbanksysteme für Business, Technologie und Web, BTW 2011 - 14. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2011 - Database Systems for Business, Technology and Web, BTW 2011 - 14th Conference of the GI Department "Databases and Information Systems", DBIS 2011
Y2 - 2 March 2011 through 4 March 2011
ER -