A framework for evaluation and exploration of clustering algorithms in subspaces of high dimensional databases

Emmanuel Müller, Ira Assent, Stephan Günnemann, Patrick Gerwert, Matthias Hannen, Timm Jansen, Thomas Seidl

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

In high dimensional databases, traditional full space clustering methods are known to fail due to the curse of dimensionality. Thus, in recent years, subspace clustering and projected clustering approaches were proposed for clustering in high dimensional spaces. As the area is rather young, few comparative studies on the advantages and disadvantages of the different algorithms exist. Part of the underlying problem is the lack of available open source implementations that could be used by researchers to understand, compare, and extend subspace and projected clustering algorithms. In this work, we discuss the requirements for open source evaluation software and propose the OpenSubspace framework that meets these requirements. OpenSubspace integrates state-of-the-art performance measures and visualization techniques to foster clustering research in high dimensional databases.

Original languageEnglish
Title of host publicationDatenbanksysteme fur Business, Technologie und Web, BTW 2011 - 14. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2011 - Proceedings
EditorsTheo Harder, Wolfgang Lehner, Bernhard Mitschang, Harald Schoning, Holger Schwarz
PublisherGesellschaft fur Informatik (GI)
Pages347-366
Number of pages20
ISBN (Electronic)9783885792741
StatePublished - 2011
Externally publishedYes
Event2011 Datenbanksysteme fur Business, Technologie und Web, BTW 2011 - 14. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2011 - Database Systems for Business, Technology and Web, BTW 2011 - 14th Conference of the GI Department "Databases and Information Systems", DBIS 2011 - Kaiserslautern, Germany
Duration: 2 Mar 20114 Mar 2011

Publication series

NameLecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft fur Informatik (GI)
Volume180
ISSN (Print)1617-5468
ISSN (Electronic)2944-7682

Conference

Conference2011 Datenbanksysteme fur Business, Technologie und Web, BTW 2011 - 14. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2011 - Database Systems for Business, Technology and Web, BTW 2011 - 14th Conference of the GI Department "Databases and Information Systems", DBIS 2011
Country/TerritoryGermany
CityKaiserslautern
Period2/03/114/03/11

Fingerprint

Dive into the research topics of 'A framework for evaluation and exploration of clustering algorithms in subspaces of high dimensional databases'. Together they form a unique fingerprint.

Cite this