Subspace clustering for complex data

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Clustering is an established data mining technique for grouping objects based on their mutual similarity. Since in today's applications, however, usually many characteristics for each object are recorded, one cannot expect to find similar objects by considering all attributes together. In contrast, valuable clusters are hidden in subspace projections of the data. As a general solution to this problem, the paradigm of subspace clustering has been introduced, which aims at automatically determining for each group of objects a set of relevant attributes these objects are similar in. In this work, we introduce novel methods for effective subspace clustering on various types of complex data: vector data, imperfect data, and graph data. Our methods tackle major open challenges for clustering in subspace projections. We study the problem of redundancy in subspace clustering results and propose models whose solutions contain only non-redundant and, thus, valuable clusters. Since different subspace projections represent different views on the data, often several groupings of the objects are reasonable. Thus, we propose techniques that are not restricted to a single partitioning of the objects but that enable the detection of multiple clustering solutions.

Original languageEnglish
Title of host publicationDatenbanksysteme fur Business, Technologie und Web (BTW) 2013 - Proceedings
EditorsVolker Markl, Gregor Hackenbroich, Theo Harder, Gunter Saake, Bernhard Mitschang, Veit Koppen, Kai-Uwe Sattler
PublisherGesellschaft fur Informatik (GI)
Pages343-362
Number of pages20
ISBN (Electronic)9783885796084
StatePublished - 2013
Externally publishedYes
Event15. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2013 - 15th Conference of the GI Special Interest Group on Databases and Information Systems, DBIS 2013 - Magdeburg, Germany
Duration: 13 Mar 201315 Mar 2013

Publication series

NameLecture Notes in Informatics (LNI), Proceedings - Series of the Gesellschaft fur Informatik (GI)
VolumeP-214
ISSN (Print)1617-5468

Conference

Conference15. Fachtagung des GI-Fachbereichs "Datenbanken und Informationssysteme", DBIS 2013 - 15th Conference of the GI Special Interest Group on Databases and Information Systems, DBIS 2013
Country/TerritoryGermany
CityMagdeburg
Period13/03/1315/03/13

Fingerprint

Dive into the research topics of 'Subspace clustering for complex data'. Together they form a unique fingerprint.

Cite this