TY - GEN
T1 - Nesting the earth mover's distance for effective cluster tracing
AU - Kremer, Hardy
AU - Günnemann, Stephan
AU - Wollwage, Simon
AU - Seidl, Thomas
PY - 2013
Y1 - 2013
N2 - Cluster tracing algorithms are used to mine temporal evolutions of clusters. Generally, clusters represent groups of objects with similar values. In a temporal context like tracing, similar values correspond to similar behavior in one snapshot in time. Recently, tracing based on object-value-similarity was introduced. In this new paradigm, the decision whether two clusters are considered similar is based on the similarity of the clusters' object values. Existing approaches of this paradigm, however, have a severe limitation. The mapping of clusters between snapshots in time is performed pairwise, i.e. global connections between a temporal snapshot's clusters are ignored; thus, impacts of other clusters that may affect the mapping are not considered and incorrect cluster tracings may be obtained. In this vision paper, we present our ongoing work on a novel approach for cluster tracing that applies the object-valuesimilarity paradigm and is based on the well-known Earth Mover's Distance (EMD). The EMD enables a cluster tracing that uses global mapping: in the mapping process, all clusters of compared snapshots are considered simultaneously. A special property of our approach is that we nest the EMD: we use it as a ground distance for itself to achieve most effective value-based cluster tracing.
AB - Cluster tracing algorithms are used to mine temporal evolutions of clusters. Generally, clusters represent groups of objects with similar values. In a temporal context like tracing, similar values correspond to similar behavior in one snapshot in time. Recently, tracing based on object-value-similarity was introduced. In this new paradigm, the decision whether two clusters are considered similar is based on the similarity of the clusters' object values. Existing approaches of this paradigm, however, have a severe limitation. The mapping of clusters between snapshots in time is performed pairwise, i.e. global connections between a temporal snapshot's clusters are ignored; thus, impacts of other clusters that may affect the mapping are not considered and incorrect cluster tracings may be obtained. In this vision paper, we present our ongoing work on a novel approach for cluster tracing that applies the object-valuesimilarity paradigm and is based on the well-known Earth Mover's Distance (EMD). The EMD enables a cluster tracing that uses global mapping: in the mapping process, all clusters of compared snapshots are considered simultaneously. A special property of our approach is that we nest the EMD: we use it as a ground distance for itself to achieve most effective value-based cluster tracing.
KW - Cluster evolution
KW - Cluster mapping
KW - Cluster tracing
UR - http://www.scopus.com/inward/record.url?scp=84882945556&partnerID=8YFLogxK
U2 - 10.1145/2484838.2484881
DO - 10.1145/2484838.2484881
M3 - Conference contribution
AN - SCOPUS:84882945556
SN - 9781450319218
T3 - ACM International Conference Proceeding Series
BT - SSDBM 2013 - Proceedings of the 25th International Conference on Scientific and Statistical Database Management
T2 - 25th International Conference on Scientific and Statistical Database Management, SSDBM 2013
Y2 - 29 July 2013 through 31 July 2013
ER -