Continuous Learning Graphical Knowledge Unit for Cluster Identification in High Density Data Sets

K. K.L.B. Adikaram, Mohamed A. Hussein, Mathias Effenberger, Thomas Becker

Publikation: Beitrag in FachzeitschriftArtikelBegutachtung

2 Zitate (Scopus)

Abstract

Big data are visually cluttered by overlapping data points. Rather than removing, reducing or reformulating overlap, we propose a simple, effective and powerful technique for density cluster generation and visualization, where point marker (graphical symbol of a data point) overlap is exploited in an additive fashion in order to obtain bitmap data summaries in which clusters can be identified visually, aided by automatically generated contour lines. In the proposed method, the plotting area is a bitmap and the marker is a shape of more than one pixel. As the markers overlap, the red, green and blue (RGB) colour values of pixels in the shared region are added. Thus, a pixel of a 24-bit RGB bitmap can code up to 224 (over 1.6 million) overlaps. A higher number of overlaps at the same location makes the colour of this area identical, which can be identified by the naked eye. A bitmap is a matrix of colour values that can be represented as integers. The proposed method updates this matrix while adding new points. Thus, this matrix can be considered as an up-to-time knowledge unit of processed data. Results show cluster generation, cluster identification, missing and out-of-range data visualization, and outlier detection capability of the newly proposed method.

OriginalspracheEnglisch
Aufsatznummer152
FachzeitschriftSymmetry
Jahrgang8
Ausgabenummer12
DOIs
PublikationsstatusVeröffentlicht - Dez. 2016

Fingerprint

Untersuchen Sie die Forschungsthemen von „Continuous Learning Graphical Knowledge Unit for Cluster Identification in High Density Data Sets“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren