TY - JOUR
T1 - Multi-Variable, multi-layer graphical knowledge unit for storing and representing density clusters of multi-dimensional big data
AU - Adikaram, K. K.L.B.
AU - Hussein, Mohamed A.
AU - Effenberger, Mathias
AU - Becker, Thomas
N1 - Publisher Copyright:
© 2016 by the authors.
PY - 2016
Y1 - 2016
N2 - A multi-variable visualization technique on a 2D bitmap for big data is introduced. If A andB are two data points that are represented using two similar shapes with m pixels, where each shapeis colored with RGB color of (0, 0, k), when A ∩ B ≠ Φ, adding the color of A ∩ B gives higher color as(0, 0, 2k) and the highlight as a high density cluster, where RGB stands for Red, Green, Blue and k isthe blue color. This is the hypothesis behind the single variable graphical knowledge unit (GKU),which uses the entire bit range of a pixel for a single variable. Instead, the available bit range of apixel is split, and a pixel can be used for representing multiple variables (multi-variables). However,this will limit the bit block for single variables and limit the amount of overlapping. Using the samesize k (>1) bitmaps (multi-layers) will increase the number of bits per variable (BPV), where each (x, y)of an individual layer represents the same data point. Then, one pixel in a four-layer GKU is capableof showing more than four billion overlapping ones when BPV = 8 bits (2(BPV × number of layers)) Then,the 32-bit pixel format allows the representation of a maximum of up to four dependent variablesagainst one independent variable. Then, a four-layer GKU of w width and h height has the capacityof representing a maximum of (2(BPV × number of layers)) × m × w × h overlapping occurrences.
AB - A multi-variable visualization technique on a 2D bitmap for big data is introduced. If A andB are two data points that are represented using two similar shapes with m pixels, where each shapeis colored with RGB color of (0, 0, k), when A ∩ B ≠ Φ, adding the color of A ∩ B gives higher color as(0, 0, 2k) and the highlight as a high density cluster, where RGB stands for Red, Green, Blue and k isthe blue color. This is the hypothesis behind the single variable graphical knowledge unit (GKU),which uses the entire bit range of a pixel for a single variable. Instead, the available bit range of apixel is split, and a pixel can be used for representing multiple variables (multi-variables). However,this will limit the bit block for single variables and limit the amount of overlapping. Using the samesize k (>1) bitmaps (multi-layers) will increase the number of bits per variable (BPV), where each (x, y)of an individual layer represents the same data point. Then, one pixel in a four-layer GKU is capableof showing more than four billion overlapping ones when BPV = 8 bits (2(BPV × number of layers)) Then,the 32-bit pixel format allows the representation of a maximum of up to four dependent variablesagainst one independent variable. Then, a four-layer GKU of w width and h height has the capacityof representing a maximum of (2(BPV × number of layers)) × m × w × h overlapping occurrences.
KW - Big data
KW - Cluster identification
KW - Continuous learning
KW - Knowledge representation
UR - http://www.scopus.com/inward/record.url?scp=84973560617&partnerID=8YFLogxK
U2 - 10.3390/app6040096
DO - 10.3390/app6040096
M3 - Article
AN - SCOPUS:84973560617
SN - 2076-3417
VL - 6
JO - Applied Sciences (Switzerland)
JF - Applied Sciences (Switzerland)
IS - 4
M1 - 96
ER -