TY - JOUR
T1 - Data transformation technique to improve the outlier detection power of grubbs' test for data expected to follow linear relation
AU - Adikaram, K. K.L.B.
AU - Hussein, M. A.
AU - Effenberger, M.
AU - Becker, T.
N1 - Publisher Copyright:
© 2015 K. K. L. B. Adikaram et al.
PY - 2015/1/14
Y1 - 2015/1/14
N2 - Grubbs test (extreme studentized deviate test, maximum normed residual test) is used in various fields to identify outliers in a data set, which are ranked in the order of x1≤x2≤x3≤≤xn (i=1,2,3,.,n). However, ranking of data eliminates the actual sequence of a data series, which is an important factor for determining outliers in some cases (e.g., time series). Thus in such a data set, Grubbs test will not identify outliers correctly. This paper introduces a technique for transforming data from sequence bound linear form to sequence unbound form (y=c). Applying Grubbs test to the new transformed data set detects outliers more accurately. In addition, the new technique improves the outlier detection capability of Grubbs test. Results show that, Grubbs test was capable of identifing outliers at significance level 0.01 after transformation, while it was unable to identify those prior to transforming at significance level 0.05.
AB - Grubbs test (extreme studentized deviate test, maximum normed residual test) is used in various fields to identify outliers in a data set, which are ranked in the order of x1≤x2≤x3≤≤xn (i=1,2,3,.,n). However, ranking of data eliminates the actual sequence of a data series, which is an important factor for determining outliers in some cases (e.g., time series). Thus in such a data set, Grubbs test will not identify outliers correctly. This paper introduces a technique for transforming data from sequence bound linear form to sequence unbound form (y=c). Applying Grubbs test to the new transformed data set detects outliers more accurately. In addition, the new technique improves the outlier detection capability of Grubbs test. Results show that, Grubbs test was capable of identifing outliers at significance level 0.01 after transformation, while it was unable to identify those prior to transforming at significance level 0.05.
UR - http://www.scopus.com/inward/record.url?scp=84947650476&partnerID=8YFLogxK
U2 - 10.1155/2015/708948
DO - 10.1155/2015/708948
M3 - Article
AN - SCOPUS:84947650476
SN - 1110-757X
VL - 2015
JO - Journal of Applied Mathematics
JF - Journal of Applied Mathematics
M1 - 708948
ER -