TY - GEN
T1 - Metrics for the evaluation of data quality of signal data in industrial processes
AU - Kirchen, Iris
AU - Schutz, Daniel
AU - Folmer, Jens
AU - Vogel-Heuser, Birgit
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/11/10
Y1 - 2017/11/10
N2 - For Industry 4.0 technology as well as Cyber-Physical Production Systems analysis of data gained more and more importance. But the disparity of data, often make the efficient use of data mining methods difficult due to data with poor quality. To evaluate data quality and further adopt appropriate measures, the proposal develops a data quality model fitted to the specific properties of signal data of industrial processes. Relevant data quality characteristics are identified and a classification of these characteristics is conducted to ascertain important factors. Furthermore, a measurement for the characteristic Completeness, aggregated of its sub-dimensions, is defined. The data quality model is applied to two different use cases showing its effectiveness and validity of the defined measures. The efficient use of real industrial signal data e.g. appropriateness of the data for the specific data mining purpose, is supported by a comprehensive measurement for data quality and the detailed discussion of the influencing factors.
AB - For Industry 4.0 technology as well as Cyber-Physical Production Systems analysis of data gained more and more importance. But the disparity of data, often make the efficient use of data mining methods difficult due to data with poor quality. To evaluate data quality and further adopt appropriate measures, the proposal develops a data quality model fitted to the specific properties of signal data of industrial processes. Relevant data quality characteristics are identified and a classification of these characteristics is conducted to ascertain important factors. Furthermore, a measurement for the characteristic Completeness, aggregated of its sub-dimensions, is defined. The data quality model is applied to two different use cases showing its effectiveness and validity of the defined measures. The efficient use of real industrial signal data e.g. appropriateness of the data for the specific data mining purpose, is supported by a comprehensive measurement for data quality and the detailed discussion of the influencing factors.
UR - http://www.scopus.com/inward/record.url?scp=85041171788&partnerID=8YFLogxK
U2 - 10.1109/INDIN.2017.8104878
DO - 10.1109/INDIN.2017.8104878
M3 - Conference contribution
AN - SCOPUS:85041171788
T3 - Proceedings - 2017 IEEE 15th International Conference on Industrial Informatics, INDIN 2017
SP - 819
EP - 826
BT - Proceedings - 2017 IEEE 15th International Conference on Industrial Informatics, INDIN 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 15th IEEE International Conference on Industrial Informatics, INDIN 2017
Y2 - 24 July 2017 through 26 July 2017
ER -