TY - JOUR
T1 - Prioritization-based subsampling quality assessment methodology for mobility-related information systems
AU - Gomari, Syrus
AU - Knoth, Christoph
AU - Antoniou, Constantinos
N1 - Publisher Copyright:
© 2022 The Authors. IET Intelligent Transport Systems published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology.
PY - 2022/5
Y1 - 2022/5
N2 - Mobility-related information systems, such as on-street parking information (OSPI) systems have become more popular in the original equipment manufacturer (OEM) industry over the last decade. However, there is a lack of methods to assess their quality at a large scale. This paper introduces a data-driven methodology to measure the true quality by fleet data prioritization-based subsampling strategies (PSSs). It is applied to the use case of OSPI using parking events (PE), but is applicable to other mobility-related information systems utilizing their respective fleet data. PSSs are defined based on neighbourhoods and time periods. Each PSS generates a unique set of spatio-temporally important areas at different quadkey zoom levels over 168 week-hours, called slices. The importance weight in each slice depends on the volume of PE within them. The algorithm for each PSS automatically selects important areas and time frames that are vital to be observed. Sample prediction models are used for the benefits assessment of the methodology by comparing it against non-prioritized randomized selection of ground truth. It is proven that the methodology can lessen the effort of ground truth collection, while maintaining the amount of information necessary to assess the true quality of a prediction model.
AB - Mobility-related information systems, such as on-street parking information (OSPI) systems have become more popular in the original equipment manufacturer (OEM) industry over the last decade. However, there is a lack of methods to assess their quality at a large scale. This paper introduces a data-driven methodology to measure the true quality by fleet data prioritization-based subsampling strategies (PSSs). It is applied to the use case of OSPI using parking events (PE), but is applicable to other mobility-related information systems utilizing their respective fleet data. PSSs are defined based on neighbourhoods and time periods. Each PSS generates a unique set of spatio-temporally important areas at different quadkey zoom levels over 168 week-hours, called slices. The importance weight in each slice depends on the volume of PE within them. The algorithm for each PSS automatically selects important areas and time frames that are vital to be observed. Sample prediction models are used for the benefits assessment of the methodology by comparing it against non-prioritized randomized selection of ground truth. It is proven that the methodology can lessen the effort of ground truth collection, while maintaining the amount of information necessary to assess the true quality of a prediction model.
UR - http://www.scopus.com/inward/record.url?scp=85122768671&partnerID=8YFLogxK
U2 - 10.1049/itr2.12160
DO - 10.1049/itr2.12160
M3 - Article
AN - SCOPUS:85122768671
SN - 1751-956X
VL - 16
SP - 602
EP - 615
JO - IET Intelligent Transport Systems
JF - IET Intelligent Transport Systems
IS - 5
ER -