Predicting venue popularity using crowd-sourced and passive sensor data

Stanislav Timokhin, Mohammad Sadrani, Constantinos Antoniou

Research output: Contribution to journalArticlepeer-review

9 Scopus citations

Abstract

Efficient and reliable mobility pattern identification is essential for transport planning research. In order to infer mobility patterns, however, a large amount of spatiotemporal data is needed, which is not always available. Hence, location-based social networks (LBSNs) have received considerable attention as a potential data provider. The aim of this study is to investigate the possibility of using several different auxiliary information sources for venue popularity modeling and provide an alternative venue popularity measuring approach. Initially, data from widely used services, such as Google Maps, Yelp and OpenStreetMap (OSM), are used to model venue popularity. To estimate hourly venue occupancy, two different classes of model are used, including linear regression with lasso regularization and gradient boosted regression (GBR). The predictions are made based on venue-related parameters (e.g., rating, comments) and locational properties (e.g., stores, hotels, attractions). Results show that the prediction can be improved using GBR with a logarithmic transformation of the dependent variables. To investigate the quality of social media-based models by obtaining WiFi-based ground truth data, a microcontroller setup is developed to measure the actual number of people attending venues using WiFi presence detection, demonstrating that the similarity between the results of WiFi data collection and Google “Popular Times” is relatively promising.

Original languageEnglish
Pages (from-to)818-841
Number of pages24
JournalSmart Cities
Volume3
Issue number3
DOIs
StatePublished - Sep 2020

Keywords

  • Big data
  • Google popular times
  • Mobility pattern
  • Venue popularity
  • WiFi data collection

Fingerprint

Dive into the research topics of 'Predicting venue popularity using crowd-sourced and passive sensor data'. Together they form a unique fingerprint.

Cite this