Machine learning from imbalanced data-sets: An application to the bike-sharing inventory problem

Giovanni Ceccarelli, Guido Cantelmo, Marialisa Nigro, Constantinos Antoniou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

One of the major issue bike sharing operators struggle to deal with is the bicycle rebalancing activity, i.e. optimizing the fleet location reducing the related activity cost. In order to reduce operational cost generated by rebalancing and to facilitate the adoption of bike sharing by users, it is extremely important to estimate the correct value of bicycles (and available docks in case of station-based bike sharing), that is the optimal inventory level. In this paper we investigate the potential of using machine learning techniques for estimating the inventory level to address the station-based bike sharing static rebalancing in the case of imbalanced data-set. Specifically, Random Forest (RF) and Gradient Tree Boosting classifiers have been proposed, together with a new iterative approach based on RF. All the methods have been tested adopting real world data of New York City bikes together with weather data.

Original languageEnglish
Title of host publication2021 7th International Conference on Models and Technologies for Intelligent Transportation Systems, MT-ITS 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728189956
DOIs
StatePublished - 16 Jun 2021
Event7th International Conference on Models and Technologies for Intelligent Transportation Systems, MT-ITS 2021 - Heraklion, Greece
Duration: 16 Jun 202117 Jun 2021

Publication series

Name2021 7th International Conference on Models and Technologies for Intelligent Transportation Systems, MT-ITS 2021

Conference

Conference7th International Conference on Models and Technologies for Intelligent Transportation Systems, MT-ITS 2021
Country/TerritoryGreece
CityHeraklion
Period16/06/2117/06/21

Keywords

  • Bike sharing
  • Imbalanced data
  • Inventory level
  • Machine learning
  • Random Forest
  • Rebalancing problem

Fingerprint

Dive into the research topics of 'Machine learning from imbalanced data-sets: An application to the bike-sharing inventory problem'. Together they form a unique fingerprint.

Cite this