Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations

On-demand shared mobility services such as Uber and microtransit are steadily penetrating the worldwide market for traditional dispatched taxi services. Hence, taxi companies are seeking ways to compete. This study mined large-scale mobility data from connected taxis to discover beneficial patterns...

Full description

Saved in:
Bibliographic Details
Main Authors: Raj Bridgelall, Pan Lu, Denver D. Tolliver, Tai Xu
Format: Article
Language:English
Published: Wiley 2018-01-01
Series:Journal of Advanced Transportation
Online Access:http://dx.doi.org/10.1155/2018/8963234
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849704394564567040
author Raj Bridgelall
Pan Lu
Denver D. Tolliver
Tai Xu
author_facet Raj Bridgelall
Pan Lu
Denver D. Tolliver
Tai Xu
author_sort Raj Bridgelall
collection DOAJ
description On-demand shared mobility services such as Uber and microtransit are steadily penetrating the worldwide market for traditional dispatched taxi services. Hence, taxi companies are seeking ways to compete. This study mined large-scale mobility data from connected taxis to discover beneficial patterns that may inform strategies to improve dispatch taxi business. It is not practical to manually clean and filter large-scale mobility data that contains GPS information. Therefore, this research contributes and demonstrates an automated method of data cleaning and filtering that is suitable for such types of datasets. The cleaning method defines three filter variables and applies a layered statistical filtering technique to eliminate outlier records that do not contribute to distributions that match expected theoretical distributions of the variables. Chi-squared statistical tests evaluate the quality of the cleaned data by comparing the distribution of the three variables with their expected distributions. The overall cleaning method removed approximately 5% of the data, which consisted of errors that were obvious and others that were poor quality outliers. Subsequently, mining the cleaned data revealed that trip production in Dubai peaks for the case when only the same two drivers operate the same taxi. This finding would not have been possible without access to proprietary data that contains unique identifiers for both drivers and taxis. Datasets that identify individual drivers are not publicly available.
format Article
id doaj-art-038cd688dd2e4164963caa035c27d438
institution DOAJ
issn 0197-6729
2042-3195
language English
publishDate 2018-01-01
publisher Wiley
record_format Article
series Journal of Advanced Transportation
spelling doaj-art-038cd688dd2e4164963caa035c27d4382025-08-20T03:16:46ZengWileyJournal of Advanced Transportation0197-67292042-31952018-01-01201810.1155/2018/89632348963234Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi OperationsRaj Bridgelall0Pan Lu1Denver D. Tolliver2Tai Xu3College of Business, North Dakota State University, Fargo, North Dakota 58108, USACollege of Business, North Dakota State University, Fargo, North Dakota 58108, USAUGPTI, North Dakota State University, Fargo, North Dakota 58108, USAUniversity of Modern Sciences, Dubai, UAEOn-demand shared mobility services such as Uber and microtransit are steadily penetrating the worldwide market for traditional dispatched taxi services. Hence, taxi companies are seeking ways to compete. This study mined large-scale mobility data from connected taxis to discover beneficial patterns that may inform strategies to improve dispatch taxi business. It is not practical to manually clean and filter large-scale mobility data that contains GPS information. Therefore, this research contributes and demonstrates an automated method of data cleaning and filtering that is suitable for such types of datasets. The cleaning method defines three filter variables and applies a layered statistical filtering technique to eliminate outlier records that do not contribute to distributions that match expected theoretical distributions of the variables. Chi-squared statistical tests evaluate the quality of the cleaned data by comparing the distribution of the three variables with their expected distributions. The overall cleaning method removed approximately 5% of the data, which consisted of errors that were obvious and others that were poor quality outliers. Subsequently, mining the cleaned data revealed that trip production in Dubai peaks for the case when only the same two drivers operate the same taxi. This finding would not have been possible without access to proprietary data that contains unique identifiers for both drivers and taxis. Datasets that identify individual drivers are not publicly available.http://dx.doi.org/10.1155/2018/8963234
spellingShingle Raj Bridgelall
Pan Lu
Denver D. Tolliver
Tai Xu
Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations
Journal of Advanced Transportation
title Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations
title_full Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations
title_fullStr Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations
title_full_unstemmed Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations
title_short Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations
title_sort mining connected vehicle data for beneficial patterns in dubai taxi operations
url http://dx.doi.org/10.1155/2018/8963234
work_keys_str_mv AT rajbridgelall miningconnectedvehicledataforbeneficialpatternsindubaitaxioperations
AT panlu miningconnectedvehicledataforbeneficialpatternsindubaitaxioperations
AT denverdtolliver miningconnectedvehicledataforbeneficialpatternsindubaitaxioperations
AT taixu miningconnectedvehicledataforbeneficialpatternsindubaitaxioperations