Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations
On-demand shared mobility services such as Uber and microtransit are steadily penetrating the worldwide market for traditional dispatched taxi services. Hence, taxi companies are seeking ways to compete. This study mined large-scale mobility data from connected taxis to discover beneficial patterns...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Wiley
2018-01-01
|
| Series: | Journal of Advanced Transportation |
| Online Access: | http://dx.doi.org/10.1155/2018/8963234 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849704394564567040 |
|---|---|
| author | Raj Bridgelall Pan Lu Denver D. Tolliver Tai Xu |
| author_facet | Raj Bridgelall Pan Lu Denver D. Tolliver Tai Xu |
| author_sort | Raj Bridgelall |
| collection | DOAJ |
| description | On-demand shared mobility services such as Uber and microtransit are steadily penetrating the worldwide market for traditional dispatched taxi services. Hence, taxi companies are seeking ways to compete. This study mined large-scale mobility data from connected taxis to discover beneficial patterns that may inform strategies to improve dispatch taxi business. It is not practical to manually clean and filter large-scale mobility data that contains GPS information. Therefore, this research contributes and demonstrates an automated method of data cleaning and filtering that is suitable for such types of datasets. The cleaning method defines three filter variables and applies a layered statistical filtering technique to eliminate outlier records that do not contribute to distributions that match expected theoretical distributions of the variables. Chi-squared statistical tests evaluate the quality of the cleaned data by comparing the distribution of the three variables with their expected distributions. The overall cleaning method removed approximately 5% of the data, which consisted of errors that were obvious and others that were poor quality outliers. Subsequently, mining the cleaned data revealed that trip production in Dubai peaks for the case when only the same two drivers operate the same taxi. This finding would not have been possible without access to proprietary data that contains unique identifiers for both drivers and taxis. Datasets that identify individual drivers are not publicly available. |
| format | Article |
| id | doaj-art-038cd688dd2e4164963caa035c27d438 |
| institution | DOAJ |
| issn | 0197-6729 2042-3195 |
| language | English |
| publishDate | 2018-01-01 |
| publisher | Wiley |
| record_format | Article |
| series | Journal of Advanced Transportation |
| spelling | doaj-art-038cd688dd2e4164963caa035c27d4382025-08-20T03:16:46ZengWileyJournal of Advanced Transportation0197-67292042-31952018-01-01201810.1155/2018/89632348963234Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi OperationsRaj Bridgelall0Pan Lu1Denver D. Tolliver2Tai Xu3College of Business, North Dakota State University, Fargo, North Dakota 58108, USACollege of Business, North Dakota State University, Fargo, North Dakota 58108, USAUGPTI, North Dakota State University, Fargo, North Dakota 58108, USAUniversity of Modern Sciences, Dubai, UAEOn-demand shared mobility services such as Uber and microtransit are steadily penetrating the worldwide market for traditional dispatched taxi services. Hence, taxi companies are seeking ways to compete. This study mined large-scale mobility data from connected taxis to discover beneficial patterns that may inform strategies to improve dispatch taxi business. It is not practical to manually clean and filter large-scale mobility data that contains GPS information. Therefore, this research contributes and demonstrates an automated method of data cleaning and filtering that is suitable for such types of datasets. The cleaning method defines three filter variables and applies a layered statistical filtering technique to eliminate outlier records that do not contribute to distributions that match expected theoretical distributions of the variables. Chi-squared statistical tests evaluate the quality of the cleaned data by comparing the distribution of the three variables with their expected distributions. The overall cleaning method removed approximately 5% of the data, which consisted of errors that were obvious and others that were poor quality outliers. Subsequently, mining the cleaned data revealed that trip production in Dubai peaks for the case when only the same two drivers operate the same taxi. This finding would not have been possible without access to proprietary data that contains unique identifiers for both drivers and taxis. Datasets that identify individual drivers are not publicly available.http://dx.doi.org/10.1155/2018/8963234 |
| spellingShingle | Raj Bridgelall Pan Lu Denver D. Tolliver Tai Xu Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations Journal of Advanced Transportation |
| title | Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations |
| title_full | Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations |
| title_fullStr | Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations |
| title_full_unstemmed | Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations |
| title_short | Mining Connected Vehicle Data for Beneficial Patterns in Dubai Taxi Operations |
| title_sort | mining connected vehicle data for beneficial patterns in dubai taxi operations |
| url | http://dx.doi.org/10.1155/2018/8963234 |
| work_keys_str_mv | AT rajbridgelall miningconnectedvehicledataforbeneficialpatternsindubaitaxioperations AT panlu miningconnectedvehicledataforbeneficialpatternsindubaitaxioperations AT denverdtolliver miningconnectedvehicledataforbeneficialpatternsindubaitaxioperations AT taixu miningconnectedvehicledataforbeneficialpatternsindubaitaxioperations |