Examining trip-level errors in passively collected mobile device data for data quality assurance.

Bibliographic Details
Main Authors: Peiqi Zhang, Kathleen Stewart, Aref Darzi
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2025-01-01
Series: PLoS ONE
Online Access: https://doi.org/10.1371/journal.pone.0321970
collection DOAJ
description Location-based service (LBS) data passively collected by mobile devices has been widely adopted in multiple fields for its advantages in revealing travel behaviors. Data quality assessments have always been important steps for analyses using the data, but the impact of trip-level errors has not been a focus of these assessments. We examine a newly emerged type of error present at the trip level in LBS datasets that violates the spatio-temporal consistency of such data by including trips on road segments where and when there should be no trips. We designed a distributed-computing workflow to quantify the errors by comparing the number of trips on closed road segments during road closures with the number in the time periods before and after. Using two real-world cases from 2023, we examined multiple datasets acquired from major vendors in the US, and several of the datasets contained a significant number of trip-level errors. These findings indicate that the errors are present in recent datasets that have not otherwise been processed for data quality and that they can significantly impact analyses by data users. Data users should consider conducting trip-level error checks as part of their preprocessing steps.
format Article
id doaj-art-58096e560d694111a3acaa5c10b65a28
institution OA Journals
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-58096e560d694111a3acaa5c10b65a28; 2025-08-20T02:23:27Z; eng; Public Library of Science (PLoS); PLoS ONE; 1932-6203; 2025-01-01; 20(4): e0321970; 10.1371/journal.pone.0321970; Examining trip-level errors in passively collected mobile device data for data quality assurance.; Peiqi Zhang; Kathleen Stewart; Aref Darzi; https://doi.org/10.1371/journal.pone.0321970