. 2022;49(5):1339-1383.

doi: 10.1007/s11116-021-10214-3. Epub 2021 Aug 12.

A data-driven travel mode share estimation framework based on mobile device location data

Mofeng Yang¹, Yixuan Pan¹, Aref Darzi¹, Sepehr Ghader¹, Chenfeng Xiong^{1

2}, Lei Zhang¹

Affiliations

¹ Department of Civil and Environmental Engineering, Maryland Transportation Institute (MTI), University of Maryland, 1173 Glenn Martin Hall, College Park, MD 20742 USA.
² Shock Trauma and Anesthesiology Research (STAR) Center, School of Medicine, University of Maryland, 685 W Baltimore Street, Suite 600, Baltimore, MD 21201 USA.

PMID: 34400848
PMCID: PMC8358917
DOI: 10.1007/s11116-021-10214-3

A data-driven travel mode share estimation framework based on mobile device location data

Mofeng Yang et al. Transportation (Amst). 2022.

. 2022;49(5):1339-1383.

doi: 10.1007/s11116-021-10214-3. Epub 2021 Aug 12.

Authors

Mofeng Yang¹, Yixuan Pan¹, Aref Darzi¹, Sepehr Ghader¹, Chenfeng Xiong^{1

2}, Lei Zhang¹

Affiliations

¹ Department of Civil and Environmental Engineering, Maryland Transportation Institute (MTI), University of Maryland, 1173 Glenn Martin Hall, College Park, MD 20742 USA.
² Shock Trauma and Anesthesiology Research (STAR) Center, School of Medicine, University of Maryland, 685 W Baltimore Street, Suite 600, Baltimore, MD 21201 USA.

PMID: 34400848
PMCID: PMC8358917
DOI: 10.1007/s11116-021-10214-3

Abstract

Mobile device location data (MDLD) contains abundant travel behavior information to support travel demand analysis. Compared to traditional travel surveys, MDLD has larger spatiotemporal coverage of the population and its mobility. However, ground truth information such as trip origins and destinations, travel modes, and trip purposes are not included by default. Such important attributes must be imputed to maximize the usefulness of the data. This paper targets at studying the capability of MDLD on estimating travel mode share at aggregated levels. A data-driven framework is proposed to extract travel behavior information from MDLD. The proposed framework first identifies trip ends with a modified Spatiotemporal Density-based Spatial Clustering of Applications with Noise algorithm. Then three types of features are extracted for each trip to impute travel modes using machine learning models. A labeled MDLD dataset with ground truth information is used to train the proposed models, resulting in a 95% recall rate in identifying trip ends and over 93% tenfold cross-validation accuracy in imputing the five travel modes (drive, rail, bus, bike and walk) with a random forest (RF) classifier. The proposed framework is then applied to two large-scale MDLD datasets, covering the Baltimore-Washington metropolitan area and the United States, respectively. The estimated trip distance, trip time, trip rate distribution, and travel mode share are compared against travel surveys at different geographies. The results suggest that the proposed framework can be readily applied in different states and metropolitan regions with low cost in order to study multimodal travel demand, understand mobility trends, and support decision making.

Keywords: Machine learning; Mobile device location data; Travel mode share; Travel surveys.

PubMed Disclaimer

Conflict of interest statement

Conflict of interestThe authors declare that they have no conflict of interest.

Figures

**Fig. 1**
The data-driven travel mode share estimation framework

**Fig. 2**
Typical daily travel pattern of an individual

**Fig. 3**
incenTrip trip trajectories for a drive; b bus; c rail; d non-motorized

**Fig. 4**
Raw location point distribution for a Dataset I; b Dataset II

**Fig. 5**
Multimodal transportation networks: drive (grey), rail (green), bus (blue). (Color figure online)

**Fig. 6**
Trip end identification results

**Fig. 7**
a Spatial accuracy distribution for the three LBS datasets; b location recording interval distribution for the two case studies’ LBS datasets

**Fig. 8**
Comparison with 2007/2008 TPB-BMC HHTS on: a trip distance distribution; b trip time distribution; c trip rate distribution; d time of day distribution

**Fig. 9**
a Statewide travel mode share comparison; b county-level travel mode share correlation; c county-level travel mode share comparison

**Fig. 10**
Census tract-level mode share illustration for a and b: rail mode share in D.C. and Baltimore City; c and d: Bus mode share in D.C. and Baltimore City

**Fig. 11**
Comparison with NHTS 2017 on: a Trip distance distribution; b trip time distribution; c trip rate distribution

**Fig. 13**
Nationwide-level air trip origins

**Fig. 14**
a Nationwide travel mode share comparison; b state-level travel mode share correlation; c state-level travel mode share comparison

**Fig. 15**
CBSA-level illustration of a rail travel mode share; b bus travel mode share

**Fig. 16**
Feature importance of the random forest classifier

**Fig. 17**
Observed duration distribution for: a number of unique hours (Dataset I); b unique hour bins (Dataset II); (3) number of unique days (Dataset II)

**Fig. 18**
Comparison with 2007/2008 TPB-BMC HHTS on: a short-distance trip distance distribution; b long-distance trip distance distribution

**Fig. 19**
Comparison with NHTS 2017 on: a short-distance trip distance distribution; b long-distance trip distance distribution

**Fig. 20**
County-level travel mode share comparison between Dataset I and NHTS 2017

**Fig. 21**
State-level travel mode share comparison between Dataset II and NHTS 2017

See this image and copyright information in PMC

References

1. 2000–2001 California Statewide Household Travel Survey. Final Report. NuStats, Austin (2002)
1. 2010–2012 California Household Travel Survey. Final Report Version 1.0. NuStats, Austin (2013)
1. 2010–2012 Minneapolis – St. Paul Travel Behavior Inventory. Twin Cities Metropolitan Council (2012)
1. 2011 Atlanta, Georgia, Regional Travel Survey. Final Report. NuStats, Austin (2011)
1. 2012–2013 Delaware Valley Household Travel Survey. Delaware Valley Regional Planning Commission (2013)

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A data-driven travel mode share estimation framework based on mobile device location data

Affiliations

A data-driven travel mode share estimation framework based on mobile device location data

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

LinkOut - more resources

Full Text Sources

Other Literature Sources