Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022;49(5):1339-1383.
doi: 10.1007/s11116-021-10214-3. Epub 2021 Aug 12.

A data-driven travel mode share estimation framework based on mobile device location data

Affiliations

A data-driven travel mode share estimation framework based on mobile device location data

Mofeng Yang et al. Transportation (Amst). 2022.

Abstract

Mobile device location data (MDLD) contains abundant travel behavior information to support travel demand analysis. Compared to traditional travel surveys, MDLD has larger spatiotemporal coverage of the population and its mobility. However, ground truth information such as trip origins and destinations, travel modes, and trip purposes are not included by default. Such important attributes must be imputed to maximize the usefulness of the data. This paper targets at studying the capability of MDLD on estimating travel mode share at aggregated levels. A data-driven framework is proposed to extract travel behavior information from MDLD. The proposed framework first identifies trip ends with a modified Spatiotemporal Density-based Spatial Clustering of Applications with Noise algorithm. Then three types of features are extracted for each trip to impute travel modes using machine learning models. A labeled MDLD dataset with ground truth information is used to train the proposed models, resulting in a 95% recall rate in identifying trip ends and over 93% tenfold cross-validation accuracy in imputing the five travel modes (drive, rail, bus, bike and walk) with a random forest (RF) classifier. The proposed framework is then applied to two large-scale MDLD datasets, covering the Baltimore-Washington metropolitan area and the United States, respectively. The estimated trip distance, trip time, trip rate distribution, and travel mode share are compared against travel surveys at different geographies. The results suggest that the proposed framework can be readily applied in different states and metropolitan regions with low cost in order to study multimodal travel demand, understand mobility trends, and support decision making.

Keywords: Machine learning; Mobile device location data; Travel mode share; Travel surveys.

PubMed Disclaimer

Conflict of interest statement

Conflict of interestThe authors declare that they have no conflict of interest.

Figures

Fig. 1
Fig. 1
The data-driven travel mode share estimation framework
Fig. 2
Fig. 2
Typical daily travel pattern of an individual
Fig. 3
Fig. 3
incenTrip trip trajectories for a drive; b bus; c rail; d non-motorized
Fig. 4
Fig. 4
Raw location point distribution for a Dataset I; b Dataset II
Fig. 5
Fig. 5
Multimodal transportation networks: drive (grey), rail (green), bus (blue). (Color figure online)
Fig. 6
Fig. 6
Trip end identification results
Fig. 7
Fig. 7
a Spatial accuracy distribution for the three LBS datasets; b location recording interval distribution for the two case studies’ LBS datasets
Fig. 8
Fig. 8
Comparison with 2007/2008 TPB-BMC HHTS on: a trip distance distribution; b trip time distribution; c trip rate distribution; d time of day distribution
Fig. 9
Fig. 9
a Statewide travel mode share comparison; b county-level travel mode share correlation; c county-level travel mode share comparison
Fig. 10
Fig. 10
Census tract-level mode share illustration for a and b: rail mode share in D.C. and Baltimore City; c and d: Bus mode share in D.C. and Baltimore City
Fig. 11
Fig. 11
Comparison with NHTS 2017 on: a Trip distance distribution; b trip time distribution; c trip rate distribution
Fig. 12
Fig. 12
Day of week variation
Fig. 13
Fig. 13
Nationwide-level air trip origins
Fig. 14
Fig. 14
a Nationwide travel mode share comparison; b state-level travel mode share correlation; c state-level travel mode share comparison
Fig. 15
Fig. 15
CBSA-level illustration of a rail travel mode share; b bus travel mode share
Fig. 16
Fig. 16
Feature importance of the random forest classifier
Fig. 17
Fig. 17
Observed duration distribution for: a number of unique hours (Dataset I); b unique hour bins (Dataset II); (3) number of unique days (Dataset II)
Fig. 18
Fig. 18
Comparison with 2007/2008 TPB-BMC HHTS on: a short-distance trip distance distribution; b long-distance trip distance distribution
Fig. 19
Fig. 19
Comparison with NHTS 2017 on: a short-distance trip distance distribution; b long-distance trip distance distribution
Fig. 20
Fig. 20
County-level travel mode share comparison between Dataset I and NHTS 2017
Fig. 21
Fig. 21
State-level travel mode share comparison between Dataset II and NHTS 2017

References

    1. 2000–2001 California Statewide Household Travel Survey. Final Report. NuStats, Austin (2002)
    1. 2010–2012 California Household Travel Survey. Final Report Version 1.0. NuStats, Austin (2013)
    1. 2010–2012 Minneapolis – St. Paul Travel Behavior Inventory. Twin Cities Metropolitan Council (2012)
    1. 2011 Atlanta, Georgia, Regional Travel Survey. Final Report. NuStats, Austin (2011)
    1. 2012–2013 Delaware Valley Household Travel Survey. Delaware Valley Regional Planning Commission (2013)

LinkOut - more resources