Modeling freight mode choice using machine learning classifiers: a comparative study using Commodity Flow Survey (CFS) data

Majbah Uddin, Sabreena Anowar, Naveen Eluru

Research output: Contribution to journalArticlepeer-review

9 Scopus citations

Abstract

This study explores the usefulness of machine learning classifiers for modeling freight mode choice. We investigate eight commonly used machine learning classifiers, namely Naïve Bayes, Support Vector Machine, Artificial Neural Network, K-Nearest Neighbors, Classification and Regression Tree, Random Forest, Boosting and Bagging, along with the classical Multinomial Logit model. US 2012 Commodity Flow Survey data are used as the primary data source; we augment it with spatial attributes from secondary data sources. The performance of the classifiers is compared based on prediction accuracy results. The current research also examines the role of sample size and training-testing data split ratios on the predictive ability of the various approaches. In addition, the importance of variables is estimated to determine how the variables influence freight mode choice. The results show that the tree-based ensemble classifiers perform the best. Specifically, Random Forest produces the most accurate predictions, closely followed by Boosting and Bagging. With regard to variable importance, shipment characteristics, such as shipment distance, industry classification of the shipper and shipment size, are the most significant factors for freight mode choice decisions.

Original languageEnglish
Pages (from-to)543-559
Number of pages17
JournalTransportation Planning and Technology
Volume44
Issue number5
DOIs
StatePublished - 2021

Funding

The authors would like to acknowledge Nowreen Keya and Naveen Chandra for helping with initial data preparation.

Keywords

  • Freight mode choice
  • classification
  • commodity flow survey
  • machine learning
  • mode choice factors

Fingerprint

Dive into the research topics of 'Modeling freight mode choice using machine learning classifiers: a comparative study using Commodity Flow Survey (CFS) data'. Together they form a unique fingerprint.

Cite this