A Comparative Study of Machine Learning Algorithms for Industry-Specific Freight Generation Model

Hyeonsup Lim, Majbah Uddin, Yuandong Liu, Shih Miao Chin, Ho Ling Hwang

Research output: Contribution to journalArticlepeer-review

Abstract

According to Bureau of Transportation Statistics, the U.S. transportation system handled 14,329 million ton-miles of freight per day in 2020. Understanding the generation of these freight shipments is crucial for transportation researchers, planners, and policymakers to design and plan for a more efficient and connected freight transportation system. Traditionally, the freight generation modeling has been based on Ordinary Least Square (OLS) regression, although more advanced Machine Learning (ML) algorithms have been evaluated and proven to have excellent performance in various transportation applications in recent years. Furthermore, one modeling approach applied for one industry might not always be applicable for another as their freight generation logics can be quite different. The objective of this study is to apply and evaluate alternative ML algorithms in the estimation of freight generation for each of 45 industry types. Seven alternative ML algorithms, along with the base OLS regression, were evaluated and compared. In addition, the study considered different combinations of variables in both the original and logarithmic form as well as hyperparameters of those ML algorithms in the model selection for each industry type. The results showed statistically significant improvements in the root mean square error reduction by the alternative ML algorithms over the OLS for over 80% of cases. The study suggests utilizing the alternative ML algorithms can reduce the root mean square error by about 30%, depending on industry types.

Original languageEnglish
Article number15367
JournalSustainability (Switzerland)
Volume14
Issue number22
DOIs
StatePublished - Nov 2022

Funding

This research effort was sponsored by the Federal Highway Administration (FHWA) and the Bureau of Transportation Statistics (BTS), under U.S. Department of Transportation, through the project titled “Design and Development of Statistical Models and Freight Data”, grant number 2116-Z239-18.

FundersFunder number
U.S. Department of Transportation2116-Z239-18
Federal Highway Administration
Bureau of Transportation Statistics

    Keywords

    • Commodity Flow Survey (CFS)
    • North American Industry Classification System (NAICS)
    • freight attraction
    • freight generation model
    • freight production
    • machine learning algorithms

    Fingerprint

    Dive into the research topics of 'A Comparative Study of Machine Learning Algorithms for Industry-Specific Freight Generation Model'. Together they form a unique fingerprint.

    Cite this