Analyzing high speed rail passengers' train choices based on new online booking data in China

作者:Sun, Yanshuo; Jiang, Zhibin*; Gu, Jinjing; Zhou, Min; Li, Yeming; Zhang, Lei
来源:Transportation Research Part C: Emerging Technologies , 2018, 97: 96-113.
DOI:10.1016/j.trc.2018.10.015

摘要

This study explores two nonparametric machine learning methods, namely support vector regression (SVR) and artificial neural networks (ANN), for understanding and predicting high-speed rail (HSR) travelers' choices of ticket purchase timings, train types, and travel classes, using ticket sales data. In the train choice literature, discrete choice analysis is the predominant approach and many variants of logit models have been developed. Alternatively, emerging travel choice studies adopt non-utility-based methods, especially nonparametric machine learning methods including SVR and ANN, because (1) those methods do not rely on assumptions on the relations between choices and explanatory variables or any prior knowledge of the underlying relations; (2) they have superb capabilities of iteratively identifying patterns and extracting rules from data. This paper thus contributes to the HSR train choice literature by applying and comparing SVR and ANN with a real-world case study of the Shanghai-Beijing HSR market in China. A new normalized metric capturing both the load factor and the booking lead time is proposed as the target variable and several train service attributes, such as day of week, departure time, travel time, fare, are identified as input variables. Computational results demonstrate that both SVR and ANN can predict the train choice behavior with high accuracy, outperforming the linear regression approach. Potential applications of this study, such as rail pricing reform, have also been identified.