A novel method of symbolic representation in diving data mining: A case study of highways in China

作者:Sun, Chuan*; Liu, Wei; Chu, Duanfeng; Li, Wushuang; Lu, Zhenji; Wang, Jianyu
来源:Concurrency and Computation: Practice and Experience (CCPE) , 2018, 30(24): e4976.
DOI:10.1002/cpe.4976

摘要

Vehicle field test can be conducted smoothly because of the automobile-mounted monitoring system and abundant diving data have been collected. Driving data mining is in an urgent need of new thoughts introduced to break through the original technical bottleneck. This paper presented a novel method of symbolic representation in diving data mining and applied the idea of time series symbolization to traffic engineering. The sample data is processed by normalization, dimensionality reduction, discretization, and symbolization based on the three steps of symbolic aggregate approximation (SAX) with driving data characteristics taken into adequate consideration. The results showed that the high-dimensionality miscellaneous driving time series data was rationally converted into highly readable, easy to search and locate symbolic series after semantic encoding, and the main characteristics of time series data was preserved after a substantial reduction of data dimensionality. Finally, the paper demonstrated the positive effects of this method on the analysis of actual vehicle driving safety based on case study, and it explored the application of SAX to speed and acceleration data from driving data set.