Deep feature learning for cover song identification

Authors: Fang Jiunn Tsair; Day Chi Ting; Chang Pao Chi
Source: Multimedia Tools and Applications, 2017, 76(22): 23225-23238.
DOI: 10.1007/s11042-016-4107-6

Abstract

The identification of cover songs, where a cover song is an alternative version of a previously recorded song, has received increasing attention in music retrieval. Methods for identifying a cover song typically compare the chroma features of a query song with those of every other song in the data set. However, these pairwise comparisons require considerable time. In this study, chroma features were divided into patches to preserve the melody. An intermediate representation was then trained to reduce the dimension of each patch of chroma features. The training was performed using an autoencoder, a model commonly used in deep learning for dimensionality reduction. Experimental results showed that, compared with traditional approaches, the proposed method achieved higher identification accuracy and required less time for similarity matching on both the covers80 dataset and the Million Song Dataset.
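The two steps the abstract describes, patching chroma features and compressing each patch with an autoencoder, can be illustrated with a minimal sketch. This is not the authors' architecture: the patch length, hop size, code dimension, single hidden layer, learning rate, and the random matrix standing in for a real chromagram are all assumptions chosen for illustration, and the helper names (`make_patches`, `train_autoencoder`, `encode`) are hypothetical.

```python
import numpy as np


def make_patches(chroma, patch_len=8, hop=4):
    """Cut a (12, n_frames) chroma matrix into overlapping, flattened patches."""
    n_frames = chroma.shape[1]
    starts = range(0, n_frames - patch_len + 1, hop)
    return np.stack([chroma[:, s:s + patch_len].ravel() for s in starts])


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def train_autoencoder(X, code_dim=8, epochs=300, lr=0.05, seed=0):
    """Train a one-hidden-layer autoencoder with plain gradient descent.

    Returns the parameters and the per-epoch reconstruction loss.
    All hyperparameters here are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(0.0, 0.1, (d, code_dim))
    b1 = np.zeros(code_dim)
    W2 = rng.normal(0.0, 0.1, (code_dim, d))
    b2 = np.zeros(d)
    losses = []
    for _ in range(epochs):
        H = sigmoid(X @ W1 + b1)  # encoder: patch -> low-dimensional code
        Y = H @ W2 + b2           # decoder: code -> reconstructed patch
        err = Y - X
        losses.append(float(np.mean(np.sum(err ** 2, axis=1))))
        # Backpropagate the squared reconstruction error, averaged over patches.
        gY = 2.0 * err / n
        gW2 = H.T @ gY
        gb2 = gY.sum(axis=0)
        gZ = (gY @ W2.T) * H * (1.0 - H)
        gW1 = X.T @ gZ
        gb1 = gZ.sum(axis=0)
        W1 -= lr * gW1
        b1 -= lr * gb1
        W2 -= lr * gW2
        b2 -= lr * gb2
    return (W1, b1, W2, b2), losses


def encode(X, params):
    """Map patches to their low-dimensional codes using the trained encoder."""
    W1, b1, _, _ = params
    return sigmoid(X @ W1 + b1)


# Synthetic stand-in for a real chromagram: 12 pitch classes x 100 frames.
rng = np.random.default_rng(42)
chroma = rng.random((12, 100))
patches = make_patches(chroma)            # 24 patches of 12 * 8 = 96 values
params, losses = train_autoencoder(patches)
codes = encode(patches, params)           # 24 codes of 8 values each
print(patches.shape, codes.shape)
```

In this sketch each 96-value patch is reduced to an 8-value code, so any later similarity matching would operate on much shorter vectors, which is the source of the time saving the abstract reports. How the codes are actually compared between songs is not stated in the abstract and is left out here.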