Sparse and structured decomposition of audio signals on hybrid dictionaries using musical priors

作者:Papadopoulos Helene*; Kowalski Matthieu
来源:Journal of the Acoustical Society of America, 2013, 134(1): 666-685.
DOI:10.1121/1.4807821

摘要

This paper investigates the use of musical priors for sparse expansion of audio signals of music, on an overcomplete dual-resolution dictionary taken from the union of two orthonormal bases that can describe both transient and tonal components of a music audio signal. More specifically, chord and metrical structure information are used to build a structured model that takes into account dependencies between coefficients of the decomposition, both for the tonal and for the transient layer. The denoising task application is used to provide a proof of concept of the proposed musical priors. Several configurations of the model are analyzed. Evaluation on monophonic and complex polyphonic excerpts of real music signals shows that the proposed approach provides results whose quality measured by the signal-to-noise ratio is competitive with state-of-the-art approaches, and more coherent with the semantic content of the signal. A detailed analysis of the model in terms of sparsity and in terms of interpretability of the representation is also provided and shows that the model is capable of giving a relevant and legible representation of Western tonal music audio signals.

  • 出版日期2013-7

全文