Multichannel Audio Upmixing by Time-Frequency Filtering Using Non-Negative Tensor Factorization

作者:Nikunen Joonas*; Virtanen Tuomas; Vilermo Miikka
来源:Journal of the Audio Engineering Society, 2012, 60(10): 794-806.

摘要

This article proposes a new spatial audio coding (SAC) method that is based on parametrization of multichannel audio by sound objects using non-negative tensor factorization (NTF). The NTF model represents the multichannel audio signal with a linear combination of objects that are composed of fixed spectral bases with a time-varying gain and a channel-dependent spatial gain. The parameters of the model are estimated using perceptually motivated NTF model and are used for upmixing a downmixed and encoded mixture signal in Wiener filtering manner. The performance of the proposed coding is evaluated using listening tests, which prove the coding performance being almost equal to conventional SAC methods. Additionally, the proposed coding enables controlling the upmix content by meaningful objects and the sound source separation possibility of the encoding scheme is demonstrated by examples.

  • 出版日期2012-10