Multichannel Audio Upmixing by Time-Frequency Filtering Using Non-Negative Tensor Factorization

Nikunen Joonas*; Virtanen Tuomas; Vilermo Miikka

Abstract

This article proposes a new spatial audio coding (SAC) method based on parametrizing multichannel audio by sound objects using non-negative tensor factorization (NTF). The NTF model represents the multichannel audio signal as a linear combination of objects, each composed of a fixed spectral basis with a time-varying gain and a channel-dependent spatial gain. The model parameters are estimated using a perceptually motivated NTF model and are used to upmix a downmixed and encoded mixture signal in a Wiener-filtering manner. The performance of the proposed coding is evaluated using listening tests, which show it to be almost equal to that of conventional SAC methods. Additionally, the proposed coding enables control of the upmix content in terms of meaningful objects, and the sound source separation capability of the encoding scheme is demonstrated by examples.
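
The object model and the Wiener-style upmix described above can be illustrated with a minimal sketch. It is not the authors' implementation: the names `ntf_model` and `wiener_upmix`, the toy parameter values, and the specific ratio-mask form of the filter are assumptions made for illustration, and the perceptually motivated parameter estimation is not shown. Each object k is assumed to contribute A[c][k] * B[f][k] * G[k][t] to channel c, frequency bin f, and frame t.

```python
def ntf_model(A, B, G):
    """Return X_hat[c][f][t] = sum_k A[c][k] * B[f][k] * G[k][t]."""
    C, F, K, T = len(A), len(B), len(G), len(G[0])
    return [[[sum(A[c][k] * B[f][k] * G[k][t] for k in range(K))
              for t in range(T)]
             for f in range(F)]
            for c in range(C)]


def wiener_upmix(downmix, model, eps=1e-12):
    """Split downmix[f][t] across channels using ratio masks from the model.

    Assumed filter form: mask_c = model_c / sum over all channels of model.
    """
    C, F, T = len(model), len(downmix), len(downmix[0])
    upmix = [[[0.0] * T for _ in range(F)] for _ in range(C)]
    for f in range(F):
        for t in range(T):
            total = sum(model[c][f][t] for c in range(C)) + eps
            for c in range(C):
                upmix[c][f][t] = downmix[f][t] * model[c][f][t] / total
    return upmix


# Toy data (hypothetical): 2 channels, 2 frequency bins, 3 frames, 2 objects.
A = [[1.0, 0.2], [0.0, 0.8]]            # spatial gain: channel x object
B = [[1.0, 0.0], [0.5, 2.0]]            # spectral basis: frequency x object
G = [[1.0, 0.0, 2.0], [0.0, 1.0, 1.0]]  # time-varying gain: object x frame

model = ntf_model(A, B, G)
# Build a downmix by summing the modelled channels, then upmix it again.
downmix = [[sum(model[c][f][t] for c in range(2)) for t in range(3)]
           for f in range(2)]
upmix = wiener_upmix(downmix, model)
```

Because the masks sum to one across channels in every time-frequency cell, the upmixed channels add back up to the downmix. Restricting the sum over k to a subset of objects when building the masks is one way to picture the object-level control mentioned in the abstract.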

Publication date: 2012-10
