Multi-Channel Audio Source Separation Using Multiple Deformed References

Souviraa Labastic Nathan<sup>*</sup>; Olivero Anaik; Vincent Emmanuel; Bimbot Frederic

doi:10.1109/TASLP.2015.2450494

摘要

We present a general multi-channel source separation framework where additional audio references are available for one (or more) source(s) of a given mixture. Each audio reference is another mixture which is supposed to contain at least one source similar to one of the target sources. Deformations between the sources of interest and their references are modeled in a linear manner using a generic formulation. This is done by adding transformation matrices to an excitation-filter model, hence affecting different axes, namely frequency, dictionary component or time. A nonnegative matrix co-factorization algorithm and a generalized expectation-maximization algorithm are used to estimate the parameters of the model. Different model parameterizations and different combinations of algorithms are tested on music plus voice mixtures guided by music and/or voice references and on professionally-produced music recordings guided by cover references. Our algorithms improve the signal-to-distortion ratio (SDR) of the sources with the lowest intensity by 9 to 15 decibels (dB) with respect to original mixtures.

出版日期2015-11
单位INRIA

全文

访问全文

收藏分享被引(12) 浏览

更新时间：2021-03-13 16:22

Multi-Channel Audio Source Separation Using Multiple Deformed References

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友