A General Compression Approach to Multi-Channel Three-Dimensional Audio

Cheng Bin<sup>*</sup>; Ritz Christian; Burnett Ian; Zheng Xiguang

doi:10.1109/TASL.2013.2260156

Abstract

This paper presents a technique for low bit rate compression of three-dimensional (3D) audio produced by multiple loudspeaker channels. The approach is based on time-frequency analysis of the localization of spatial sound sources within the 3D space rendered by a multi-channel audio signal (in this case, 16 channels). From this analysis, a stereo downmix signal representing the original 16 channels is derived. Alternatively, a mono downmix signal can be derived, accompanied by side information representing the location of sound sources within the 3D spatial scene. The resulting downmix signals are then compressed with a traditional audio coder, yielding a representation of the 3D soundfield at bit rates comparable with existing stereo audio coders while maintaining the perceptual quality obtained by encoding each channel separately.
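The mono-downmix-plus-side-information idea can be illustrated with a rough sketch. This is not the authors' algorithm: it assumes a hypothetical horizontal ring of 16 loudspeakers (azimuth only, no elevation), non-overlapping Hann-windowed frames, and an energy-weighted direction estimate per time-frequency tile. The function name `mono_downmix_with_direction` and all parameters are illustrative.

```python
import numpy as np

def mono_downmix_with_direction(x, azimuths_deg, frame=256):
    """Illustrative only; not the method from the paper.

    x            : array of shape (channels, samples)
    azimuths_deg : assumed loudspeaker azimuths in degrees, one per channel
    Returns the mono downmix spectrum and an estimated source azimuth
    (degrees) for each time-frequency tile, both of shape (frames, bins).
    """
    n_ch, n = x.shape
    n_frames = n // frame
    # Split each channel into non-overlapping frames: (channels, frames, frame)
    frames = x[:, :n_frames * frame].reshape(n_ch, n_frames, frame)
    # Time-frequency analysis: windowed FFT of every frame
    spec = np.fft.rfft(frames * np.hanning(frame), axis=-1)
    energy = np.abs(spec) ** 2
    # Represent each loudspeaker direction as a unit vector in the complex plane
    unit = np.exp(1j * np.deg2rad(np.asarray(azimuths_deg, dtype=float)))
    # Energy-weighted sum of loudspeaker directions per tile
    vec = np.tensordot(unit, energy, axes=(0, 0))
    direction_deg = np.rad2deg(np.angle(vec))
    # Passive mono downmix: sum of all channel spectra
    downmix = spec.sum(axis=0)
    return downmix, direction_deg

# Usage: a tone played only from the loudspeaker assumed to sit at 90 degrees
azimuths = np.arange(16) * 22.5
t = np.arange(1024)
x = np.zeros((16, 1024))
x[4] = np.sin(2 * np.pi * 8 * t / 256)  # tone centred on FFT bin 8
downmix, direction = mono_downmix_with_direction(x, azimuths)
```

In this toy case the direction estimated at the tone's frequency bin matches the azimuth of the only active loudspeaker. In a real codec the downmix would then be passed to a conventional audio coder and the direction values quantized as side information.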

Publication date: 2013-08
