Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling

Raczynski Stanislaw A*; Vincent Emmanuel; Sagayama Shigeki

doi:10.1109/TASL.2013.2258012

Abstract

Symbolic pitch modeling is a way of incorporating knowledge about relations between pitches into the process of analyzing musical information or signals. In this paper, we propose a family of probabilistic symbolic polyphonic pitch models, which account for both the "horizontal" and the "vertical" pitch structure. These models are formulated as linear or log-linear interpolations of up to five sub-models, each of which is responsible for modeling a different type of relation. The ability of the models to predict symbolic pitch data is evaluated in terms of their cross-entropy, and of a newly proposed "contextual cross-entropy" measure. Their performance is then measured on synthesized polyphonic audio signals in terms of the accuracy of multiple pitch estimation in combination with a Nonnegative Matrix Factorization-based acoustic model. In both experiments, the log-linear combination of at least one "vertical" (e.g., harmony) and one "horizontal" (e.g., note duration) sub-model outperformed a pitch-dependent Bernoulli prior by more than 60% in relative cross-entropy and 3% in absolute multiple pitch estimation accuracy. This work provides a proof of concept of the usefulness of model interpolation, which may be used for improved symbolic modeling of other aspects of music in the future.
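The two interpolation schemes and the cross-entropy measure named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each sub-model outputs only the probability that a single pitch is active in the current frame, and the function names, weights, and probability values are hypothetical. The abstract does not give the exact formulation, and the paper's models condition on musical context that is omitted here.

```python
import math

# Illustrative only: every name and number below is an assumption,
# not taken from the paper.

def linear_interpolation(probs, weights):
    """Weighted sum of sub-model probabilities that a pitch is active.

    Weights are assumed to be non-negative and to sum to 1.
    """
    return sum(w * p for w, p in zip(weights, probs))

def log_linear_interpolation(probs, weights):
    """Weighted geometric combination, renormalized over {active, inactive}."""
    on = math.prod(p ** w for p, w in zip(probs, weights))
    off = math.prod((1.0 - p) ** w for p, w in zip(probs, weights))
    return on / (on + off)

def cross_entropy(predicted, observed):
    """Average negative log2-probability (bits) assigned to observed 0/1 states."""
    total = 0.0
    for p, x in zip(predicted, observed):
        total -= math.log2(p if x == 1 else 1.0 - p)
    return total / len(observed)

# Two hypothetical sub-models (say, one "vertical" and one "horizontal")
# each give a probability that the pitch is active.
sub_model_probs = [0.8, 0.6]
weights = [0.5, 0.5]
p_linear = linear_interpolation(sub_model_probs, weights)
p_log_linear = log_linear_interpolation(sub_model_probs, weights)
```

A lower cross-entropy on held-out pitch data means the combined model assigns higher probability to what actually occurs, which is the sense in which the abstract compares the interpolated models against the Bernoulli prior.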

Publication date: 2013-09


