Abstract

Mixture of experts (ME) is a modular neural network architecture for supervised classification. A double-loop expectation-maximization (EM) algorithm has been developed for learning the parameters of the ME architecture, and the iteratively reweighted least squares (IRLS) algorithm and the Newton-Raphson algorithm are two popular schemes for learning the gating-network parameters in the inner loop. In this letter, we investigate the asymptotic convergence properties of the EM algorithm for ME using either the IRLS or the Newton-Raphson approach. With the help of an overlap measure for the ME model, we obtain an upper bound on the asymptotic convergence rate of the EM algorithm in each case. Moreover, we find that for the Newton approach, as a specific Newton-Raphson approach to learning the parameters in the inner loop, the upper bound on the asymptotic convergence rate of the EM algorithm locally around the true solution Theta* is o(e^(0.5-epsilon)(Theta*)), where epsilon > 0 is an arbitrarily small number, o(x) denotes a quantity that is an infinitesimal of higher order than x as x -> 0, and e(Theta*) is a measure of the average overlap of the ME model. That is, as the average overlap of the true ME model tends to zero in the large-sample limit, the EM algorithm with the Newton approach to learning the parameters in the inner loop tends to be asymptotically superlinear. Finally, we substantiate our theoretical results by simulation experiments.
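To make the setup concrete, the following is a minimal sketch (not the letter's code) of the double-loop EM algorithm for a two-expert ME model with linear-Gaussian experts and a logistic gating network, in which the inner loop applies Newton-Raphson (IRLS) updates to the gating parameters. All model choices, variable names, and iteration counts here are illustrative assumptions.

```python
# Double-loop EM for a two-expert mixture-of-experts (ME) model:
# outer loop = E-step / M-step; inner loop = Newton-Raphson (IRLS)
# updates of the logistic gating network.  Illustrative sketch only.
import numpy as np

rng = np.random.default_rng(0)

def gaussian_pdf(y, mu, var):
    return np.exp(-0.5 * (y - mu) ** 2 / var) / np.sqrt(2 * np.pi * var)

def em_mixture_of_experts(X, y, n_outer=50, n_inner=5):
    n, d = X.shape
    w = rng.normal(size=(2, d))      # expert regression weights
    var = np.ones(2)                 # expert noise variances
    v = np.zeros(d)                  # gating parameters (logistic)

    for _ in range(n_outer):
        # E-step: posterior responsibility of expert 1 for each sample
        g1 = 1.0 / (1.0 + np.exp(-X @ v))
        p1 = g1 * gaussian_pdf(y, X @ w[0], var[0])
        p2 = (1 - g1) * gaussian_pdf(y, X @ w[1], var[1])
        h1 = p1 / (p1 + p2)

        # M-step for the experts: weighted least squares (closed form)
        for j, hj in enumerate((h1, 1 - h1)):
            Xw = X * hj[:, None]
            w[j] = np.linalg.solve(Xw.T @ X, Xw.T @ y)
            var[j] = (hj * (y - X @ w[j]) ** 2).sum() / hj.sum()

        # Inner loop: Newton-Raphson (IRLS) updates of the gating network
        for _ in range(n_inner):
            g1 = 1.0 / (1.0 + np.exp(-X @ v))
            r = g1 * (1 - g1)                  # IRLS weights
            grad = X.T @ (h1 - g1)             # gradient of the expected log-likelihood
            hess = X.T @ (X * r[:, None])      # negative Hessian
            v = v + np.linalg.solve(hess + 1e-8 * np.eye(d), grad)

    return w, var, v

# Synthetic data from a two-expert ME model: which expert applies depends on x.
X = np.column_stack([np.ones(400), rng.uniform(-3, 3, size=400)])
gate = 1.0 / (1.0 + np.exp(-2.0 * X[:, 1]))
z = rng.random(400) < gate
y = np.where(z, 1.0 + 2.0 * X[:, 1], -1.0 - 0.5 * X[:, 1]) + 0.3 * rng.normal(size=400)

w, var, v = em_mixture_of_experts(X, y)
print("expert weights:\n", w, "\ngating parameters:", v)
```

In this sketch the two experts barely overlap when the gate is sharp, which is the regime (small average overlap e(Theta*)) in which the letter shows the EM iterations with a Newton inner loop tend to become asymptotically superlinear.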