Abstract

Histograms have been used extensively as a simple tool for nonparametric probability density function estimation. In practice, however, the accuracy of histogram-based derived quantities, such as the marginal entropy (ME), the joint entropy (JE), or the mutual information (MI), depends on the number of bins chosen for the histogram. In this paper, we investigate the binning problem of the bivariate histogram for JE estimation. By minimizing a theoretical mean square error (MSE) of the JE estimate, we derive a new formula for the optimal number of bins of the bivariate histogram for continuous random variables. This novel JE estimator is then used in MI estimation to avoid the accumulation of error in the joint MI between the class variable and the feature subset during feature selection. In a synthetic Gaussian feature selection problem, only the proposed method retrieves the exact number of relevant features that explain the class variable, compared with a concurrent univariate estimator based on a binning formula proposed for ME estimation. In speech and speaker recognition applications, the proposed method selects a limited number of features that achieves approximately the same or even a better recognition rate than using the total number of features.
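To illustrate the quantity at the heart of the abstract, the following is a minimal sketch of histogram-based joint entropy estimation for two continuous variables. The paper's contribution is the optimal bin-count formula obtained by minimizing the MSE; since that formula is not reproduced here, the bin count below is a fixed placeholder and the data are synthetic.

```python
import numpy as np

def joint_entropy(x, y, bins):
    """Estimate the joint entropy H(X, Y) in nats from a 2-D histogram."""
    counts, _, _ = np.histogram2d(x, y, bins=bins)
    p = counts / counts.sum()      # normalize counts to joint probabilities
    p = p[p > 0]                   # drop empty bins to avoid log(0)
    return -np.sum(p * np.log(p))

# Synthetic correlated Gaussian data for illustration only.
rng = np.random.default_rng(0)
x = rng.normal(size=10_000)
y = 0.5 * x + rng.normal(size=10_000)

# 'bins=16' is an arbitrary placeholder; the paper derives the optimal
# number of bins by minimizing a theoretical MSE of the JE estimate.
print(joint_entropy(x, y, bins=16))
```

The sensitivity of this estimate to `bins` is exactly the issue the abstract describes: too few bins over-smooth the density, too many leave bins empty and bias the entropy estimate.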

  • Publication date: 2018-01-01