Abstract

In decision tree classification under differential privacy, computing impurity metrics such as information gain and the Gini index is query intensive, and more queries imply more noise addition. A straightforward differentially private implementation therefore often yields poor accuracy and stability. This motivates us to adopt a better-suited metric for evaluating attributes when building the tree structure recursively. In this paper, we first give a detailed analysis of the statistical queries involved in decision tree induction. Second, we propose a private decision tree algorithm based on the noisy maximal vote, together with an effective privacy budget allocation strategy. Third, to boost accuracy and improve stability, we construct an ensemble model in which multiple private decision trees are built on bootstrapped samples. Extensive experiments on real datasets demonstrate that the proposed ensemble model provides accurate and reliable classification results.
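To make the noisy-max idea concrete, the following is a minimal sketch of the standard report-noisy-max mechanism over counting queries, which is one way such a vote can satisfy differential privacy. The function names (`noisy_max_vote`, `laplace_noise`) are illustrative and not taken from the paper, and the paper's actual algorithm and budget allocation may differ.

```python
import math
import random

def laplace_noise(scale):
    # Sample from Laplace(0, scale) via inverse-CDF transform.
    u = random.random() - 0.5
    return -scale * math.copysign(math.log(1.0 - 2.0 * abs(u)), u)

def noisy_max_vote(counts, epsilon):
    """Return the index of the noisy maximum of `counts`.

    Each count is a counting query with sensitivity 1, so adding
    independent Laplace(1/epsilon) noise to every count and releasing
    only the argmax is the classic report-noisy-max mechanism, which
    satisfies epsilon-differential privacy. Only one value (the index)
    is released, so the budget is not split across the individual counts.
    """
    noisy = [c + laplace_noise(1.0 / epsilon) for c in counts]
    return max(range(len(noisy)), key=noisy.__getitem__)
```

With a large `epsilon` the noise is negligible and the true majority class (or best-scoring attribute) is returned; as `epsilon` shrinks, the selection becomes noisier, which is the accuracy/privacy trade-off the abstract refers to.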