Active learning for regression using greedy sampling

Wu, Dongrui<sup>*</sup>; Lin, Chin-Teng; Huang, Jian<sup>*</sup>

doi:10.1016/j.ins.2018.09.060

摘要

Regression problems are pervasive in real-world applications. Generally a substantial amount of labeled samples are needed to build a regression model with good generalization ability. However, many times it is relatively easy to collect a large number of unlabeled samples, but time-consuming or expensive to label them. Active learning for regression (ALR) is a methodology to reduce the number of labeled samples, by selecting the most beneficial ones to label, instead of random selection. This paper proposes two new ALR approaches based on greedy sampling (GS). The first approach (GSy) selects new samples to increase the diversity in the output space, and the second (iGS) selects new samples to increase the diversity in both input and output spaces. Extensive experiments on 10 UCI and CMU StatLib datasets from various domains, and on 15 subjects on EEG-based driver drowsiness estimation, verified their effectiveness and robustness.

出版日期2019-2
单位华中科技大学

全文

访问全文

收藏分享被引(54) 浏览

更新时间：2024-04-04 16:59

Active learning for regression using greedy sampling

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友