摘要

Building a supervised model with good generalization ability in data space using a tiny number of labeled samples is always practically significant and attractive, because labeled samples are generally very expensive to obtain, as the labeling process usually takes much time and resource. In this paper we propose a new supervised two-step learning framework, Posterior Distribution Learning (PDL), to build a robust supervised model in data space by first describing a new constrained graph method to estimate the posterior probability of the unlabeled samples in learning set and then extending a real function regression model to a vector-valued function model to fit a nonlinear function for the posterior probability distribution in the input data space. Compared with traditional supervised learning method, PDL can significantly reduce the demand upon training samples and exhibits the potential to perform nonlinear manifold classification. Experimental results on both synthetic and real world data sets are presented to demonstrate the validity of the proposed supervised architecture.