Abstract

In this paper, we propose a novel multimodal retrieval model based on the Extreme Learning Machine (ELM). We exploit two multimedia modalities, image and text, to achieve multimodal retrieval. First, we employ probabilistic Latent Semantic Analysis (pLSA) to model the generative processes of texts and images separately, obtaining appropriate representations for each modality. ELM is then used to learn the correlation between the image representations and the text representations, so that multimodal retrieval is carried out by the learned single-hidden-layer feedforward neural networks (SLFNs). Additionally, binary classifiers are trained to improve the accuracy of the multimodal retrieval model. The model can easily be extended to other modalities, and extensive experimental results demonstrate its effectiveness and efficiency.
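To make the core idea concrete, the following is a minimal sketch (not the authors' code) of the ELM step: a single-hidden-layer network with randomly fixed hidden weights whose output weights are solved in closed form, here used to map pLSA topic vectors of texts to those of paired images. The dimensions, the sigmoid activation, and the data are illustrative assumptions.

```python
import numpy as np

def train_elm(X, T, n_hidden=200, seed=0):
    """Fit ELM output weights in closed form: beta = pinv(H) @ T."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((X.shape[1], n_hidden))  # random input weights (never trained)
    b = rng.standard_normal(n_hidden)                # random hidden biases
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))           # hidden-layer activations (sigmoid)
    beta = np.linalg.pinv(H) @ T                     # least-squares output weights
    return W, b, beta

def elm_predict(X, W, b, beta):
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))
    return H @ beta

# Hypothetical paired data: pLSA topic distributions for texts and images.
text_topics  = np.random.rand(500, 50)   # 500 documents, 50-topic pLSA representations
image_topics = np.random.rand(500, 50)   # the paired images' 50-topic pLSA representations

# Learn the text -> image correlation; retrieval would then rank candidate
# images by similarity between the predicted vector and each image's topic vector.
W, b, beta = train_elm(text_topics, image_topics)
predicted_image_topics = elm_predict(text_topics[:1], W, b, beta)
```

Because only the output weights are solved (via a pseudoinverse) while the hidden weights stay random, training is a single linear least-squares problem, which is the source of ELM's efficiency claimed above.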