A Compounds Generation Method Based on Visual Semantic Representation

Liu Haipeng<sup>*</sup>; Wang Xiaojie; Zhong Yixin

Abstract

To improve the generation method of the vision-grounded language model ViMac, a core-based visual semantic representation is proposed. With this representation, ViMac can use the Compounds generation method to output more accurate compounds instead of single words. The Compounds generation method can describe unseen visual feature values by creating new compounds, and it overcomes the subjective variability introduced during the learning phase. In the first experiment, three generation methods are compared by generation error rate: the Gaussian-model-based method yields 82%, the KNN method yields 69%, and the Compounds method yields 54%, at least 15 percentage points lower than the other two methods. In a second experiment comparing the execution time of the nonparametric generation methods, the KNN method takes 35.2 s and the Compounds method takes 15.7 s, less than half the time taken by the KNN method. The experimental results indicate that the Compounds generation method substantially reduces both the generation error rate and the computational complexity compared with the KNN method and the Gaussian-model-based method.

Publication date: 2011-7
Affiliation: Beijing University of Posts and Telecommunications


