标题:Multimodal question answering over structured data with ambiguous entities
作者:Li, Huadong ;Wang, Yafang ;De Melo, Gerard ;Tu, Changhe ;Chen, Baoquan
通讯作者:Wang, Yafang
作者机构:[Li, H] Shandong University, Jinan Shandong, China;[ Wang, Y] Shandong University, Jinan Shandong, China;[ De Melo, G] Rutgers University, New Brunswi 更多
会议名称:26th International World Wide Web Conference, WWW 2017 Companion
会议日期:3 April 2017 through 7 April 2017
来源:26th International World Wide Web Conference 2017, WWW 2017 Companion
出版年:2019
页码:79-88
DOI:10.1145/3041021.3054135
关键词:Multimedia knowledge bases; Multimodal; Question answering
摘要:In recent years, we have witnessed profound changes in the way people satisfy their information needs. For instance, with the ubiquitous 24/7 availability of mobile devices, the number of search engine queries on mobile devices has reportedly overtaken that of queries on regular personal computers. In this paper, we consider the task of multimodal question answering over structured data, in which a user supplies not just a natural language query but also an image. Our system addresses this by optimizing a non-convex objective function capturing multimodal constraints. Our experiments show that this enables it to answer even very challenging ambiguous entity queries with high accuracy. © 2017 International World Wide Web Conference Committee (IW3C2), published under Creative Commons CC BY 4.0 License.
收录类别:EI;SCOPUS
Scopus被引频次:3
资源类型:会议论文;期刊论文
原文链接:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85027419208&doi=10.1145%2f3041021.3054135&partnerID=40&md5=3b642f86db27b92297b4c68ea9621d41
TOP