

arXiv:1511.02570 [cs.CV]

Explicit Knowledge-based Reasoning for Visual Question Answering

Peng Wang, Qi Wu, Chunhua Shen, Anton van den Hengel, Anthony Dick

Published 2015-11-09, Version 1

We describe a method for visual question answering which is capable of reasoning about the contents of an image on the basis of information extracted from a large-scale knowledge base. The method not only answers natural language questions using concepts not contained in the image, but can also provide an explanation of the reasoning by which it developed its answer. The method is capable of answering far more complex questions than the predominant long short-term memory-based approach, and outperforms it significantly in testing. We also provide a dataset and a protocol by which to evaluate such methods, thus addressing one of the key issues in general visual question answering.

Related articles:
arXiv:2006.14264 [cs.CV] (Published 2020-06-25)
Self-Segregating and Coordinated-Segregating Transformer for Focused Deep Multi-Modular Network for Visual Question Answering
arXiv:1907.12133 [cs.CV] (Published 2019-07-28)
An Empirical Study on Leveraging Scene Graphs for Visual Question Answering
arXiv:1906.10169 [cs.CV] (Published 2019-06-24)
RUBi: Reducing Unimodal Biases in Visual Question Answering