arXiv Analytics

Sign in

arXiv:1504.00325 [cs.CV]AbstractReferencesReviewsResources

Microsoft COCO Captions: Data Collection and Evaluation Server

Xinlei Chen, Hao Fang, Tsung-Yi Lin, Ramakrishna Vedantam, Saurabh Gupta, Piotr Dollar, C. Lawrence Zitnick

Published 2015-04-01Version 1

In this paper we describe the Microsoft COCO Caption dataset and evaluation server. When completed, the dataset will contain over one and a half million captions describing over 330,000 images. For the training and validation images, five independent human generated captions will be provided. To ensure consistency in evaluation of automatic caption generation algorithms, an evaluation server is used. The evaluation server receives candidate captions and scores them using several popular metrics, including BLEU, METEOR, ROUGE and CIDEr. Instructions for using the evaluation server are provided.

Related articles: Most relevant | Search more
arXiv:2211.04769 [cs.CV] (Published 2022-11-09)
Interpretable Explainability in Facial Emotion Recognition and Gamification for Data Collection
arXiv:2106.10733 [cs.CV] (Published 2021-06-20)
Mobile Sensing for Multipurpose Applications in Transportation
arXiv:2108.10992 [cs.CV] (Published 2021-08-24)
OOWL500: Overcoming Dataset Collection Bias in the Wild