arXiv Analytics

Sign in

arXiv:2101.09903 [cs.CV]AbstractReferencesReviewsResources

A Two-stage Framework for Compound Figure Separation

Weixin Jiang, Eric Schwenker, Trevor Spreadbury, Nicola Ferrier, Maria K. Y. Chan, Oliver Cossairt

Published 2021-01-25Version 1

Scientific literature contains large volumes of complex, unstructured figures that are compound in nature (i.e. composed of multiple images, graphs, and drawings). Separation of these compound figures is critical for information retrieval from these figures. In this paper, we propose a new strategy for compound figure separation, which decomposes the compound figures into constituent subfigures while preserving the association between the subfigures and their respective caption components. We propose a two-stage framework to address the proposed compound figure separation problem. In particular, the subfigure label detection module detects all subfigure labels in the first stage. Then, in the subfigure detection module, the detected subfigure labels help to detect the subfigures by optimizing the feature selection process and providing the global layout information as extra features. Extensive experiments are conducted to validate the effectiveness and superiority of the proposed framework, which improves the detection precision by 9%.

Related articles: Most relevant | Search more
arXiv:1703.05105 [cs.CV] (Published 2017-03-15)
A Data Driven Approach for Compound Figure Separation Using Convolutional Neural Networks
arXiv:2208.14357 [cs.CV] (Published 2022-08-30)
Compound Figure Separation of Biomedical Images: Mining Large Datasets for Self-supervised Learning
Tianyuan Yao et al.
arXiv:2204.09924 [cs.CV] (Published 2022-04-21)
Progressive Training of A Two-Stage Framework for Video Restoration