arXiv Analytics

Sign in

arXiv:1909.03360 [cs.CV]AbstractReferencesReviewsResources

Meta-Transfer Networks for Zero-Shot Learning

Yunlong Yu, Zhongfei Zhang, Jungong Han

Published 2019-09-08Version 1

Zero-Shot Learning (ZSL) aims at recognizing unseen categories using some class semantics of the categories. The existing studies mostly leverage the seen categories to learn a visual-semantic interaction model to infer the unseen categories. However, the disjointness between the seen and unseen categories cannot ensure that the models trained on the seen categories generalize well to the unseen categories. In this work, we propose an episode-based approach to accumulate experiences on addressing disjointness issue by mimicking extensive classification scenarios where training classes and test classes are disjoint. In each episode, a visual-semantic interaction model is first trained on a subset of seen categories as a learner that provides an initial prediction for the rest disjoint seen categories and then a meta-learner fine-tunes the learner by minimizing the differences between the prediction and the ground-truth labels in a pre-defined space. By training extensive episodes on the seen categories, the model is trained to be an expert in predicting the mimetic unseen categories, which will generalize well to the real unseen categories. Extensive experiments on four datasets under both the traditional ZSL and generalized ZSL tasks show that our framework outperforms the state-of-the-art approaches by large margins.

Related articles: Most relevant | Search more
arXiv:1903.00502 [cs.CV] (Published 2019-03-01)
Learning where to look: Semantic-Guided Multi-Attention Localization for Zero-Shot Learning
arXiv:2104.02236 [cs.CV] (Published 2021-04-06)
Hippocampus-heuristic Character Recognition Network for Zero-shot Learning
arXiv:1603.00550 [cs.CV] (Published 2016-03-02)
Synthesized Classifiers for Zero-Shot Learning