arXiv:1807.01990 Abstract | arXiv Analytics

arXiv:1807.01990 [cs.CV]Abstract References Reviews Resources

Transfer Learning From Synthetic To Real Images Using Variational Autoencoders For Precise Position Detection

Tadanobu Inoue, Subhajit Chaudhury, Giovanni De Magistris, Sakyasingha Dasgupta

Published 2018-07-04Version 1

Capturing and labeling camera images in the real world is an expensive task, whereas synthesizing labeled images in a simulation environment is easy for collecting large-scale image data. However, learning from only synthetic images may not achieve the desired performance in the real world due to a gap between synthetic and real images. We propose a method that transfers learned detection of an object position from a simulation environment to the real world. This method uses only a significantly limited dataset of real images while leveraging a large dataset of synthetic images using variational autoencoders. Additionally, the proposed method consistently performed well in different lighting conditions, in the presence of other distractor objects, and on different backgrounds. Experimental results showed that it achieved accuracy of 1.5mm to 3.5mm on average. Furthermore, we showed how the method can be used in a real-world scenario like a "pick-and-place" robotic task.

Categories: cs.CV

Keywords: real images, precise position detection, variational autoencoders, transfer learning, real world

Related articles: Most relevant | Search more

arXiv:2305.18769 [cs.CV] (Published 2023-05-30)

DualVAE: Controlling Colours of Generated and Real Images

Keerth Rathakumar, David Liebowitz, Christian Walder, Kristen Moore, Salil S. Kanhere

arXiv:2108.11005 [cs.CV] (Published 2021-08-25)

Wanderlust: Online Continual Object Detection in the Real World

Jianren Wang, Xin Wang, Yue Shang-Guan, Abhinav Gupta

arXiv:1709.00751 [cs.CV] (Published 2017-09-03)

Sushi Dish - Object detection and classification from real images

Yeongjin Oh, Seunghyun Son, Gyumin Sim