arXiv:2301.09648 [astro-ph.GA]AbstractReferencesReviewsResources
Identification of galaxy shreds in large photometric catalogs using Convolutional Neural Networks
Enrico M. Di Teodoro, Josh E. G. Peek, John F. Wu
Published 2023-01-23Version 1
Contamination from galaxy fragments, identified as sources, is a major issue in large photometric galaxy catalogs. In this paper, we prove that this problem can be easily addressed with computer vision techniques. We use image cutouts to train a convolutional neural network (CNN) to identify catalogued sources that are in reality just star formation regions and/or shreds of larger galaxies. The CNN reaches an accuracy ~98% on our testing datasets. We apply this CNN to galaxy catalogs from three amongst the largest surveys available today: the Sloan Digital Sky Survey (SDSS), the DESI Legacy Imaging Surveys and the Panoramic Survey Telescope and Rapid Response System Survey (Pan-STARSS). We find that, even when strict selection criteria are used, all catalogs still show a ~5% level of contamination from galaxy shreds. Our CNN gives a simple yet effective solution to clean galaxy catalogs from these contaminants.