arXiv Analytics

arXiv:2212.12952 [cs.CV]

Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program

Tiange Luo, Honglak Lee, Justin Johnson

Published 2022-12-25, Version 1

3D shapes have complementary abstractions, from low-level geometry to part-based hierarchies to languages, which convey different levels of information. This paper presents a unified framework to translate between pairs of shape abstractions: $\textit{Text}$ $\Longleftrightarrow$ $\textit{Point Cloud}$ $\Longleftrightarrow$ $\textit{Program}$. We propose $\textbf{Neural Shape Compiler}$ to model the abstraction transformation as a conditional generation process. It converts 3D shapes of three abstract types into a unified discrete shape code, transforms each shape code into the code of other abstract types through the proposed $\textit{ShapeCode Transformer}$, and decodes them to output the target shape abstraction. Point Cloud code is obtained in a class-agnostic way by the proposed $\textit{Point}$VQVAE. On Text2Shape, ShapeGlot, ABO, Genre, and Program Synthetic datasets, Neural Shape Compiler shows strengths in $\textit{Text}$ $\Longrightarrow$ $\textit{Point Cloud}$, $\textit{Point Cloud}$ $\Longrightarrow$ $\textit{Text}$, $\textit{Point Cloud}$ $\Longrightarrow$ $\textit{Program}$, and Point Cloud Completion tasks. Additionally, Neural Shape Compiler benefits from jointly training on all heterogeneous data and tasks.
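The abstract describes mapping shapes to a discrete shape code. As a rough illustration of what a discrete code can look like, the sketch below shows generic nearest-neighbor vector quantization, the lookup step commonly used in VQ-VAE-style models. It is not the paper's PointVQVAE: the function names `quantize` and `dequantize`, the toy 2D features, and the three-entry codebook are all assumptions made for this example.

```python
import math

def quantize(features, codebook):
    """Map each feature vector to the index of its nearest codebook entry.

    Hypothetical helper for illustration; uses plain Euclidean distance.
    """
    codes = []
    for f in features:
        dists = [math.dist(f, c) for c in codebook]
        codes.append(dists.index(min(dists)))
    return codes

def dequantize(codes, codebook):
    """Look up the codebook vector for each discrete code index."""
    return [codebook[i] for i in codes]

# Toy data (assumed): three codebook entries and three 2D feature vectors.
codebook = [(0.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
features = [(0.1, -0.1), (0.9, 1.2), (0.2, 0.8)]

codes = quantize(features, codebook)
print(codes)                        # [0, 1, 2]
print(dequantize(codes, codebook))  # [(0.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
```

In a learned model the features would come from an encoder and the codebook would be trained. Here both are fixed by hand so the index lookup is easy to follow.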

Comments: project page: https://tiangeluo.github.io/projectpages/shapecompiler.html
Categories: cs.CV, cs.AI
Related articles:
arXiv:2112.03549 [cs.CV] (Published 2021-12-07, updated 2022-03-20)
GaTector: A Unified Framework for Gaze Object Prediction
arXiv:2309.16126 [cs.CV] (Published 2023-09-28)
UVL: A Unified Framework for Video Tampering Localization
arXiv:2403.15705 [cs.CV] (Published 2024-03-23)
UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation