
arXiv:2202.00914 [cs.LG]

Lipschitz-constrained Unsupervised Skill Discovery

Seohong Park, Jongwook Choi, Jaekyeom Kim, Honglak Lee, Gunhee Kim

Published 2022-02-02 (Version 1)

We study the problem of unsupervised skill discovery, whose goal is to learn a set of diverse and useful skills with no external reward. A number of skill discovery methods have been proposed that maximize the mutual information (MI) between skills and states. However, we point out that their MI objectives usually prefer static skills to dynamic ones, which may hinder their applicability to downstream tasks. To address this issue, we propose Lipschitz-constrained Skill Discovery (LSD), which encourages the agent to discover more diverse, dynamic, and far-reaching skills. Another benefit of LSD is that its learned representation function can be utilized for solving goal-following downstream tasks even in a zero-shot manner, i.e., without further training or complex planning. Through experiments on various MuJoCo robotic locomotion and manipulation environments, we demonstrate that LSD outperforms previous approaches in terms of skill diversity, state space coverage, and performance on seven downstream tasks, including the challenging task of following multiple goals on Humanoid. Our code and videos are available at https://shpark.me/projects/lsd/.
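
As a rough illustration of the ideas summarized above, the sketch below gives one plausible reading of a Lipschitz-constrained skill reward and of zero-shot goal following: a state encoder phi kept approximately 1-Lipschitz via spectral normalization, an intrinsic reward that aligns the representation transition phi(s') - phi(s) with the skill vector z, and a skill chosen to point from phi(s) toward phi(goal). The network sizes, the use of spectral normalization, and the function names are illustrative assumptions, not the authors' implementation; refer to the paper and the code linked above for the exact objective.

# Illustrative sketch only (assumptions noted above), not the authors' code.
import torch
import torch.nn as nn
from torch.nn.utils.parametrizations import spectral_norm

state_dim, skill_dim = 17, 2  # hypothetical dimensions

# State encoder phi; spectral normalization bounds each linear layer's
# Lipschitz constant, and ReLU is 1-Lipschitz, so the composition
# approximates a 1-Lipschitz phi.
phi = nn.Sequential(
    spectral_norm(nn.Linear(state_dim, 256)), nn.ReLU(),
    spectral_norm(nn.Linear(256, skill_dim)),
)

def skill_reward(s, s_next, z):
    # Intrinsic reward: how far the representation moves in the
    # direction of the sampled skill vector z.
    with torch.no_grad():
        return ((phi(s_next) - phi(s)) * z).sum(dim=-1)

def zero_shot_skill(s, goal):
    # Goal following without further training or planning: choose the
    # skill that points from the current representation toward the goal's.
    with torch.no_grad():
        d = phi(goal) - phi(s)
        return d / (d.norm(dim=-1, keepdim=True) + 1e-8)

# Example usage on random tensors.
s, s_next, goal = (torch.randn(1, state_dim) for _ in range(3))
z = torch.randn(1, skill_dim)
print(skill_reward(s, s_next, z))
print(zero_shot_skill(s, goal))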

Related articles:
arXiv:2211.03782 [cs.LG] (Published 2022-11-07)
On minimal variations for unsupervised representation learning
arXiv:2309.17002 [cs.LG] (Published 2023-09-29)
Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks
Hao Chen et al.
arXiv:2310.15318 [cs.LG] (Published 2023-10-23)
HetGPT: Harnessing the Power of Prompt Tuning in Pre-Trained Heterogeneous Graph Neural Networks