arXiv:1906.09243 Abstract | arXiv Analytics

arXiv:1906.09243 [stat.ML]Abstract References Reviews Resources

On Tree-based Methods for Similarity Learning

Published 2019-06-21Version 1

In many situations, the choice of an adequate similarity measure or metric on the feature space dramatically determines the performance of machine learning methods. Building automatically such measures is the specific purpose of metric/similarity learning. In Vogel et al. (2018), similarity learning is formulated as a pairwise bipartite ranking problem: ideally, the larger the probability that two observations in the feature space belong to the same class (or share the same label), the higher the similarity measure between them. From this perspective, the ROC curve is an appropriate performance criterion and it is the goal of this article to extend recursive tree-based ROC optimization techniques in order to propose efficient similarity learning algorithms. The validity of such iterative partitioning procedures in the pairwise setting is established by means of results pertaining to the theory of U-processes and from a practical angle, it is discussed at length how to implement them by means of splitting rules specifically tailored to the similarity learning task. Beyond these theoretical/methodological contributions, numerical experiments are displayed and provide strong empirical evidence of the performance of the algorithmic approaches we propose.

Comments: 17 pages, 4 figures

Categories: stat.ML, cs.LG

Keywords: tree-based methods, efficient similarity learning algorithms, adequate similarity measure, feature space belong, feature space dramatically determines

Related articles: Most relevant | Search more

arXiv:1903.05179 [stat.ML] (Published 2019-03-12)

Unbiased Measurement of Feature Importance in Tree-Based Methods

Zhengze Zhou, Giles Hooker

arXiv:2204.13916 [stat.ML] (Published 2022-04-29)

A study of tree-based methods and their combination

Yinuo Zeng

arXiv:2004.07383 [stat.ML] (Published 2020-04-15)

Exploiting Categorical Structure Using Tree-Based Methods

Brian Lucena

arXiv Analytics

arXiv:1906.09243 [stat.ML]Abstract References Reviews Resources

On Tree-based Methods for Similarity Learning

Links

Toolbox

arXiv:1906.09243 [stat.ML]AbstractReferencesReviewsResources

On Tree-based Methods for Similarity Learning

Links

Toolbox

arXiv:1906.09243 [stat.ML]Abstract References Reviews Resources