arXiv:2004.07383 [stat.ML]
Exploiting Categorical Structure Using Tree-Based Methods
Published 2020-04-15, Version 1
Standard methods of using categorical variables as predictors either endow them with an ordinal structure or assume they have no structure at all. However, categorical variables often possess structure that is more complicated than a linear ordering can capture. We develop a mathematical framework for representing the structure of categorical variables and show how to generalize decision trees to make use of this structure. This approach is applicable to methods such as gradient boosted trees, which use a decision tree as the underlying learner. We show results on weather data to demonstrate the improvement yielded by this approach.
Comments: To appear in AISTATS 2020 Proceedings
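The abstract does not spell out the framework, so the following is only a rough sketch of one way a decision-tree split could respect categorical structure. It is not the paper's method. It assumes the structure is a simple hierarchy over category labels and restricts candidate splits to "everything under one hierarchy node versus the rest", scoring each by the sum of squared errors. The hierarchy, the labels, and the names `HIERARCHY`, `leaves_under`, `sse`, and `best_structured_split` are all hypothetical and chosen for illustration.

```python
from statistics import mean

# Hypothetical hierarchy over weather categories (an assumption for
# illustration; the paper's actual representation may differ).
HIERARCHY = {
    "any": ["precipitation", "dry"],
    "precipitation": ["rain", "snow"],
    "dry": ["clear", "cloudy"],
}


def leaves_under(node, hierarchy):
    """Return the set of leaf categories at or below `node`."""
    children = hierarchy.get(node)
    if not children:
        return {node}
    result = set()
    for child in children:
        result |= leaves_under(child, hierarchy)
    return result


def sse(values):
    """Sum of squared deviations from the mean (0.0 for an empty list)."""
    if not values:
        return 0.0
    m = mean(values)
    return sum((v - m) ** 2 for v in values)


def best_structured_split(categories, targets, hierarchy, root="any"):
    """Choose the hierarchy node whose leaf set best separates the targets.

    Each candidate split sends rows whose category lies under the node to
    one side and all other rows to the other side.
    """
    all_leaves = leaves_under(root, hierarchy)
    nodes = set(hierarchy) | {c for cs in hierarchy.values() for c in cs}
    best_node, best_cost = None, float("inf")
    for node in sorted(nodes):
        group = leaves_under(node, hierarchy)
        if group == all_leaves:
            continue  # the root does not split anything
        inside = [t for c, t in zip(categories, targets) if c in group]
        outside = [t for c, t in zip(categories, targets) if c not in group]
        if not inside or not outside:
            continue  # skip degenerate splits
        cost = sse(inside) + sse(outside)
        if cost < best_cost:
            best_node, best_cost = node, cost
    return best_node, best_cost


# Toy data (made up): the target is higher on precipitation days.
categories = ["rain", "snow", "clear", "cloudy", "rain", "clear"]
targets = [5.0, 6.0, 1.0, 2.0, 5.5, 1.5]
node, cost = best_structured_split(categories, targets, HIERARCHY)
```

On this toy data a split at an internal node ("dry" or, equivalently, "precipitation") scores better than any single-category split, which is the kind of grouping that neither an ordinal encoding nor an unstructured one-versus-rest split is guaranteed to offer.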
Related articles:
arXiv:1905.01413 [stat.ML] (Published 2019-05-04)
ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables
arXiv:2204.13916 [stat.ML] (Published 2022-04-29)
A study of tree-based methods and their combination
arXiv:2003.12127 [stat.ML] (Published 2020-03-26)
Gryffin: An algorithm for Bayesian optimization for categorical variables informed by physical intuition with applications to chemistry