arXiv:1805.00811 Abstract | arXiv Analytics

arXiv:1805.00811 [stat.ML]Abstract References Reviews Resources

An Evaluation of Classification and Outlier Detection Algorithms

Published 2018-05-02Version 1

This paper evaluates algorithms for classification and outlier detection accuracies in temporal data. We focus on algorithms that train and classify rapidly and can be used for systems that need to incorporate new data regularly. Hence, we compare the accuracy of six fast algorithms using a range of well-known time-series datasets. The analyses demonstrate that the choice of algorithm is task and data specific but that we can derive heuristics for choosing. Gradient Boosting Machines are generally best for classification but there is no single winner for outlier detection though Gradient Boosting Machines (again) and Random Forest are better. Hence, we recommend running evaluations of a number of algorithms using our heuristics.

Categories: stat.ML, cs.LG

Keywords: outlier detection algorithms, classification, evaluation, gradient boosting machines, well-known time-series datasets

Related articles: Most relevant | Search more

arXiv:2211.02912 [stat.ML] (Published 2022-11-05)

New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound

Arushi Gupta, Nikunj Saunshi, Dingli Yu, Kaifeng Lyu, Sanjeev Arora

arXiv:1803.00276 [stat.ML] (Published 2018-03-01)

Model-Based Clustering and Classification of Functional Data

Faicel Chamroukhi, Hien D. Nguyen

arXiv:1808.03064 [stat.ML] (Published 2018-08-09)

Gradient and Newton Boosting for Classification and Regression

Fabio Sigrist

arXiv Analytics

arXiv:1805.00811 [stat.ML]Abstract References Reviews Resources

An Evaluation of Classification and Outlier Detection Algorithms

Links

Toolbox

arXiv:1805.00811 [stat.ML]AbstractReferencesReviewsResources

An Evaluation of Classification and Outlier Detection Algorithms

Links

Toolbox

arXiv:1805.00811 [stat.ML]Abstract References Reviews Resources