arXiv:1906.01998 Abstract | arXiv Analytics

arXiv:1906.01998 [cs.LG]Abstract References Reviews Resources

The Secrets of Machine Learning: Ten Things You Wish You Had Known Earlier to be More Effective at Data Analysis

Published 2019-06-04Version 1

Despite the widespread usage of machine learning throughout organizations, there are some key principles that are commonly missed. In particular: 1) There are at least four main families for supervised learning: logical modeling methods, linear combination methods, case-based reasoning methods, and iterative summarization methods. 2) For many application domains, almost all machine learning methods perform similarly (with some caveats). Deep learning methods, which are the leading technique for computer vision problems, do not maintain an edge over other methods for most problems (and there are reasons why). 3) Neural networks are hard to train and weird stuff often happens when you try to train them. 4) If you don't use an interpretable model, you can make bad mistakes. 5) Explanations can be misleading and you can't trust them. 6) You can pretty much always find an accurate-yet-interpretable model, even for deep neural networks. 7) Special properties such as decision making or robustness must be built in, they don't happen on their own. 8) Causal inference is different than prediction (correlation is not causation). 9) There is a method to the madness of deep neural architectures, but not always. 10) It is a myth that artificial intelligence can do anything.

Comments: INFORMS TutORial 2019

Categories: cs.LG, stat.ML

Keywords: data analysis, linear combination methods, machine learning methods perform, machine learning throughout organizations, computer vision problems

Related articles: Most relevant | Search more

arXiv:2008.03452 [cs.LG] (Published 2020-08-08)

Partitioning signal classes using transport transforms for data analysis and machine learning

Akram Aldroubi, Shiying Li, Gustavo K. Rohde

arXiv:2306.04066 [cs.LG] (Published 2023-06-06)

Intelligent sampling for surrogate modeling, hyperparameter optimization, and data analysis

Chandrika Kamath

arXiv:2202.12875 [cs.LG] (Published 2022-02-25)

DataLab: A Platform for Data Analysis and Intervention

Yang Xiao et al.

arXiv Analytics

arXiv:1906.01998 [cs.LG]Abstract References Reviews Resources

The Secrets of Machine Learning: Ten Things You Wish You Had Known Earlier to be More Effective at Data Analysis

Links

Toolbox

arXiv:1906.01998 [cs.LG]AbstractReferencesReviewsResources

The Secrets of Machine Learning: Ten Things You Wish You Had Known Earlier to be More Effective at Data Analysis

Links

Toolbox

arXiv:1906.01998 [cs.LG]Abstract References Reviews Resources