arXiv:1906.01998 [cs.LG]

The Secrets of Machine Learning: Ten Things You Wish You Had Known Earlier to be More Effective at Data Analysis

Cynthia Rudin, David Carlson

Published 2019-06-04, Version 1

Despite the widespread usage of machine learning throughout organizations, there are some key principles that are commonly missed. In particular: 1) There are at least four main families for supervised learning: logical modeling methods, linear combination methods, case-based reasoning methods, and iterative summarization methods. 2) For many application domains, almost all machine learning methods perform similarly (with some caveats). Deep learning methods, which are the leading technique for computer vision problems, do not maintain an edge over other methods for most problems (and there are reasons why). 3) Neural networks are hard to train and weird stuff often happens when you try to train them. 4) If you don't use an interpretable model, you can make bad mistakes. 5) Explanations can be misleading and you can't trust them. 6) You can pretty much always find an accurate-yet-interpretable model, even for deep neural networks. 7) Special properties such as decision making or robustness must be built in; they don't happen on their own. 8) Causal inference is different from prediction (correlation is not causation). 9) There is a method to the madness of deep neural architectures, but not always. 10) It is a myth that artificial intelligence can do anything.

Related articles:
arXiv:2008.03452 [cs.LG] (Published 2020-08-08)
Partitioning signal classes using transport transforms for data analysis and machine learning
arXiv:2306.04066 [cs.LG] (Published 2023-06-06)
Intelligent sampling for surrogate modeling, hyperparameter optimization, and data analysis
arXiv:2202.12875 [cs.LG] (Published 2022-02-25)
DataLab: A Platform for Data Analysis and Intervention
Yang Xiao et al.