arXiv Analytics

Sign in

arXiv:1402.4653 [stat.ML]AbstractReferencesReviewsResources

Retrieval of Experiments by Efficient Estimation of Marginal Likelihood

Sohan Seth, John Shawe-Taylor, Samuel Kaski

Published 2014-02-19Version 1

We study the task of retrieving relevant experiments given a query experiment. By experiment, we mean a collection of measurements from a set of `covariates' and the associated `outcomes'. While similar experiments can be retrieved by comparing available `annotations', this approach ignores the valuable information available in the measurements themselves. To incorporate this information in the retrieval task, we suggest employing a retrieval metric that utilizes probabilistic models learned from the measurements. We argue that such a metric is a sensible measure of similarity between two experiments since it permits inclusion of experiment-specific prior knowledge. However, accurate models are often not analytical, and one must resort to storing posterior samples which demands considerable resources. Therefore, we study strategies to select informative posterior samples to reduce the computational load while maintaining the retrieval performance. We demonstrate the efficacy of our approach on simulated data with simple linear regression as the models, and real world datasets.

Related articles: Most relevant | Search more
arXiv:1912.12945 [stat.ML] (Published 2019-12-30)
Localized Debiased Machine Learning: Efficient Estimation of Quantile Treatment Effects, Conditional Value at Risk, and Beyond
arXiv:2003.12408 [stat.ML] (Published 2020-03-27)
On the role of surrogates in the efficient estimation of treatment effects with limited outcome data
arXiv:2010.09443 [stat.ML] (Published 2020-10-19)
Efficient Estimation and Evaluation of Prediction Rules in Semi-Supervised Settings under Stratified Sampling