arXiv Analytics

Sign in

arXiv:2311.08979 [cs.LG]AbstractReferencesReviewsResources

A Multimodal Dataset of 21,412 Recorded Nights for Sleep and Respiratory Research

Alon Diament, Maria Gorodetski, Adam Jankelow, Ayya Keshet, Tal Shor, Daphna Weissglas-Volkov, Hagai Rossman, Eran Segal

Published 2023-11-15Version 1

This study introduces a novel, rich dataset obtained from home sleep apnea tests using the FDA-approved WatchPAT-300 device, collected from 7,077 participants over 21,412 nights. The dataset comprises three levels of sleep data: raw multi-channel time-series from sensors, annotated sleep events, and computed summary statistics, which include 447 features related to sleep architecture, sleep apnea, and heart rate variability (HRV). We present reference values for Apnea/Hypopnea Index (AHI), sleep efficiency, Wake After Sleep Onset (WASO), and HRV sample entropy, stratified by age and sex. Moreover, we demonstrate that the dataset improves the predictive capability for various health related traits, including body composition, bone density, blood sugar levels and cardiovascular health. These results illustrate the dataset's potential to advance sleep research, personalized healthcare, and machine learning applications in biomedicine.

Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 14 pages
Categories: cs.LG, eess.SP
Related articles: Most relevant | Search more
arXiv:1903.11027 [cs.LG] (Published 2019-03-26)
nuScenes: A multimodal dataset for autonomous driving
arXiv:2212.08279 [cs.LG] (Published 2022-12-16)
Werewolf Among Us: A Multimodal Dataset for Modeling Persuasion Behaviors in Social Deduction Games
Bolin Lai et al.
arXiv:2406.04940 [cs.LG] (Published 2024-06-07)
CarbonSense: A Multimodal Dataset and Baseline for Carbon Flux Modelling