In order to accelerate research on pediatric sleep and its connection to health, Nationwide Children's Hospital (NCH) and Carnegie Mellon University (CMU) introduce the NCH Sleep DataBank. This dataset has 3,984 pediatric sleep studies on 3,673 unique patients conducted at NCH in Columbus, Ohio, USA between 2017 and 2019, along with the patients' longitudinal clinical data. The published polysomnography (PSG) contains the patient's physiological signals as well as the technician’s assessment of the sleep stages and descriptions of additional irregularities.
The novelties of this dataset include:
The NCH Sleep DataBank is a valuable resource for advancing automatic sleep scoring and real-time sleep disorder prediction, among many other potential scientific discoveries. Accompanying code in Python to assist users in interacting with the dataset is published on GitHub.
The NCH Sleep DataBank is only available for non-commercial use.
When using this dataset, please cite the following:
Zhang GQ, Cui L, Mueller R, Tao S, Kim M, Rueschman M, Mariani S, Mobley D, Redline S. The National Sleep Research Resource: towards a sleep data commons. J Am Med Inform Assoc. 2018 Oct 1;25(10):1351-1358. doi: 10.1093/jamia/ocy064. PMID: 29860441; PMCID: PMC6188513.
Lee H, Li B, DeForte S, Splaingard ML, Huang Y, Chi Y, Linwood SL. A large collection of real-world pediatric sleep studies. Sci Data. 2022 Jul 19;9(1):421. doi: 10.1038/s41597-022-01545-6. PMID: 35853958; PMCID: PMC9296671.
Please include the following text in the Acknowledgements:
NCH Sleep DataBank was supported by the National Institute of Biomedical Imaging and Bioengineering of the National Institutes of Health under Award Number R01EB025018. The National Sleep Research Resource was supported by the U.S. National Institutes of Health, National Heart Lung and Blood Institute (R24 HL114473, 75N92019R002).
Covariate datasets derived from the original health_data files. The NSRR team recommends using the
nchsdb-dataset-harmonized dataset, which contains variables (e.g., nsrr_age, nsrr_bmi) that match other NSRR harmonized datasets.
Data (CSV) from the NCH clinical data warehouse. Click here for an overview of the file formats and contents.
Raw physiological data (EDF) and annotations (TSV) from overnight polysomnography.