TY - JOUR
T1 - An analysis-ready and quality controlled resource for pediatric brain white-matter research
AU - Richie-Halford, Adam
AU - Cieslak, Matthew
AU - Ai, Lei
AU - Caffarra, Sendy
AU - Covitz, Sydney
AU - Franco, Alexandre R.
AU - Karipidis, Iliana I.
AU - Kruper, John
AU - Milham, Michael
AU - Avelar-Pereira, Bárbara
AU - Roy, Ethan
AU - Sydnor, Valerie J.
AU - Yeatman, Jason D.
AU - Abbott, Nicholas J.
AU - Anderson, John A.E.
AU - Gagana, B.
AU - Bleile, Mary Lena
AU - Bloomfield, Peter S.
AU - Bottom, Vince
AU - Bourque, Josiane
AU - Boyle, Rory
AU - Brynildsen, Julia K.
AU - Calarco, Navona
AU - Castrellon, Jaime J.
AU - Chaku, Natasha
AU - Chen, Bosi
AU - Chopra, Sidhant
AU - Coffey, Emily B.J.
AU - Colenbier, Nigel
AU - Cox, Daniel J.
AU - Crippen, James Elliott
AU - Crouse, Jacob J.
AU - David, Szabolcs
AU - Leener, Benjamin De
AU - Delap, Gwyneth
AU - Deng, Zhi De
AU - Dugre, Jules Roger
AU - Eklund, Anders
AU - Ellis, Kirsten
AU - Ered, Arielle
AU - Farmer, Harry
AU - Faskowitz, Joshua
AU - Finch, Jody E.
AU - Flandin, Guillaume
AU - Flounders, Matthew W.
AU - Fonville, Leon
AU - Frandsen, Summer B.
AU - Garic, Dea
AU - Garrido-Vásquez, Patricia
AU - Gonzalez-Escamilla, Gabriel
N1 - Funding Information:
We would like to thank Anisha Keshavan for useful discussions of community science and web-based quality control and for her work on SwipesForScience. We would like to thank Adina S. Wagner, Yaroslav O. Halchenko, and Michael Hanke for their guidance in creating the HBN-POD2 Datalad dataset. We thank Samuel Buck Johnson for useful discussions of data quality for the HBN structural MRI data. This manuscript was prepared using a limited access dataset obtained from the Child Mind Institute Biobank, The Healthy Brain Network dataset. This manuscript reflects the views of the authors and does not necessarily reflect the opinions or views of the Child Mind Institute. This work was supported via BRAIN Initiative grant 1RF1MH121868-01 from the National Institutes of Mental Health. Additional support was provided by grant 1R01EB027585-01 from the National Institutes of Biomedical Imaging and Bioengineering (PI: Eleftherios Garyfallidis). Additional support was provided by R01MH120482 and the Penn/CHOP Lifespan Brain Institute.
Funding Information:
We would like to thank Anisha Keshavan for useful discussions of community science and web-based quality control and for her work on SwipesForScience. We would like to thank Adina S. Wagner, Yaroslav O. Halchenko, and Michael Hanke for their guidance in creating the HBN-POD2 Datalad dataset. We thank Samuel Buck Johnson for useful discussions of data quality for the HBN structural MRI data. This manuscript was prepared using a limited access dataset obtained from the Child Mind Institute Biobank, The Healthy Brain Network dataset. This manuscript reflects the views of the authors and does not necessarily reflect the opinions or views of the Child Mind Institute. This work was supported via BRAIN Initiative grant 1RF1MH121868-01 from the National Institutes of Mental Health. Additional support was provided by grant 1R01EB027585-01 from the National Institutes of Biomedical Imaging and Bioengineering (PI: Eleftherios Garyfallidis). Additional support was provided by R01MH120482 and the Penn/CHOP Lifespan Brain Institute.
Publisher Copyright:
© 2022, The Author(s).
PY - 2022/12
Y1 - 2022/12
N2 - We created a set of resources to enable research based on openly-available diffusion MRI (dMRI) data from the Healthy Brain Network (HBN) study. First, we curated the HBN dMRI data (N = 2747) into the Brain Imaging Data Structure and preprocessed it according to best-practices, including denoising and correcting for motion effects, susceptibility-related distortions, and eddy currents. Preprocessed, analysis-ready data was made openly available. Data quality plays a key role in the analysis of dMRI. To optimize QC and scale it to this large dataset, we trained a neural network through the combination of a small data subset scored by experts and a larger set scored by community scientists. The network performs QC highly concordant with that of experts on a held out set (ROC-AUC = 0.947). A further analysis of the neural network demonstrates that it relies on image features with relevance to QC. Altogether, this work both delivers resources to advance transdiagnostic research in brain connectivity and pediatric mental health, and establishes a novel paradigm for automated QC of large datasets.
AB - We created a set of resources to enable research based on openly-available diffusion MRI (dMRI) data from the Healthy Brain Network (HBN) study. First, we curated the HBN dMRI data (N = 2747) into the Brain Imaging Data Structure and preprocessed it according to best-practices, including denoising and correcting for motion effects, susceptibility-related distortions, and eddy currents. Preprocessed, analysis-ready data was made openly available. Data quality plays a key role in the analysis of dMRI. To optimize QC and scale it to this large dataset, we trained a neural network through the combination of a small data subset scored by experts and a larger set scored by community scientists. The network performs QC highly concordant with that of experts on a held out set (ROC-AUC = 0.947). A further analysis of the neural network demonstrates that it relies on image features with relevance to QC. Altogether, this work both delivers resources to advance transdiagnostic research in brain connectivity and pediatric mental health, and establishes a novel paradigm for automated QC of large datasets.
KW - Brain/diagnostic imaging
KW - Child
KW - Diffusion Magnetic Resonance Imaging/methods
KW - Humans
KW - Image Processing, Computer-Assisted/methods
KW - Neuroimaging
KW - White Matter/diagnostic imaging
UR - http://www.scopus.com/inward/record.url?scp=85139753890&partnerID=8YFLogxK
U2 - 10.1038/s41597-022-01695-7
DO - 10.1038/s41597-022-01695-7
M3 - Article
C2 - 36224186
AN - SCOPUS:85139753890
SN - 2052-4463
VL - 9
JO - Scientific data
JF - Scientific data
IS - 1
M1 - 616
ER -