TY - JOUR
T1 - Individualized early prediction of familial risk of dyslexia
T2 - A study of infant vocabulary development
AU - Chen, Ao
AU - Wijnen, Frank
AU - Koster, Charlotte
AU - Schnack, Hugo
PY - 2017/2/21
Y1 - 2017/2/21
N2 - We examined early vocabulary development in children at familial risk (FR) of dyslexia and typically developing (TD) children between 17 and 35 months of age. We trained a support vector machine to classify TD and FR using these vocabulary data at the individual level. The Dutch version of the McArthur-Bates Communicative Development Inventory (Words and Sentences) (N-CDI) was used to measure vocabulary development. We analyzed group-level differences for both total vocabulary as well as lexical classes: common nouns, predicates, and closed class words. The generalizability of the classification model was tested using cross-validation. At the group level, for both total vocabulary and the composites, the difference between TD and FR was most pronounced at 19-20 months, with FRs having lower scores. For the individual prediction, highest cross-validation accuracy (68%) was obtained at 19-20 months, with sensitivity (correctly classified FR) being 70% and specificity (correctly classified TD) being 67%. There is a sensitive window in which the difference between FR and TD is most evident. Machine learning methods are promising techniques for separating FR and TD children at an early age, before they start reading.
AB - We examined early vocabulary development in children at familial risk (FR) of dyslexia and typically developing (TD) children between 17 and 35 months of age. We trained a support vector machine to classify TD and FR using these vocabulary data at the individual level. The Dutch version of the McArthur-Bates Communicative Development Inventory (Words and Sentences) (N-CDI) was used to measure vocabulary development. We analyzed group-level differences for both total vocabulary as well as lexical classes: common nouns, predicates, and closed class words. The generalizability of the classification model was tested using cross-validation. At the group level, for both total vocabulary and the composites, the difference between TD and FR was most pronounced at 19-20 months, with FRs having lower scores. For the individual prediction, highest cross-validation accuracy (68%) was obtained at 19-20 months, with sensitivity (correctly classified FR) being 70% and specificity (correctly classified TD) being 67%. There is a sensitive window in which the difference between FR and TD is most evident. Machine learning methods are promising techniques for separating FR and TD children at an early age, before they start reading.
KW - Developmental trajectories
KW - Dyslexia
KW - Machine learning
KW - Predictions
KW - Vocabulary acquisition
UR - http://www.scopus.com/inward/record.url?scp=85014288402&partnerID=8YFLogxK
U2 - 10.3389/fpsyg.2017.00156
DO - 10.3389/fpsyg.2017.00156
M3 - Article
C2 - 28270778
AN - SCOPUS:85014288402
SN - 1664-1078
VL - 8
JO - Frontiers in Psychology
JF - Frontiers in Psychology
IS - FEB
M1 - 156
ER -