TY - JOUR
T1 - Whole genome sequencing options for bacterial strain typing and epidemiologic analysis based on single nucleotide polymorphism versus gene-by-gene-based approaches
AU - Schürch, A. C.
AU - Arredondo-Alonso, S.
AU - Willems, R. J.L.
AU - Goering, R. V.
PY - 2018/4/1
Y1 - 2018/4/1
N2 - Background: Whole genome sequence (WGS)-based strain typing finds increasing use in the epidemiologic analysis of bacterial pathogens in both public health as well as more localized infection control settings. Aims: This minireview describes methodologic approaches that have been explored for WGS-based epidemiologic analysis and considers the challenges and pitfalls of data interpretation. Sources: Personal collection of relevant publications. Content: When applying WGS to study the molecular epidemiology of bacterial pathogens, genomic variability between strains is translated into measures of distance by determining single nucleotide polymorphisms in core genome alignments or by indexing allelic variation in hundreds to thousands of core genes, assigning types to unique allelic profiles. Interpreting isolate relatedness from these distances is highly organism specific, and attempts to establish species-specific cutoffs are unlikely to be generally applicable. In cases where single nucleotide polymorphism or core gene typing do not provide the resolution necessary for accurate assessment of the epidemiology of bacterial pathogens, inclusion of accessory gene or plasmid sequences may provide the additional required discrimination. Implications: As with all epidemiologic analysis, realizing the full potential of the revolutionary advances in WGS-based approaches requires understanding and dealing with issues related to the fundamental steps of data generation and interpretation.
AB - Background: Whole genome sequence (WGS)-based strain typing finds increasing use in the epidemiologic analysis of bacterial pathogens in both public health as well as more localized infection control settings. Aims: This minireview describes methodologic approaches that have been explored for WGS-based epidemiologic analysis and considers the challenges and pitfalls of data interpretation. Sources: Personal collection of relevant publications. Content: When applying WGS to study the molecular epidemiology of bacterial pathogens, genomic variability between strains is translated into measures of distance by determining single nucleotide polymorphisms in core genome alignments or by indexing allelic variation in hundreds to thousands of core genes, assigning types to unique allelic profiles. Interpreting isolate relatedness from these distances is highly organism specific, and attempts to establish species-specific cutoffs are unlikely to be generally applicable. In cases where single nucleotide polymorphism or core gene typing do not provide the resolution necessary for accurate assessment of the epidemiology of bacterial pathogens, inclusion of accessory gene or plasmid sequences may provide the additional required discrimination. Implications: As with all epidemiologic analysis, realizing the full potential of the revolutionary advances in WGS-based approaches requires understanding and dealing with issues related to the fundamental steps of data generation and interpretation.
KW - Accessory genome
KW - CgMLST
KW - Core genome
KW - Pan-genome
KW - Relatedness threshold
KW - SNP
KW - Whole genome sequencing
UR - http://www.scopus.com/inward/record.url?scp=85041353485&partnerID=8YFLogxK
U2 - 10.1016/j.cmi.2017.12.016
DO - 10.1016/j.cmi.2017.12.016
M3 - Article
AN - SCOPUS:85041353485
SN - 1198-743X
JO - Clinical Microbiology and Infection
JF - Clinical Microbiology and Infection
IS - 4
ER -