TY - JOUR
T1 - Mge-cluster
T2 - a reference-free approach for typing bacterial plasmids
AU - Arredondo-Alonso, Sergio
AU - Gladstone, Rebecca A
AU - Pöntinen, Anna K
AU - Gama, João A
AU - Schürch, Anita C
AU - Lanza, Val F
AU - Johnsen, Pål Jarle
AU - Samuelsen, Ørjan
AU - Tonkin-Hill, Gerry
AU - Corander, Jukka
N1 - Funding Information:
European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Actions [801 133 to S.A.-A., A.K.P.]; Trond Mohn Foundation [TMS2019TMT04 to A.K.P., R.A.G., Ø.S., P.J.J., J.C.]; European Research Council [742 158 to J.C.]; ZonMW (The Netherlands) [541 003 005 to A.C.S.].
Publisher Copyright:
© 2023 The Author(s).
PY - 2023/9/1
Y1 - 2023/9/1
N2 - Extrachromosomal elements of bacterial cells such as plasmids are notorious for their importance in evolution and adaptation to changing ecology. However, high-resolution population-wide analysis of plasmids has only become accessible recently with the advent of scalable long-read sequencing technology. Current typing methods for the classification of plasmids remain limited in their scope which motivated us to develop a computationally efficient approach to simultaneously recognize novel types and classify plasmids into previously identified groups. Here, we introduce
mge-cluster that can easily handle thousands of input sequences which are compressed using a unitig representation in a de Bruijn graph. Our approach offers a faster runtime than existing algorithms, with moderate memory usage, and enables an intuitive visualization, classification and clustering scheme that users can explore interactively within a single framework.
M
ge-cluster platform for plasmid analysis can be easily distributed and replicated, enabling a consistent labelling of plasmids across past, present, and future sequence collections. We underscore the advantages of our approach by analysing a population-wide plasmid data set obtained from the opportunistic pathogen
Escherichia coli, studying the prevalence of the colistin resistance gene
mcr-1.1 within the plasmid population, and describing an instance of resistance plasmid transmission within a hospital environment.
AB - Extrachromosomal elements of bacterial cells such as plasmids are notorious for their importance in evolution and adaptation to changing ecology. However, high-resolution population-wide analysis of plasmids has only become accessible recently with the advent of scalable long-read sequencing technology. Current typing methods for the classification of plasmids remain limited in their scope which motivated us to develop a computationally efficient approach to simultaneously recognize novel types and classify plasmids into previously identified groups. Here, we introduce
mge-cluster that can easily handle thousands of input sequences which are compressed using a unitig representation in a de Bruijn graph. Our approach offers a faster runtime than existing algorithms, with moderate memory usage, and enables an intuitive visualization, classification and clustering scheme that users can explore interactively within a single framework.
M
ge-cluster platform for plasmid analysis can be easily distributed and replicated, enabling a consistent labelling of plasmids across past, present, and future sequence collections. We underscore the advantages of our approach by analysing a population-wide plasmid data set obtained from the opportunistic pathogen
Escherichia coli, studying the prevalence of the colistin resistance gene
mcr-1.1 within the plasmid population, and describing an instance of resistance plasmid transmission within a hospital environment.
UR - http://www.scopus.com/inward/record.url?scp=85165974052&partnerID=8YFLogxK
U2 - 10.1093/nargab/lqad066
DO - 10.1093/nargab/lqad066
M3 - Article
C2 - 37435357
SN - 2631-9268
VL - 5
JO - NAR genomics and bioinformatics
JF - NAR genomics and bioinformatics
IS - 3
M1 - lqad066
ER -