Skip to main navigation Skip to search Skip to main content

gplasCC: classification and recovery of plasmids from short-read sequencing data for any bacterial species

  • Julian A. Paganini
  • , Jesse J. Kerkvliet*
  • , Oscar Jordan
  • , Gijs Teunis
  • , Nienke L. Plantinga
  • , Rodrigo Meneses
  • , Rob J.L. Willems
  • , Sergio Arredondo-Alonso
  • , Anita C. Schürch*
  • *Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

1 Downloads (Pure)

Abstract

Plasmids play a pivotal role in the spread of antibiotic resistance genes. Accurately reconstructing plasmids often requires long-read sequencing, but bacterial genomic data in publicly accessible repositories have historically been derived from short-read sequencing technology. We recently presented an approach for recovering Escherichia coli antimicrobial resistance plasmids using Illumina short reads. This method consisted of combining a robust binary classification tool named plasmidEC with gplas2, which is a tool that makes use of features of the assembly graph to bin predicted plasmid contigs into individual plasmids. Here, we developed plasmidCC, an upgrade from plasmidEC, capable of classifying plasmid contigs using Centrifuge databases. We have developed seven plasmidCC databases in addition to the database for E. coli: six species-specific models (Acinetobacter baumannii, Enterococcus faecium, Enterococcus faecalis, Klebsiella pneumoniae, Staphylococcus aureus, and Salmonella enterica) and one species-independent model for less frequently studied bacterial species. We combined these models with gplasCC to recover plasmids from >100 bacterial species. This approach allows comprehensive analysis of the wealth of bacterial short-read sequencing data available in public repositories and advances our understanding of microbial plasmids.

Original languageEnglish
Article numberlqag028
JournalNAR genomics and bioinformatics
Volume8
Issue number1
DOIs
Publication statusPublished - 1 Mar 2026

Fingerprint

Dive into the research topics of 'gplasCC: classification and recovery of plasmids from short-read sequencing data for any bacterial species'. Together they form a unique fingerprint.

Cite this