Advancing Yeast Identification Using High-Throughput DNA Barcode Data From a Curated Culture Collection

  • Duong Vu*
  • , Michel de Vries
  • , Bert Gerrits van den Ende
  • , Jos Houbraken
  • , R Henrik Nilsson
  • , Balázs Brankovics
  • , Magarita Hernández-Restrepo
  • , Johannes Z Groenewald
  • , Pedro W Crous
  • , Ferry Hagen
  • , Gerard J M Verkley
  • , Marizeth Groenewald
  • *Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Yeast identification is essential in fields ranging from microbiology and biotechnology to food science and medicine. While DNA barcoding has become the standard for identifying cultured strains, environmental DNA (eDNA) metabarcoding has revolutionised microbial community profiling, providing deeper insights into yeast communities across diverse ecosystems. A major challenge in DNA (meta)barcoding remains the limited availability of high-quality reference sequences, which are critical for accurate species identification and comprehensive taxonomic profiling of both environmental and clinical samples. To address this gap, the Westerdijk Fungal Biodiversity Institute (WI) launched a DNA barcoding initiative in 2006 to generate high-quality, often type-derived ITS and LSU barcodes for all ~100,000 fungal strains preserved in the CBS culture collection, including approximately 15,000 yeasts. Building on the yeast barcode dataset released in 2016, we now present an expanded set of 2856 ITS and 3815 LSU sequences, representing 911 and 1137 yeast species, respectively. Notably, 27%-29% of these sequences are derived from ex-type cultures. Using both newly generated and previously published barcodes, we assess the taxonomic resolution of commonly used yeast metabarcoding markers (ITS, ITS1, ITS2 and LSU) and propose marker-specific similarity cutoffs for different yeast taxonomic groups. These results provide actionable guidance for marker selection and improve the interpretation of metabarcoding data. We further demonstrate the impact of well-curated reference databases with up-to-date taxonomy by reanalyzing Human Microbiome Project data, revealing how diet and environment shape the gut mycobiota.

Original languageEnglish
Article numbere70082
JournalMolecular Ecology Resources
Volume26
Issue number1
DOIs
Publication statusPublished - Jan 2026
Externally publishedYes

Keywords

  • DNA Barcoding, Taxonomic/methods
  • DNA, Fungal/genetics
  • DNA, Ribosomal Spacer/genetics
  • High-Throughput Nucleotide Sequencing/methods
  • Sequence Analysis, DNA
  • Yeasts/classification

Fingerprint

Dive into the research topics of 'Advancing Yeast Identification Using High-Throughput DNA Barcode Data From a Curated Culture Collection'. Together they form a unique fingerprint.

Cite this