Skip to main navigation Skip to search Skip to main content

plasmaCHORD: A Machine Learning Approach to Distinguish Clonal Hematopoiesis-Derived Variants in Liquid Biopsies from Patients with Solid Tumors

  • Jenna V. Canzoniero
  • , Daniel Rabizadeh
  • , Ilias Ziakas
  • , Jaime Wehr
  • , Archana Balan
  • , Amna Jamali
  • , Blair V. Landon
  • , Lavanya Sivapalan
  • , Susan Scott
  • , Gavin Pereira
  • , Vincent K. Lam
  • , Christine L. Hann
  • , Christine M. Lovly
  • , Jessica Tao
  • , Patrick M. Forde
  • , Joseph C. Murray
  • , Mark Sausen
  • , Gerrit A. Meijer
  • , Geraldine R. Vink
  • , Remond J.A. Fijneman
  • Victor E. Velculescu, Jillian Phallen, Robert B. Scharpf, Valsamo Anagnostou,

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

PURPOSE: Targeted next-generation sequencing (NGS) of cell-free DNA (cfDNA) enables comprehensive molecular profiling and can guide the selection of genotype-targeted therapies. However, the detection of variants derived from clonal hematopoiesis (CH) is a significant confounder in liquid biopsies. EXPERIMENTAL DESIGN: Using a training cohort of 426 variants identified in cfDNA NGS from 225 patients with stage I to IV solid tumors, we developed plasma Clonal Hematopoiesis ORigin Detection (plasmaCHORD), a machine learning model that includes fragment-, variant-, and patient-level features to distinguish between tumor and CH origin for each variant detected by liquid biopsies. Model performance was assessed by comparison with the reference origin for each plasma variant determined from matched white blood cell and tumor NGS. Following the locking of the model parameters, we applied plasmaCHORD to an independent validation cohort of 1,418 plasma variants detected in 114 patients with metastatic cancers, as well as to cfDNA NGS from patients enrolled in a prospective clinical trial (NCT05585684). RESULTS: plasmaCHORD predicted tumor origin versus CH origin in the training set with high accuracy (AUC = 0.94). In the independent validation cohort, the locked model maintained similar overall accuracy (AUC = 0.9) and demonstrated significant improvement in accuracy for clinically significant genes. When applied to clinically challenging cases in the context of a precision oncology clinical trial, plasmaCHORD precisely determined variant origin, preventing mismatches with genotype-targeted therapies. CONCLUSIONS: plasmaCHORD, a multifeature machine learning model, can significantly enhance the ability to identify bona fide tumor variants in routine plasma-only NGS, addressing a critical need for implementing liquid biopsy-guided therapy by minimizing misinterpretation caused by CH.

Original languageEnglish
Pages (from-to)1729-1744
Number of pages16
JournalClinical cancer research : an official journal of the American Association for Cancer Research
Volume32
Issue number9
DOIs
Publication statusPublished - 1 May 2026

Fingerprint

Dive into the research topics of 'plasmaCHORD: A Machine Learning Approach to Distinguish Clonal Hematopoiesis-Derived Variants in Liquid Biopsies from Patients with Solid Tumors'. Together they form a unique fingerprint.

Cite this