When the whole is greater than the sum of its parts: why machine learning and conventional statistics are complementary for predicting future health outcomes

  • Roemer J Janse*
  • , Ameen Abu-Hanna
  • , Iacopo Vagliano
  • , Vianda S Stel
  • , Kitty J Jager
  • , Giovanni Tripepi
  • , Carmine Zoccali
  • , Friedo W Dekker
  • , Merel van Diepen
  • *Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

Abstract

An artificial intelligence boom is currently ongoing, mainly due to large language models, leading to significant interest in artificial intelligence and subsequently also in machine learning (ML). One area where ML is often applied, prediction modelling, has also long been a focus of conventional statistics. As a result, multiple studies have aimed to prove superiority of one of the two scientific disciplines over the other. However, we argue that ML and conventional statistics should not be competing fields. Instead, both fields are intertwined and complementary to each other. To illustrate this, we discuss some essentials of prediction modelling, elaborate on prediction modelling using techniques from conventional statistics, and explain prediction modelling using common ML techniques such as support vector machines, random forests, and artificial neural networks. We then showcase that conventional statistics and ML are in fact similar in many aspects, including underlying statistical concepts and methods used in model development and validation. Finally, we argue that conventional statistics and ML can and should be seen as a single integrated field. This integration can further improve prediction modelling for both disciplines (e.g. regarding fairness and reporting standards) and will support the ultimate goal: developing the best performing prediction models for the patient and healthcare provider.

Original languageEnglish
Article numberfaf059
JournalClinical Kidney Journal
Volume18
Issue number4
DOIs
Publication statusPublished - Apr 2025
Externally publishedYes

Fingerprint

Dive into the research topics of 'When the whole is greater than the sum of its parts: why machine learning and conventional statistics are complementary for predicting future health outcomes'. Together they form a unique fingerprint.

Cite this