Subgroup identification by recursive segmentation

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

A new modeling approach called ‘recursive segmentation’ is proposed to support the supervised exploration and identification of subgroups or clusters. It is based on the frameworks of recursive partitioning and the Patient Rule Induction Method (PRIM). Through combining these methods, recursive segmentation aims to exploit their respective strengths while reducing their weaknesses. Consequently, recursive segmentation can be applied in a very general way, that is in any (multivariate) regression, classification or survival (time-to-event) problem, using conditional inference, evolutionary learning or the CART algorithm, with predictor variables of any scale and with missing values. Furthermore, results of a synthetic example and a benchmark application study that comprises 26 data sets suggest that recursive segmentation achieves a competitive prediction accuracy and provides more accurate definitions of subgroups by models of less complexity as compared to recursive partitioning and PRIM. An application to the German Breast Cancer Study Group data demonstrates the improved interpretability and reliability of results produced by the new approach. The method is made publicly available through the R-package rseg (http://rseg.r-forge.r-project.org/).

Original languageEnglish
Pages (from-to)2864-2887
Number of pages24
JournalJournal of Applied Statistics
Volume45
Issue number15
DOIs
StatePublished - 18 Nov 2018

Keywords

  • CART
  • PRIM
  • Subgroup analysis
  • benchmarking
  • recursive partitioning
  • supervised clustering

Fingerprint

Dive into the research topics of 'Subgroup identification by recursive segmentation'. Together they form a unique fingerprint.

Cite this