Treffer: Random forest for dynamic risk prediction of recurrent events: a pseudo-observation approach.

Title:
Random forest for dynamic risk prediction of recurrent events: a pseudo-observation approach.
Authors:
Loe A; Department of Biostatistics, University of Michigan, 1415 Washington Heights, Ann Arbor, MI 48109, United States., Murray S; Department of Biostatistics, University of Michigan, 1415 Washington Heights, Ann Arbor, MI 48109, United States., Wu Z; Department of Biostatistics, University of Michigan, 1415 Washington Heights, Ann Arbor, MI 48109, United States.; Michigan Institute for Data and AI in Society, University of Michigan, 500 Church Street, Ann Arbor, MI 48109, United States.
Source:
Biostatistics (Oxford, England) [Biostatistics] 2024 Dec 31; Vol. 26 (1).
Publication Type:
Journal Article
Language:
English
Journal Info:
Publisher: Oxford University Press Country of Publication: England NLM ID: 100897327 Publication Model: Print Cited Medium: Internet ISSN: 1468-4357 (Electronic) Linking ISSN: 14654644 NLM ISO Abbreviation: Biostatistics Subsets: MEDLINE
Imprint Name(s):
Original Publication: Oxford : Oxford University Press
Grant Information:
University of Michigan
Contributed Indexing:
Keywords: censored data; longitudinal data; pseudo-observations; random forest; recurrent events
Entry Date(s):
Date Created: 20250314 Date Completed: 20250513 Latest Revision: 20250513
Update Code:
20250513
DOI:
10.1093/biostatistics/kxaf007
PMID:
40083192
Database:
MEDLINE

Weitere Informationen

Recurrent events are common in clinical, healthcare, social, and behavioral studies, yet methods for dynamic risk prediction of these events are limited. To overcome some long-standing challenges in analyzing censored recurrent event data, a recent regression analysis framework constructs a censored longitudinal dataset consisting of times to the first recurrent event in multiple pre-specified follow-up windows of length $ \tau $(XMT models). Traditional regression models struggle with nonlinear and multiway interactions, with success depending on the skill of the statistical programmer. With a staggering number of potential predictors being generated from genetic, -omic, and electronic health records sources, machine learning approaches such as the random forest regression are growing in popularity, as they can nonparametrically incorporate information from many predictors with nonlinear and multiway interactions involved in prediction. In this article, we (i) develop a random forest approach for dynamically predicting probabilities of remaining event-free during a subsequent $ \tau $-duration follow-up period from a reconstructed censored longitudinal data set, (ii) modify the XMT regression approach to predict these same probabilities, subject to the limitations that traditional regression models typically have, and (iii) demonstrate how to incorporate patient-specific history of recurrent events for prediction in settings where this information may be partially missing. We show the increased ability of our random forest algorithm for predicting the probability of remaining event-free over a $ \tau $-duration follow-up window when compared to our modified XMT method for prediction in settings where association between predictors and recurrent event outcomes is complex in nature. We also show the importance of incorporating past recurrent event history in prediction algorithms when event times are correlated within a subject. The proposed random forest algorithm is demonstrated using recurrent exacerbation data from the trial of Azithromycin for the Prevention of Exacerbations of Chronic Obstructive Pulmonary Disease.
(© The Author(s) 2025. Published by Oxford University Press. All rights reserved. For commercial re-use, please contact reprints@oup.com for reprints and translation rights for reprints. All other permissions can be obtained through our RightsLink service via the Permissions link on the article page on our site—for further information please contact journals.permissions@oup.com.)