Treffer: Using machine learning to predict postoperative complications of total shoulder arthroplasty.

Title:
Using machine learning to predict postoperative complications of total shoulder arthroplasty.
Authors:
Powell, Carter M.1 (AUTHOR), Newton, William N.1 (AUTHOR), Reis, Robert J.1 (AUTHOR), Moore, John W.1 (AUTHOR), Rogalski, Brandon L.1 (AUTHOR), Eichinger, Josef K.1 (AUTHOR), Friedman, Richard J.1 (AUTHOR) friedman@musc.edu
Source:
Seminars in Arthroplasty: JSES. Jun2025, Vol. 35 Issue 2, p203-209. 7p.
Database:
Supplemental Index

Weitere Informationen

Previous efforts to use machine learning to predict complications following primary total shoulder arthroplasty (TSA) have shown promise, but the clinical significance of such predictive models has been limited by inadequate sample sizes and short (∼30 day) follow-up periods. The Nationwide Readmissions Database, with a large sample size and longer follow-up period, has the potential to reduce the noise of previous modeling efforts. The purpose of this study is to evaluate the accuracy and effectiveness of 4 different models for predicting 180-day complications, extended length of stay (LOS), and mechanical failures in patients undergoing primary TSA. The Nationwide Readmissions Database was queried for patients who underwent TSA from 2016 to 2020. Primary outcomes were complications within 180 days, extended LOS (defined as >2 days), and mechanical failure. For each outcome, 4 models were created using Python v3.9. Models included a weighted logistic regression, random forest classifier, gradient boosting classifier, and an artificial neural network. Model performance was assessed using accuracy, area under the receiver operating characteristic curve (area under the curve [AUC]), sensitivity, positive predictive value (PPV), and F1 score. A total of 178,003 patients underwent primary TSA from 2016 to 2020. For predicting 180-day complications, gradient-boosted classification had the highest discriminative ability and sensitivity (accuracy: 0.69, AUC: 0.71, sensitivity: 0.59, PPV: 0.21, and F1: 0.31). For predicting extended LOS, an artificial neural network proved most effective (accuracy: 0.79, AUC: 0.82, sensitivity: 0.67, PPV: 0.43, and F1: 0.52; Table II). For mechanical complications, all models were equally poor at predicting complications. Machine learning has the potential to accurately predict rare outcomes from heterogenous data; however, the quality of predictive models is dependent on the quality of the input data. Although machine-learning models are superior to simpler methods at predicting certain outcomes, such as extended LOS, they currently lack the sensitivity and PPV to be clinically significant. [ABSTRACT FROM AUTHOR]