Treffer: Leveraging Machine Learning and Robotic Process Automation to Identify and Convert Unstructured Colonoscopy Results Into Actionable Data: Proof-of-Concept Study.

Title:
Leveraging Machine Learning and Robotic Process Automation to Identify and Convert Unstructured Colonoscopy Results Into Actionable Data: Proof-of-Concept Study.
Authors:
Stevens ER; Department of Population Health, Grossman School of Medicine, New York University, 227 E30th Street, Rm 636, New York, 10016, United States, 1 6465012558.; Department of Health Informatics, Medical Center Information Technology, NYU Langone Health, New York, NY, United States., Hartman J; Department of Health Informatics, Medical Center Information Technology, NYU Langone Health, New York, NY, United States., Testa P; Department of Health Informatics, Medical Center Information Technology, NYU Langone Health, New York, NY, United States., Mansukhani A; MCIT Clinical Systems NYU Langone, New York, NY, United States., Monina C; MCIT Clinical Systems NYU Langone, New York, NY, United States., Shunk A; Department of Population Health, Grossman School of Medicine, New York University, 227 E30th Street, Rm 636, New York, 10016, United States, 1 6465012558.; School of Medicine, Tulane University, New Orleans, LA, United States., Ranson D; MCIT Clinical Systems NYU Langone, New York, NY, United States., Imberg Y; MCIT Clinical Systems NYU Langone, New York, NY, United States., Cote A; MCIT Clinical Systems NYU Langone, New York, NY, United States., Prabhu D; MCIT Clinical Systems NYU Langone, New York, NY, United States., Szerencsy A; Department of Health Informatics, Medical Center Information Technology, NYU Langone Health, New York, NY, United States.
Source:
JMIR medical informatics [JMIR Med Inform] 2025 Nov 20; Vol. 13, pp. e73504. Date of Electronic Publication: 2025 Nov 20.
Publication Type:
Journal Article
Language:
English
Journal Info:
Publisher: JMIR Publications Country of Publication: Canada NLM ID: 101645109 Publication Model: Electronic Cited Medium: Internet ISSN: 2291-9694 (Electronic) Linking ISSN: 22919694 NLM ISO Abbreviation: JMIR Med Inform Subsets: MEDLINE
Imprint Name(s):
Original Publication: Toronto : JMIR Publications, [2013]-
References:
BMC Med Inform Decis Mak. 2023 May 15;23(1):94. (PMID: 37189148)
JAMIA Open. 2020 Dec 05;3(4):619-627. (PMID: 33758798)
Clin Transl Gastroenterol. 2024 Dec 01;15(12):e00785. (PMID: 39665587)
Gastroenterology. 2020 Jan;158(2):418-432. (PMID: 31394083)
Can J Gastroenterol Hepatol. 2016;2016:9423142. (PMID: 27446877)
JAMA. 2013 Apr 3;309(13):1351-2. (PMID: 23549579)
Am Fam Physician. 2021 Mar 1;103(5):314-316. (PMID: 33630559)
Health Informatics J. 2022 Jan-Mar;28(1):14604582211058081. (PMID: 34986681)
CA Cancer J Clin. 2023 May-Jun;73(3):233-254. (PMID: 36856579)
Hematol Oncol Clin North Am. 2022 Jun;36(3):449-470. (PMID: 35577711)
Gastroenterology. 2020 Mar;158(4):1131-1153.e5. (PMID: 32044092)
Clin Gastroenterol Hepatol. 2013 Jun;11(6):689-94. (PMID: 23313839)
Gastroenterology. 2021 Aug;161(2):701-711. (PMID: 34334168)
JAMA Intern Med. 2022 May 1;182(5):564-566. (PMID: 35344006)
JAMA. 2022 Aug 9;328(6):523-524. (PMID: 35834286)
Best Pract Res Clin Gastroenterol. 2017 Aug;31(4):419-424. (PMID: 28842051)
JAMA Netw Open. 2022 Sep 1;5(9):e2234194. (PMID: 36173627)
Health Technol (Berl). 2021;11(2):349-357. (PMID: 33520588)
J Gen Pract (Los Angel). 2014 Jan 5;2:. (PMID: 25411657)
J Clin Transl Sci. 2022 Dec 14;7(1):e29. (PMID: 36845316)
BMC Med Inform Decis Mak. 2013 Aug 09;13:86. (PMID: 24070335)
Sci Rep. 2024 Nov 5;14(1):26783. (PMID: 39500759)
Contributed Indexing:
Keywords: RPA; artificial intelligence; automation; colonoscopy; colorectal cancer; machine learning; robotic process automation
Entry Date(s):
Date Created: 20251120 Date Completed: 20251120 Latest Revision: 20251123
Update Code:
20251123
PubMed Central ID:
PMC12634012
DOI:
10.2196/73504
PMID:
41264858
Database:
MEDLINE

Weitere Informationen

Background: With rising patient volumes and a focus on quality, our health system had the objective to create a more efficient way to ensure accurate documentation of colorectal cancer (CRC) screening intervals from inbound colonoscopy reports to ensure timely follow-up. We developed an integrated end-to-end workflow solution using machine learning (ML) and robotic process automation (RPA) to extract and update electronic health record (EHR) follow-up dates from unstructured data.
Objective: This study aimed to automate data extraction from external, free-text colonoscopy reports to identify and document recommended follow-up dates for CRC screening in structured EHR fields.
Methods: As proof of concept, we outline the process development, validity, and implementation of an approach that integrates available tools to automate data retrieval and entry within the EHR of a large academic health system. The health system uses Epic Systems as its EHR platform, and the ML model used was trained on health system patient colonoscopy reports. This proof-of-concept process study consisted of six stages: (1) identification of gaps in documenting recommendations for follow-up CRC screening from external colonoscopy reports, (2) defining process objectives, (3) identification of technologies, (4) creation of process architecture, (5) process validation, and (6) health system-wide implementation. A chart review was performed to validate process outcomes and estimate impact.
Results: We developed an automated process with 3 primary steps leveraging ML and RPA to create a fully orchestrated workflow to update CRC screening recall dates based on colonoscopy reports received from external sources. Process validity was assessed with 690 scanned colonoscopy reports. During process validation, the overall automated process achieved an accuracy of 80.7% (557/690, 95% CI 77.8%-83.7%) for correctly identifying the presence or absence of a valid follow-up date and a follow-up date false negative identification rate of 32.9% (130/395, 95% CI 29.4%-36.4%). From the organization-wide implementation to go-live until December 31, 2024, the system processed 16,563 external colonoscopy reports. Of these, 35.3% (5841/16,563) had a follow-up date meeting the relevant ML model threshold and thus were identified as ready for RPA processing.
Conclusions: Implementation of an automated workflow to extract and update CRC screening follow-up dates from colonoscopy reports is feasible and has the potential to improve accuracy in patient recall while reducing documentation burden. By standardizing data ingestion, extending this approach to various unstructured data types can address deficiencies in structured EHR documentation and solve for a lack of data integration and reporting for quality measures. Automated workflows leveraging ML and RPA offer practical solutions to overcome interoperability challenges and the use of unstructured data within health care systems.
(© Elizabeth R Stevens, Jager Hartman, Paul Testa, Ajay Mansukhani, Casey Monina, Amelia Shunk, David Ranson, Yana Imberg, Ann Cote, Dinesha Prabhu, Adam Szerencsy. Originally published in JMIR Medical Informatics (https://medinform.jmir.org).)