Treffer: Leveraging Machine Learning and Robotic Process Automation to Identify and Convert Unstructured Colonoscopy Results Into Actionable Data: Proof-of-Concept Study.
JAMIA Open. 2020 Dec 05;3(4):619-627. (PMID: 33758798)
Clin Transl Gastroenterol. 2024 Dec 01;15(12):e00785. (PMID: 39665587)
Gastroenterology. 2020 Jan;158(2):418-432. (PMID: 31394083)
Can J Gastroenterol Hepatol. 2016;2016:9423142. (PMID: 27446877)
JAMA. 2013 Apr 3;309(13):1351-2. (PMID: 23549579)
Am Fam Physician. 2021 Mar 1;103(5):314-316. (PMID: 33630559)
Health Informatics J. 2022 Jan-Mar;28(1):14604582211058081. (PMID: 34986681)
CA Cancer J Clin. 2023 May-Jun;73(3):233-254. (PMID: 36856579)
Hematol Oncol Clin North Am. 2022 Jun;36(3):449-470. (PMID: 35577711)
Gastroenterology. 2020 Mar;158(4):1131-1153.e5. (PMID: 32044092)
Clin Gastroenterol Hepatol. 2013 Jun;11(6):689-94. (PMID: 23313839)
Gastroenterology. 2021 Aug;161(2):701-711. (PMID: 34334168)
JAMA Intern Med. 2022 May 1;182(5):564-566. (PMID: 35344006)
JAMA. 2022 Aug 9;328(6):523-524. (PMID: 35834286)
Best Pract Res Clin Gastroenterol. 2017 Aug;31(4):419-424. (PMID: 28842051)
JAMA Netw Open. 2022 Sep 1;5(9):e2234194. (PMID: 36173627)
Health Technol (Berl). 2021;11(2):349-357. (PMID: 33520588)
J Gen Pract (Los Angel). 2014 Jan 5;2:. (PMID: 25411657)
J Clin Transl Sci. 2022 Dec 14;7(1):e29. (PMID: 36845316)
BMC Med Inform Decis Mak. 2013 Aug 09;13:86. (PMID: 24070335)
Sci Rep. 2024 Nov 5;14(1):26783. (PMID: 39500759)
Weitere Informationen
Background: With rising patient volumes and a focus on quality, our health system had the objective to create a more efficient way to ensure accurate documentation of colorectal cancer (CRC) screening intervals from inbound colonoscopy reports to ensure timely follow-up. We developed an integrated end-to-end workflow solution using machine learning (ML) and robotic process automation (RPA) to extract and update electronic health record (EHR) follow-up dates from unstructured data.
Objective: This study aimed to automate data extraction from external, free-text colonoscopy reports to identify and document recommended follow-up dates for CRC screening in structured EHR fields.
Methods: As proof of concept, we outline the process development, validity, and implementation of an approach that integrates available tools to automate data retrieval and entry within the EHR of a large academic health system. The health system uses Epic Systems as its EHR platform, and the ML model used was trained on health system patient colonoscopy reports. This proof-of-concept process study consisted of six stages: (1) identification of gaps in documenting recommendations for follow-up CRC screening from external colonoscopy reports, (2) defining process objectives, (3) identification of technologies, (4) creation of process architecture, (5) process validation, and (6) health system-wide implementation. A chart review was performed to validate process outcomes and estimate impact.
Results: We developed an automated process with 3 primary steps leveraging ML and RPA to create a fully orchestrated workflow to update CRC screening recall dates based on colonoscopy reports received from external sources. Process validity was assessed with 690 scanned colonoscopy reports. During process validation, the overall automated process achieved an accuracy of 80.7% (557/690, 95% CI 77.8%-83.7%) for correctly identifying the presence or absence of a valid follow-up date and a follow-up date false negative identification rate of 32.9% (130/395, 95% CI 29.4%-36.4%). From the organization-wide implementation to go-live until December 31, 2024, the system processed 16,563 external colonoscopy reports. Of these, 35.3% (5841/16,563) had a follow-up date meeting the relevant ML model threshold and thus were identified as ready for RPA processing.
Conclusions: Implementation of an automated workflow to extract and update CRC screening follow-up dates from colonoscopy reports is feasible and has the potential to improve accuracy in patient recall while reducing documentation burden. By standardizing data ingestion, extending this approach to various unstructured data types can address deficiencies in structured EHR documentation and solve for a lack of data integration and reporting for quality measures. Automated workflows leveraging ML and RPA offer practical solutions to overcome interoperability challenges and the use of unstructured data within health care systems.
(© Elizabeth R Stevens, Jager Hartman, Paul Testa, Ajay Mansukhani, Casey Monina, Amelia Shunk, David Ranson, Yana Imberg, Ann Cote, Dinesha Prabhu, Adam Szerencsy. Originally published in JMIR Medical Informatics (https://medinform.jmir.org).)