Serviceeinschränkungen vom 12.-22.02.2026 - weitere Infos auf der UB-Homepage

Treffer: IntelliExtract: An End-to-End Framework for Chinese Resume Information Extraction from Document Images

Title:
IntelliExtract: An End-to-End Framework for Chinese Resume Information Extraction from Document Images
Authors:
Source:
Advances in Engineering Technology Research. 6:570
Publisher Information:
Madison Academic Press, 2023.
Publication Year:
2023
Document Type:
Fachzeitschrift Article
ISSN:
2790-1688
DOI:
10.56028/aetr.6.1.570.2023
Accession Number:
edsair.doi...........ceb33891eaeeec1d3541ef11fa2f7170
Database:
OpenAIRE

Weitere Informationen

Traditional document processing can be labor-intensive and time-consuming to manually extract and organize the information in a document. This manual process is often inefficient and error-prone. In order to improve processing efficiency and accuracy of document data, we develop IntelliExtract, an end-to-end framework designed for document information extraction. This is a comprehensive framework that includes image text detection and recognition, information extraction, and document intelligent question-answering. Some recent models and algorithms are employed, OCR models for converting scanned documents into machine readable text, layout analysis algorithms for understanding the spatial arrangement of document elements, and information extraction techniques for extracting structured data from unstructured documents. To evaluate the effectiveness of the framework, we conducted experiments by employing a Chinese Talent Resumes Dataset for visualizing the results. For named entity extraction, the confidence level of the extracted results from the text in the images is generally above 0.95. The proposed framework provides a powerful tool for enterprises, educational institutions, and other entities in processing document information, and holds promise for significant practical applications.