Treffer: Doc Assist: Intelligent Document Processing Assistance for Enhanced Accessibility
Title:
Doc Assist: Intelligent Document Processing Assistance for Enhanced Accessibility
Authors:
Source:
International Research Journal on Advanced Engineering Hub (IRJAEH). 3:2210-2214
Publisher Information:
RSP Science Hub, 2025.
Publication Year:
2025
Document Type:
Fachzeitschrift
Article
ISSN:
2584-2137
DOI:
10.47392/irjaeh.2025.0324
Rights:
CC BY NC
Accession Number:
edsair.doi...........b90ed906ea77d9f790e57b2de702d571
Database:
OpenAIRE
Weitere Informationen
This project presents a desktop assistant designed to retrieve information from non-machine-readable documents, such as scanned images and PDFs. Using Tesseract OCR, the system extracts text, and BM25 is employed for effective document ranking based on user-provided keywords. Additionally, word embeddings are integrated to improve semantic search accuracy. The application is built with Tkinter, offering an intuitive, offline experience. The system's architecture is optimized for quick document retrieval, ensuring minimal resource consumption while maintaining relevance. This documentation covers the design, implementation, and challenges encountered during development.