Treffer: Data Science methodologies for predictive analytics in Smart Cities
info:eu-repo/semantics/openAccess
Italian
English
Schiano Di Cola, Vincenzo (2022) Data Science methodologies for predictive analytics in Smart Cities. [Tesi di dottorato]
1429129790
From OAIster®, provided by the OCLC Cooperative.
Weitere Informationen
The goal of this PhD dissertation is to conduct academic and industrial research on Data Science in a variety of fields. An interdisciplinary approach was required to address today's scientific and societal challenges. A three-year training path applied Data Science to two Smart City application domains: Cultural Heritage (CH) and E-Health, with a focus on machine learning (ML) and knowledge graphs (KG). The first application is on classifying and forecasting visitor flow within a museum. By applying Machine Learning to the CH sector, the study examined a mixed dataset of numerical and categorical values. A framework for data processing and information extraction for clustering visitor behaviors was developed to save time. The dissertation then focuses on two e-health topics: healthcare booking prescriptions and image processing for biosensors. Prescriptions issued by general practitioners were modeled as a KG to help optimize government and local e-health services. This dissertation aimed to identify more patterns in data than a legacy dataset and thus make more accurate predictions. The final biosensor application recognizes point of interests in smartphone photos and uses machine learning algorithms to determine their chemical composition. The tool predicts the amount of a compound based on the liquid sample's luminescence. This dissertation's specific research questions concentrate around one question: how can Data Science help construct Smart Cities? This is addressed through a framework for analyzing people moving indoors, an extension of a legacy SQL database to a Knowledge Graph, and the building of a lab-on-hand proof of concept. All of this is accomplished through the use of a wide range of mathematical and software methods, such as machine learning (clustering and classification), image processing, and KG embedding. Python and R with Grakn, AmpliGraph, OpenCV, and scikit-learn have been utilized as toolkits. Among the most important contributions made by