Treffer: USE OF MACHINE LEARNING TOOLS AND PYTHON LIBRARIES FOR PREDICTION THROUGH REGRESSIONS

Title:
USE OF MACHINE LEARNING TOOLS AND PYTHON LIBRARIES FOR PREDICTION THROUGH REGRESSIONS
Publisher Information:
Zenodo
Publication Year:
2024
Collection:
Zenodo
Document Type:
Fachzeitschrift article in journal/newspaper
Language:
unknown
ISSN:
2786-9024
DOI:
10.5281/zenodo.10804947
Rights:
Creative Commons Attribution 4.0 International ; cc-by-4.0 ; https://creativecommons.org/licenses/by/4.0/legalcode
Accession Number:
edsbas.C334F146
Database:
BASE

Weitere Informationen

The purpose of this work is forecasting the average check amount depending on the time spent by customers in the retail area and forecasting the average check amount based on seasonal demand characteristics considering the constant annual price growth. Forecast methods were analysed. The use of Python libraries was considered. The code was developed for solving the above-mentioned forecasting tasks through linear, polynomial and random forest regressions, specifically, the search for the dependence of the average check amount on the time spent by customers in the retail area and the dependence of the average check amount on seasonal price fluctuations over several years. A comparison of the results of linear and polynomial regression was made under conditions of smaller or larger volumes of the input data, explanations were provided regarding the appropriateness of choosing one or the other when modelling non-cyclical processes and why the random forest model provides adequate results for cyclical processes. The source code and examples of input data were shared for public access. Scientific novelty lies in the ways of applying regressions for modelling and forecasting the average check amount depending on the time customers spend in the retail area as well as the average check amount depending on seasonal demand characteristics with consideration of constant price growth. The selected examples of cyclical and non-cyclical economic phenomena are the most common ones, and practitioners face similar tasks daily. Therefore, the results of this work provide them with ready-made solutions that require minimal adaptation to practical needs. Additionally, it demonstrates the feasibility of deploying them in the AWS cloud environment and the potential for integration with various data sources and other information systems. Keywords: machine learning, regression, prediction, polynomial regression, random forest, python.