Treffer: SAES: A Python Library for Statistical Evaluation of Stochastic Artificial Ingelligence Algorithms

Title:
SAES: A Python Library for Statistical Evaluation of Stochastic Artificial Ingelligence Algorithms
Publisher Information:
Zenodo
Publication Year:
2025
Collection:
Zenodo
Document Type:
E-Ressource software
Language:
English
DOI:
10.5281/zenodo.15683420
Rights:
Creative Commons Attribution 4.0 International ; cc-by-4.0 ; https://creativecommons.org/licenses/by/4.0/legalcode
Accession Number:
edsbas.C6476D9F
Database:
BASE

Weitere Informationen

SAES (Stochastic Algorithm Evaluation Suite) is a Python library for the rigorous statistical analysis and comparison of stochastic algorithms in artificial intelligence. Stochastic methods—such as metaheuristics and some machine learning techniques—are widely applied in optimization, learning, and simulation for their ability to explore complex solution spaces and avoid local optima. However, their inherent randomness complicates reliable evaluation and fair benchmarking. SAES addresses this challenge by offering a unified framework that combines robust statistical methods with intuitive visualizations. The library includes non-parametric tests (Friedman, Friedman aligned-rank, Quade, and Wilcoxon signed-rank), parametric tests (t-test and ANOVA), and appropriate post hoc analyses, such as the Nemenyi test. It simplifies experimental workflows by automating CSV data handling, statistical inference, customizable visualizations (e.g., boxplots, critical distance diagrams, and Bayesian posterior plots), and LaTeX report generation for reproducible research. Case studies demonstrate its effectiveness in benchmarking multi-objective metaheuristics and comparing machine learning models. By standardizing evaluation practices for stochastic AI algorithms, SAES promotes research reproducibility, enables fair comparisons, and provides an accessible platform for both researchers and practitioners.