Result: SAES: A Python Library for Statistical Evaluation of Stochastic Artificial Intelligence Algorithms

Title:

SAES: A Python Library for Statistical Evaluation of Stochastic Artificial Intelligence Algorithms

Authors:

Carreira, Emilio R., orcid:0000-0001-5580-, Nebro, Antonio J.

Publisher Information:

Zenodo

Publication Year:

2025

Collection:

Zenodo

Subject Terms:

Stochastic algorithms, Statistical evaluation, Benchmarking, Artificial Intelligence

Document Type:

E-resource software

Language:

English

Relation:

https://zenodo.org/records/15683420; oai:zenodo.org:15683420; https://doi.org/10.5281/zenodo.15683420

DOI:

10.5281/zenodo.15683420

Availability:

https://doi.org/10.5281/zenodo.15683420
https://zenodo.org/records/15683420

Rights:

Creative Commons Attribution 4.0 International ; cc-by-4.0 ; https://creativecommons.org/licenses/by/4.0/legalcode

Accession Number:

edsbas.C6476D9F

Database:

BASE

More Information

SAES (Stochastic Algorithm Evaluation Suite) is a Python library for the rigorous statistical analysis and comparison of stochastic algorithms in artificial intelligence. Stochastic methods—such as metaheuristics and some machine learning techniques—are widely applied in optimization, learning, and simulation for their ability to explore complex solution spaces and avoid local optima. However, their inherent randomness complicates reliable evaluation and fair benchmarking. SAES addresses this challenge by offering a unified framework that combines robust statistical methods with intuitive visualizations. The library includes non-parametric tests (Friedman, Friedman aligned-rank, Quade, and Wilcoxon signed-rank), parametric tests (t-test and ANOVA), and appropriate post hoc analyses, such as the Nemenyi test. It simplifies experimental workflows by automating CSV data handling, statistical inference, customizable visualizations (e.g., boxplots, critical distance diagrams, and Bayesian posterior plots), and LaTeX report generation for reproducible research. Case studies demonstrate its effectiveness in benchmarking multi-objective metaheuristics and comparing machine learning models. By standardizing evaluation practices for stochastic AI algorithms, SAES promotes research reproducibility, enables fair comparisons, and provides an accessible platform for both researchers and practitioners.
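To make the workflow described in the abstract concrete, the following is a minimal sketch of a non-parametric comparison of several stochastic algorithms: a Friedman test across all algorithms, followed by pairwise Wilcoxon signed-rank tests with a Holm correction. It does not use the SAES API, which this record does not document; it relies on SciPy instead, and the names compare_algorithms and holm_adjust, the algorithm labels, and the sample data are all hypothetical and chosen only for illustration.

```python
import numpy as np
from scipy.stats import friedmanchisquare, wilcoxon


def holm_adjust(pvalues):
    """Return Holm-adjusted p-values in the same order as the input."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        # Smallest p-value is multiplied by m, the next by m - 1, and so on;
        # the running maximum keeps the adjusted values monotone.
        running_max = max(running_max, min(1.0, (m - rank) * pvalues[idx]))
        adjusted[idx] = running_max
    return adjusted


def compare_algorithms(scores, alpha=0.05):
    """Friedman test over all algorithms, then pairwise Wilcoxon tests.

    scores maps an algorithm name to a sequence of paired results,
    one value per benchmark problem (hypothetical layout).
    """
    names = list(scores)
    stat, p_value = friedmanchisquare(*(scores[n] for n in names))
    result = {"friedman_stat": stat, "friedman_p": p_value, "pairwise": {}}

    # Only run the pairwise tests if the omnibus test rejects.
    if p_value < alpha:
        pairs = [(a, b) for i, a in enumerate(names) for b in names[i + 1:]]
        raw = [wilcoxon(scores[a], scores[b]).pvalue for a, b in pairs]
        for pair, p_adj in zip(pairs, holm_adjust(raw)):
            result["pairwise"][pair] = p_adj
    return result


# Hypothetical data: one quality score per benchmark problem and algorithm.
rng = np.random.default_rng(seed=0)
example_scores = {
    "AlgA": rng.normal(0.80, 0.02, 12),
    "AlgB": rng.normal(0.78, 0.02, 12),
    "AlgC": rng.normal(0.70, 0.02, 12),
}
print(compare_algorithms(example_scores))
```

The Holm step is used here in place of the Nemenyi test mentioned in the abstract because it can be written in a few lines without extra dependencies; it is one common way to control the family-wise error rate, not necessarily the procedure SAES applies.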
