Treffer: Scaling data-intensive analytics with Heat: a Python library for massively-parallel array computing and machine learning

Title:
Scaling data-intensive analytics with Heat: a Python library for massively-parallel array computing and machine learning
Source:
Helmholtz AI Conference, Düsseldorf, Germany, 2024-06-12 - 2024-06-14
Publication Year:
2024
Collection:
Forschungszentrum Jülich: JuSER (Juelich Shared Electronic Resources)
Subject Geographic:
DE
Document Type:
Konferenz conference object
Language:
English
Rights:
info:eu-repo/semantics/closedAccess
Accession Number:
edsbas.1887AF33
Database:
BASE

Weitere Informationen

Handling and analyzing massive data sets is highly important for the vast majority of research communities, but it is also challenging, especially for those communities without a background in HPC. The Helmholtz Analytics Toolkit (Heat) library offers a solution to this problem by providing memory-distributed and hardware-accelerated array manipulation, data analytics, and machine learning algorithms in Python, targeting the usage by non-experts in HPC. In short: Heats objective is to make array computing and machine learning as easy on a CPU/GPU-cluster as it is on a workstation.Our poster provides an overview of Heats design principles, its current features and capabilities, and discusses its role in the ecosystem of distributed array computing and machine learning in Python.