Treffer: The Hopsworks Feature Store for Machine Learning
info:eu-repo/semantics/restrictedAccess
0000-0002-4486-5343
0000-0002-6779-7435
0000-0002-9484-6714
doi:10.1145/3626246.3653389
Scopus 2-s2.0-85196429961
1457578244
From OAIster®, provided by the OCLC Cooperative.
Weitere Informationen
Data management is the most challenging aspect of building Machine Learning (ML) systems. ML systems can read large volumes of historical data when training models, but inference workloads are more varied, depending on whether it is a batch or online ML system. The feature store for ML has recently emerged as a single data platform for managing ML data throughout the ML lifecycle, from feature engineering to model training to inference. In this paper, we present the Hopsworks feature store for machine learning as a highly available platform for managing feature data with API support for columnar, row-oriented, and similarity search query workloads. We introduce and address challenges solved by the feature stores related to feature reuse, how to organize data transformations, and how to ensure correct and consistent data between feature engineering, model training, and model inference. We present the engineering challenges in building high-performance query services for a feature store and show how Hopsworks outperforms existing cloud feature stores for training and online inference query workloads.
QC 20240628Part of ISBN 979-840070422-2