Treffer: Designing production-friendly machine learning

Title:
Designing production-friendly machine learning
Authors:
Source:
Proceedings of the VLDB Endowment ; volume 14, issue 13, page 3420-3420 ; ISSN 2150-8097
Publisher Information:
Association for Computing Machinery (ACM)
Publication Year:
2021
Document Type:
Fachzeitschrift article in journal/newspaper
Language:
English
DOI:
10.14778/3484224.3484241
Accession Number:
edsbas.B2CA7548
Database:
BASE

Weitere Informationen

Building production ML applications is difficult because of their resource cost and complex failure modes. I will discuss these challenges from two perspectives: the Stanford DAWN Lab and experience with large-scale commercial ML users at Databricks. I will then present two emerging ideas to help address these challenges. The first is "ML platforms", an emerging class of software systems that standardize the interfaces used in ML applications to make them easier to build and maintain. I will give a few examples, including the open-source MLflow system from Databricks [3]. The second idea is models that are more "production-friendly" by design. As a concrete example, I will discuss retrieval-based NLP models such as Stanford's ColBERT [1, 2] that query documents from an updateable corpus to perform tasks such as question-answering, which gives multiple practical advantages, including low computational cost, high interpretability, and very fast updates to the model's "knowledge". These models are an exciting alternative to large language models such as GPT-3.