Treffer: The Need for Language Support for Fault-Tolerant Distributed Systems

Title:
The Need for Language Support for Fault-Tolerant Distributed Systems
Contributors:
Cezara Dragoi and Thomas A. Henzinger and Damien Zufferey
Publisher Information:
Schloss Dagstuhl – Leibniz-Zentrum für Informatik
Publication Year:
2015
Collection:
DROPS - Dagstuhl Research Online Publication Server (Schloss Dagstuhl - Leibniz Center for Informatics )
Document Type:
Fachzeitschrift article in journal/newspaper<br />conference object
File Description:
application/pdf
Language:
English
Relation:
Is Part Of LIPIcs, Volume 32, 1st Summit on Advances in Programming Languages (SNAPL 2015); https://drops.dagstuhl.de/entities/document/10.4230/LIPIcs.SNAPL.2015.90
DOI:
10.4230/LIPIcs.SNAPL.2015.90
Accession Number:
edsbas.3CFACCDC
Database:
BASE

Weitere Informationen

Fault-tolerant distributed algorithms play an important role in many critical/high-availability applications. These algorithms are notoriously difficult to implement correctly, due to asynchronous communication and the occurrence of faults, such as the network dropping messages or computers crashing. Nonetheless there is surprisingly little language and verification support to build distributed systems based on fault-tolerant algorithms. In this paper, we present some of the challenges that a designer has to overcome to implement a fault-tolerant distributed system. Then we review different models that have been proposed to reason about distributed algorithms and sketch how such a model can form the basis for a domain-specific programming language. Adopting a high-level programming model can simplify the programmer's life and make the code amenable to automated verification, while still compiling to efficiently executable code. We conclude by summarizing the current status of an ongoing language design and implementation project that is based on this idea.