Treffer: Data Fusion of Observability Signals to Detect Anomalies and Failures in Microservices
Weitere Informationen
Microservices have become a reality, with large companies adopting this architecture due to its advantages, such as development agility, decoupling, scalability, resilience, and operational efficiency. However, this approach has drawbacks, such as the need for complex cloud infrastructure and communication between microservices through protocols. In organizations with hundreds of microservices, any failure in one of them can trigger cascading effects, resulting in more failures. Identifying failures in microservices is not a trivial task. The proposed work focuses on data fusion for detecting failures and anomalies in microservices, combining two pillars of observability: distributed tracing and metrics, with the help of OpenTelemetry. This fusion aims to provide faster and more efficient failure detection, as well as the identification of anomalies in metrics. This allows problems to be detected and addressed more quickly, improving the overall stability and reliability of the system.