SLIDE 11 SLO Violation Mitigation (1)
- Observability improved through online distributed tracing
- Auto-labeled training data and RL online learning driven by the performance anomaly injector
- SLO violation detection and localization via critical path analysis
  - SVM-based critical component extraction
- SLO violation mitigation based on reinforcement learning
  - Identifies low-level resource contention
  - Estimates the right amount to reprovision
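
The critical path analysis step can be illustrated with a small sketch. It assumes the execution history graph is a DAG of calls with one latency value per node, and that the critical path is the root-to-leaf path with the largest total latency. The slide does not give the actual algorithm, so `longest_path`, the `cache` and `db` nodes, and all latency values are hypothetical and chosen only for illustration.

```python
from functools import lru_cache

def longest_path(graph, latency_ms, root):
    """Return (total_ms, path) for the highest-latency root-to-leaf path.

    graph: dict mapping a node to the list of nodes it calls (must be acyclic).
    latency_ms: dict mapping a node to its own latency (assumed, per-node).
    """
    @lru_cache(maxsize=None)
    def best(node):
        children = graph.get(node, [])
        if not children:
            return latency_ms[node], (node,)
        # Pick the child subtree with the largest accumulated latency.
        total, path = max(best(child) for child in children)
        return latency_ms[node] + total, (node,) + path

    return best(root)

# Hypothetical execution history graph; only Nginx and PHP-FPM appear on the slide.
graph = {
    "nginx": ["php-fpm", "cache"],
    "php-fpm": ["db"],
    "cache": [],
    "db": [],
}
latency_ms = {"nginx": 2, "php-fpm": 5, "cache": 1, "db": 8}

total, path = longest_path(graph, latency_ms, "nginx")
print(total, " -> ".join(path))
```

In this toy graph the path through `php-fpm` and `db` dominates, so those instances would be the candidates handed to critical component extraction. A real trace would also need to handle parallel and background calls, which this sketch ignores.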
[Figure: system architecture. Labeled components: Tracing Coordinator; Tracing Module; Extractor (Critical Path Extraction, Critical Instance Extraction); RL-based Resource Estimator; Deployment Module; Performance Anomaly Injector; Microservices Deployment & Service Dependency Graph (Load Balancer, Nginx, PHP-FPM, Microservice Instance, Replica Set). Labeled data flows: Telemetry Data, Execution History Graph, Critical Paths, Candidates, Performance Counters, Re-allocation Actions. Function labels: criticalComponent(), longestPath(). Controlled Resources: CPU, LLC, Memory, I/O, Network, Replicas.]