RPC Metrics at Google
JBD, Google (@rakyll)
RPC Metrics at Google JBD, Google (@rakyll) gRPC Metrics at - - PowerPoint PPT Presentation
RPC Metrics at Google JBD, Google (@rakyll) gRPC Metrics at Google JBD, Google (@rakyll) Request Metrics at Google JBD, Google (@rakyll) "100% is the wrong reliability target for basically everything." -- Benjamin Treynor
JBD, Google (@rakyll)
JBD, Google (@rakyll)
JBD, Google (@rakyll)
@rakyll
@rakyll
@rakyll
Principled way of saying what level of downtime is acceptable.
@rakyll
Analytics frontend server Authentication Reporting Users ... Spanner Blob Store
@rakyll
Questions infra teams want to ask:
@rakyll
Breaking down the metrics data...
@rakyll
Query the collected data in various ways:
@rakyll
Analytics frontend server Authentication Reporting Users ... Spanner Blob Store
...
@rakyll
Blob store read errors by originator
@rakyll
(split between recording and aggregation)
@rakyll
@rakyll
@rakyll
http://server:7777/debug/rpcz
@rakyll
Monarch, Prometheus, and more.
@rakyll
@rakyll