On January 11, 2022, C2C members
60 Minutes Summed Up in 60 Seconds
-
Yuri Grinshteyn, Customer SRE at Google Cloud, was the guest of the session.
-
Also known as "Stack Doctor" on YouTube, Grinshteyn advocates the best way to monitor, observe and follow the SRE best practices as learnt by Google in their own service SRE teams.
-
Grinshteyn explained the difference between monitoring and observability:
-
Monitoring is "only" the data about a service, a resource.
-
Observability is the behavior of the service metrics through time.
-
-
To observe data, you need different data sources; metrics, of course, but also logs and traces.
-
There are several tools available, but the purpose of each is observability: FluentD, Open Sensus, Prometheus, Graphana, etc. All are open-source, portable, and compliant with Cloud Operations.
-
The overhead of instrumented code is quite invisible, and the provided metrics are much more important than the few CPU cycles lost because of it.
-
Microservices and monoliths should use trace instrumentation. Even a monolith never works alone: it uses Google Cloud Services, APIs, Databases, etc. Trace allows us to understand North-South and East-West traffic.
Get in on the Monitoring and Observability Conversation!
Despite its 30-minute time limit, this conversation didn't stop. Monitoring and observability is a hot topic, and it certainly kept everyone's attention. The group spent time on monitoring, logging, error budget, SRE, and other topics such as:
-
Cloud Operations
-
Managed Services for Prometheus
-
Cloud Monitoring
Members also shared likes and dislikes. For example, one guest, Mehdi, "found it unfortunate not to have out of the box metrics on GKE to monitor golden signals," and said "it's difficult to convince ops to install Istio just for observability."
Preview What's Next
Two upcoming sessions will cover topics that came up but didn't make it to the discussion floor:
If either of these events interests you, be sure to sign up to get in touch with the group!
Extra Credit
Looking for more Google Cloud products news and resources? We got you. The following links were shared with attendees and are now available to you!
-
Sre.google website