Site Reliability Troubleshooting with Stackdriver APMGo to Lab
This one has been quite confusing the way is layout
Time was amost almost full
custom.googleapis.com/opencensus/grpc.io/client/roundtrip_latency did not produce any results in stackdriver and I was stuck. Very disappointing lab!
Stack Driver is very buggy as it does get affected by browser caching slowing down the rate at which users can perform tasks
Was not able not set up the Error_Rate_SLI as alert. The Error_Rate_SLI was created as stated in the lab. Copied and pasted from my lab: resource.type="k8s_container" severity>=ERROR labels."k8s-pod/app"="currencyservice" The "Error_Rate_SLI" was available for the alert, but the resource type was missing. Even after waiting more than 10 minutes and reloading the page for the resource type it stated: None. As this was the second time i started the lab (another problem while doing this the first time) I don't want to spend any more credits into this lab.
Some instructions didn't work as expected
The lab didn't start