drmorr,
@drmorr@hachyderm.io

"We completely and utterly fucked it up by defining observability to mean “gigachad-scale JSON logs parser with a fancy search engine.” Really? Really? That’s the “we solve Real Serious Business Problems™” strategy we went with?"

@hazelweakly killing it here

https://hazelweakly.me/blog/redefining-observability/

adrianco,
@adrianco@mastodon.social

@drmorr @hazelweakly Lots of useful ideas here, but I still prefer the control theory definitions. Can you see what’s going on is observability. Can you figure out what to do about it is modelability, but people don’t seem to call it that. Can you change the system is controllability. That’s a feedback loop. The whole thing is resilience. Modelability is where most of the human aspects come in.

hazelweakly,
@hazelweakly@hachyderm.io

@adrianco @drmorr I think the reason people don't use modelability is because it sounds backwards in a sense. Or it's not entirely obvious what makes sense there?

I think really something I could've articulated better is that "distributed tracing != Observability" and somehow the control theory definition got very tightly bundled with distributed tracing.

Q: doesn't resilience also want adaptive capacity? Or is that implied by controllability somehow?

hazelweakly,
@hazelweakly@hachyderm.io

@adrianco @drmorr that said, it does sound like the (data + human bits + action) collection of concerns is the thing that is kinda missing and that a lot of people do agree it exists. Whether or not you call all of that observability or just part of it is a different story :)

Am I interpreting that right?

adrianco,
@adrianco@mastodon.social

@hazelweakly @drmorr Model-ability: can you make sense of the observations. That depends on training, experience, up to date run books. Whether someone managed to code an automated model that works well enough most of the time. (E.g. CPU is high so increase autoscaler limit). Tracing is a really good observability upgrade as long as you can digest the results. You have resilience if the system is “in control” and working inside its capacity.
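
(Editor's illustration, not part of the original post.) The observe / model / control loop described in this thread can be sketched roughly as below. This is a minimal sketch under stated assumptions: the `Autoscaler` class, the function names, and the 0.8 CPU threshold are all hypothetical and do not correspond to any real autoscaler API. It only shows how the "CPU is high so increase autoscaler limit" rule fits into the three stages.

```python
from dataclasses import dataclass


@dataclass
class Autoscaler:
    # Hypothetical stand-in for a real autoscaler's configuration.
    max_replicas: int


def observe(cpu_samples):
    """Observability: can you see what is going on? Here, average CPU (0.0 to 1.0)."""
    return sum(cpu_samples) / len(cpu_samples)


def model(avg_cpu, high_water=0.8):
    """Modelability: can you make sense of the observation?
    The 0.8 threshold is an assumed value for illustration only."""
    return "raise_limit" if avg_cpu > high_water else "hold"


def control(scaler, decision):
    """Controllability: can you change the system?"""
    if decision == "raise_limit":
        scaler.max_replicas += 1
    return scaler


def feedback_step(scaler, cpu_samples):
    """One pass around the loop: observe, then model, then control."""
    return control(scaler, model(observe(cpu_samples)))
```

Calling `feedback_step(Autoscaler(max_replicas=3), [0.9, 0.95, 0.85])` raises the limit by one, while low CPU samples leave it unchanged. In practice the `model` step is where training, experience, and run books come in; the automated rule here only covers the easy case.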

hazelweakly,
@hazelweakly@hachyderm.io

@adrianco @drmorr makes sense! I haven't heard that specific definition before but it tracks for me for sure

(Although I usually hear resilience combined with adaptive capacity, but I don't think that's incompatible with what you're saying)

msylor,
@msylor@noc.social

@adrianco @drmorr @hazelweakly I prefer the term understandability to modelability just because it’s more familiar and hence simpler. Understandability has always been the missing link in observability (monitoring) systems. It does no good to collect tons of information if humans don’t understand what it means and what control actions it should imply.
