What happens when you're an Observability vendor migrating to @opentelemetry? @jea knows exactly what that's like, as he shares the story of migrating to OpenTelemetry at ServiceNow Cloud Observability (formerly Lightstep).
That said, while the book is great, I feel it's too long; the authors could have taken a more pragmatic approach to some of the chapters, which contain a lot of repetition.
Outside of "how much" and "where is all of it," what should you talk to your users about re: their #o11y data needs?
Workflows?
Tooling gaps?
Metrics to improve?
Platform feature requests?
Current toil that feels unnecessary?
What other data should you bring to the discussion?
The most important factor in getting your logs under control is routing them to the right place, /dev/null included. If you're trying to optimize log costs in a system that charges you dollars per gig on ingress, you've already lost the battle.
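For the "route it, don't ship it" idea, here's a minimal sketch using Python's standard logging module; the logger names and level thresholds are hypothetical, and a real pipeline would swap the StreamHandler for your exporter of choice:

```python
import logging

# Route a chatty dependency's output to the logging equivalent of /dev/null.
# The decision happens in-process, before any byte hits a paid ingress.
noisy = logging.getLogger("myapp.chatty_dependency")  # hypothetical name
noisy.addHandler(logging.NullHandler())
noisy.propagate = False  # never reaches the root logger or the pipeline

# Keep actionable logs on a path that actually goes somewhere.
actionable = logging.getLogger("myapp.payments")  # hypothetical name
actionable.setLevel(logging.WARNING)  # drop INFO noise at the source
actionable.addHandler(logging.StreamHandler())  # stand-in for a real exporter

actionable.warning("refund failed; retrying")  # shipped
noisy.debug("poll tick")                       # dropped for free
```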
Teams should hold a regular review to determine which of their #Observability data is actually being used. Otherwise, "just in case" becomes a valueless justification with uncapped costs.
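As a hedged sketch of what that review could look like: assume you can export the names you ingest and the names your dashboards and alerts actually reference (both lists below are invented for illustration, not from any real backend):

```python
# Minimal "is it actually used?" review sketch with made-up data.
ingested = {"http.request.duration", "cache.hits", "legacy.batch.rows"}
referenced = {"http.request.duration", "cache.hits"}  # from dashboards/alerts

unused = ingested - referenced
for name in sorted(unused):
    print(f"Candidate for /dev/null or a retention cut: {name}")
```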
✨#OTel Q&A TODAY!!✨ @hazelweakly joins us to share some gold nuggets from her personal experience with #Observability at this week's OTel Q&A:
✨Learn how to contribute to OpenTelemetry!! ✨Are you an #OpenTelemetry practitioner? Have you ever wanted to contribute back to OpenTelemetry, but didn’t know where to begin? Then check out my latest blog post! 👇
The Speed of Light Will Cap Traditional Centralized #Observability
There are lots of reasons DevOps teams have been looking into #o11y Pipelines and their in-flight processing possibilities: cost and performance among them. But I rarely hear about the hardest limit:
Normally, the Red<>Green band is much wider for cloud migrations. I've shifted it specifically for #Observability, where data's half-life is short and its immediacy is vital.
Put simply, there is a hard limit to how much data you can get across the wire in the needed time.
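To make that limit concrete, here's a back-of-the-envelope sketch; every number in it is an illustrative assumption, not a measurement:

```python
# Light in fiber travels at roughly 2/3 the speed of light in vacuum.
SPEED_IN_FIBER_KM_PER_S = 200_000
distance_km = 12_000  # assumed cross-ocean distance to a central backend

rtt_s = 2 * distance_km / SPEED_IN_FIBER_KM_PER_S
print(f"Best-case round trip: {rtt_s * 1000:.0f} ms")  # ~120 ms, physics-bound

# Throughput side of the same limit, with hypothetical numbers:
telemetry_gbps = 50  # assumed fleet-wide telemetry firehose
wan_link_gbps = 10   # assumed link capacity to the central store
print(f"Telemetry outruns the wire by {telemetry_gbps / wan_link_gbps:.0f}x")
```

No amount of compression or batching changes that first number; centralizing everything means paying that latency on every query.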
Because Observability is a meta-practice, at what point does it deserve focused attention instead of being an afterthought? Launch? A scale threshold? Downtime thresholds? Dev burnout?
In order to improve your Observability practice, you first need to write down what you want from it. Otherwise, the path beyond Collect > Search > Display becomes impossibly murky.
I'm looking forward to seeing how Observability tools change as OTel gains more and more mindshare. If collection isn't the primary value for a vendor, what is?
Many of us have hobbies. Many of them are beautiful or useful to the world. Mine is not.
My personal white whale is finding the perfect way to peel an orange. Years of research and experimentation haven't yet led to an ideal solution, and that persistent failure is precisely why it's taught me 4 key principles about Observability.
What is your go-to mental model for thinking about Observability?
In talking with DevOps, SRE, and application teams, I find that there aren't enough detailed mental models for thinking through what an Observability practice is and what it should do.
So here's a short list of models with potential:
Driving a car
Flying a plane
Cooking a big meal
What are other mental models you use to think through running your applications?