Viss, to opensource
@Viss@mastodon.social avatar

so is zabbix the go to opensource system monitoring tool now? or is it observium?

thisismissem, to Redis
@thisismissem@hachyderm.io avatar

Does anyone know of a tool to monitor pub/sub topics? e.g., messages per second on topics, average message size in bytes, etc.

I know the INFO command gives total stats for all commands, but it'd be nice to have more detail into pub/sub. I'm sure we used to have this at pusher.com, but not sure how, and that was over a decade ago, before they moved to an in-house developed messaging infrastructure

paigerduty, to random
@paigerduty@hachyderm.io avatar

when you hear the term "monitoring debt" what comes to mind?

DefectiveWings, to tech

What are y'all using for on-prem service health/ ? Our install needs a rebuild, and now is as good a time as any to look at other options.

We have a mixed environment of Windows Server 2012r2, 2022, CentOS 7 and Ubuntu 20.04, if that makes any difference.

taco, to sysadmin

another reason i love is being able to see what went wrong with a request. here you can see that https://cyberfurz.social went down for a few minutes, but recovered. 502 is a bad gateway error, so most likely nothing was wrong with the server software itself, just a faulty network connection down the line. really neat stuff!

i'm adding more monitors to https://uptime.birdcat.cafe/status/fediverse now :3

rolle, to IT
@rolle@mementomori.social avatar

Just set up a log watcher via Datadog for my Mastodon server cluster, because it's the most pretty and easiest for me in this situation. Works amazing and now I will receive a notification from even a tiniest anomaly!

Check out the status of the servers at https://status.mementomori.social

You are in safe hands here.

Logging in mementomori.social's Datadog panel
CPU usage graph
Fatal errors monitor

FTWynn, to random

If you aren't keeping track of these 4 usage areas in your Observability tooling, you'll never be able to optimize its value.

vwbusguy, to devops
@vwbusguy@mastodon.online avatar

Hey there, and friends - Have you tried ? Do you like it? Was it worth the money?

If you've used regular Nagios or Centreon before, how does it compare?

Did it integrate well with your existing kubernetes/prometheus infra?

How was it to migrate into? Was the service discovery intuitive?

https://checkmk.com

iulia_, to python

Looking to build a simple on top of my search project for a nicer and eventually doing some & of user activity. Haven’t worked on this since uni when we used and sometimes a bit of . Anyone have any good insights / best practices / favorite reference for something like this?

freekmurze, to random
@freekmurze@phpc.social avatar

💪 Reached a nice milestone: https://ohdear.app has sent out 10 million notifications 🚨

vinzv, to grafana German

Prometheus und Grafana installiert, zum ersten Mal damit rumgespielt. Jessas, ich bin völlig überfordert. 🤯

beandev, to 3DPrinting German
@beandev@social.tchncs.de avatar

Welche Kamera habt ihr eigentlich mit im Einsatz, um euch den remote anzuschauen oder zu filmen?

vordenken, to random German
@vordenken@fosstodon.org avatar

Successfully installed netdata on my proxmox homelab host for monitoring. Thought I wouldnt need monitoring with notifications but yesterday I looked at the host stats and the cpu usage spiked a few days ago... Took me a while to figure out but the issue was one vm ran out of disk space because of influxdb...

Now I get a notification via telegram if something is wrong - Nice!

taco, to bot

formally introducing: watchbird! 🦅

@watchbird is an for instances around the fediverse. it will CAW when something's up, like when it can't connect to the instance main page. it's powered by + + the .

if you want your added, just let me know here on mastodon @taco (reply or dm, whatever)! i'm constantly adding more servers that i find that i want to keep an eye on.

you can see the full page of monitored instances here: https://uptime.birdcat.cafe/status/fediverse

flameeyes, to sysadmin
@flameeyes@mastodon.social avatar

How do people make sure their CentOS 9 base host doesn't get stale?

I want to be alerted if any security update is due within a few hours. Is there something that can do that with Prometheus?

RichiH, to random
@RichiH@chaos.social avatar

For the 2024 edition of the and devroom received 61 submissions, a record high, and the average quality was also quite high.

Building a schedule is a luxury problem... I just sent out the first ten acceptance emails; we have two more slots and will build the schedule with actual times once the reconfirmations come in.

Good problem to have, but it was still hard...

dis, to homelab

Maybe I'm not high enough yet, but has anyone thrown together a UPS monitor using an ESP32 or PiZero?
Something I can slap onto the USB port and spit metrics either to MQTT or in Prometheus..

jfbucas, to debian

In the new version of , if you press the Tab key, you can see the for each process 😲

beandev, to 3DPrinting German
@beandev@social.tchncs.de avatar

So, die Logitech C615 scheint an mit zu funktionieren. Die Settings sind etwas tricky, weil das mit dem Autofocus nicht so einfach einzustellen ist.

FTWynn, to random

Teams should have a regular review to determine what of their data is actually being used. Otherwise, "just in case" becomes a value-less justification with uncapped costs.

technotim, to opensource
@technotim@mastodon.social avatar

Over the last few weeks I have been looking for a more advanced self-hosted monitoring system. One that gives me more than just a simple up and down status and one that is config based. I think I found it!

Check it out!

https://www.youtube.com/watch?v=LeZQjWlDUHs

resingm, to devops

To all you and folks out there - what would you recommend for a centralized and solution? I am using a central -ng service with a backend these days. But I am looking for a more sophisticated setup. Preferrably something that can be

If you can point me to some more resources, that would be great. The Internet suggests a whole bunch of different rabbit holes. I want to understand, how companies actually manage their log and monitoring stack without outsourcing to a 3rd party.

Cheers!

beandev, to random German
@beandev@social.tchncs.de avatar
GregCocks, to geopolitics
@GregCocks@techhub.social avatar
mboelen, to linux
@mboelen@mastodon.social avatar

The command iftop shows ongoing bandwidth usage on one or more network interfaces and is a great tool for troubleshooting network issues.

Doing some tool ⚙️ and network analysis, so great option to combine things: https://linux-audit.com/system-administration/commands/iftop/

Feedback and boosts welcome 🚀

  • All
  • Subscribed
  • Moderated
  • Favorites
  • megavids
  • tester
  • DreamBathrooms
  • thenastyranch
  • magazineikmin
  • osvaldo12
  • ethstaker
  • Youngstown
  • mdbf
  • slotface
  • rosin
  • ngwrru68w68
  • kavyap
  • GTA5RPClips
  • JUstTest
  • cisconetworking
  • InstantRegret
  • khanakhh
  • cubers
  • everett
  • Durango
  • tacticalgear
  • Leos
  • modclub
  • normalnudes
  • provamag3
  • anitta
  • lostlight
  • All magazines