#monitoring - kbin.social

Viss, 1 year ago to opensource

so is zabbix the go to opensource system monitoring tool now? or is it observium?

#sysadmin #sre #zabbix #observium #opensource #monitoring

reply

expand (19)

collapse (19)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ jerry

thisismissem, 11 months ago to Redis

Does anyone know of a tool to monitor #redis pub/sub topics? e.g., messages per second on topics, average message size in bytes, etc.

I know the INFO command gives total stats for all commands, but it'd be nice to have more detail into pub/sub. I'm sure we used to have this at pusher.com, but not sure how, and that was over a decade ago, before they moved to an in-house developed messaging infrastructure

#monitoring #devops

reply

expand (14)

collapse (14)

report

activity

copy /kbin url

copy original url

open original url

Loading...

paigerduty, 18 days ago to random

when you hear the term "monitoring debt" what comes to mind?

#monitoring #observability #o11y #prometheus #OpenTelemetry #sre

reply

expand (8)

collapse (8)

report

activity

copy /kbin url

copy original url

open original url

Loading...

DefectiveWings, 1 year ago to tech

What are y'all using for on-prem service health/ #monitoring? Our #nagios install needs a rebuild, and now is as good a time as any to look at other options.

We have a mixed environment of Windows Server 2012r2, 2022, CentOS 7 and Ubuntu 20.04, if that makes any difference.

#tech #it #sysadmin #systems

reply

expand (7)

collapse (7)

report

activity

copy /kbin url

copy original url

open original url

Loading...

taco, 10 months ago to sysadmin

another reason i love #uptimekuma is being able to see what went wrong with a request. here you can see that https://cyberfurz.social went down for a few minutes, but recovered. 502 is a bad gateway error, so most likely nothing was wrong with the server software itself, just a faulty network connection down the line. really neat stuff!

i'm adding more monitors to https://uptime.birdcat.cafe/status/fediverse now :3

#monitoring #sre #sysadmin #mastoadmin

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

rolle, 6 months ago to IT

Just set up a log watcher via Datadog for my Mastodon server cluster, because it's the most pretty and easiest for me in this situation. Works amazing and now I will receive a notification from even a tiniest anomaly!

Check out the status of the servers at https://status.mementomori.social

You are in safe hands here.

#MementoMoriSocial #MastoAdmin #Datadog #Monitoring #SysOp #Servers

Logging in mementomori.social's Datadog panel
CPU usage graph
Fatal errors monitor

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

FTWynn, 7 months ago to random

If you aren't keeping track of these 4 usage areas in your Observability tooling, you'll never be able to optimize its value.

#observability #o11y #monitoring

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

vwbusguy, 2 months ago to devops

Hey there, #DevOps and #SysAdmin friends - Have you tried #CheckMK? Do you like it? Was it worth the money?

If you've used regular Nagios or Centreon before, how does it compare?

Did it integrate well with your existing kubernetes/prometheus infra?

How was it to migrate into? Was the service discovery intuitive?

https://checkmk.com

#monitoring

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

iulia_, 4 months ago to python

Looking to build a simple #python #app on top of my search project for a nicer #UI and eventually doing some #monitoring & #logging of user activity. Haven’t worked on this since uni when we used #flask and sometimes a bit of #django. Anyone have any good insights / best practices / favorite reference for something like this?

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ CodenameTim

freekmurze, 9 months ago to random

💪 Reached a nice milestone: https://ohdear.app has sent out 10 million notifications 🚨
#monitoring

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

vinzv, 8 months ago to grafana German

Prometheus und Grafana installiert, zum ersten Mal damit rumgespielt. Jessas, ich bin völlig überfordert. 🤯

#Monitoring #Grafana #Prometheus

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

beandev, 8 months ago to 3DPrinting German

Welche Kamera habt ihr eigentlich mit #Klipper im Einsatz, um euch den #3ddruck remote anzuschauen oder zu filmen?

#3dprinting #camera #monitoring

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

vordenken, 11 months ago to random German

Successfully installed netdata on my proxmox homelab host for monitoring. Thought I wouldnt need monitoring with notifications but yesterday I looked at the host stats and the cpu usage spiked a few days ago... Took me a while to figure out but the issue was one vm ran out of disk space because of influxdb...

Now I get a notification via telegram if something is wrong - Nice!

#homelab #proxmox #monitoring

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

taco, 10 months ago to bot

formally introducing: watchbird! 🦅

@watchbird is an #uptime #bot for instances around the fediverse. it will CAW when something's up, like when it can't connect to the instance main page. it's powered by #uptimekuma + #apprise + the #mastodon #api.

if you want your #instance added, just let me know here on mastodon @taco (reply or dm, whatever)! i'm constantly adding more servers that i find that i want to keep an eye on.

you can see the full page of monitored instances here: https://uptime.birdcat.cafe/status/fediverse

#bots #mastodonbots #mastoadmin #monitoring #monitor

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

flameeyes, 8 months ago to sysadmin

How do people make sure their CentOS 9 base host doesn't get stale?

I want to be alerted if any security update is due within a few hours. Is there something that can do that with Prometheus?

#SysAdmin #CentOS9 #Monitoring

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

RichiH, 5 months ago to random

For the 2024 edition of #FOSDEM the #Monitoring and #Observability devroom received 61 submissions, a record high, and the average quality was also quite high.

Building a schedule is a luxury problem... I just sent out the first ten acceptance emails; we have two more slots and will build the schedule with actual times once the reconfirmations come in.

Good problem to have, but it was still hard...

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

dis, 7 months ago to homelab

Maybe I'm not high enough yet, but has anyone thrown together a UPS monitor using an ESP32 or PiZero?
Something I can slap onto the USB port and spit metrics either to MQTT or in Prometheus.. #infrastructure #homelab #ups #snarkhome #esp32 #monitoring #mqtt #prometheus

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ kkarhan

jfbucas, 8 months ago to debian

In the new #Debian #Bookworm version of #htop, if you press the Tab key, you can see the #IO for each process 😲

#unix #monitoring #linux #sysadm #sysadmin

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ kkarhan

beandev, 8 months ago to 3DPrinting German

So, die Logitech C615 scheint an #Crowsnest mit #Klipper zu funktionieren. Die Settings sind etwas tricky, weil das mit dem Autofocus nicht so einfach einzustellen ist.

#3dprinting #monitoring

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

FTWynn, 8 months ago to random

Teams should have a regular review to determine what of their #Observability data is actually being used. Otherwise, "just in case" becomes a value-less justification with uncapped costs.

#o11y #monitoring

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

technotim, 2 months ago to opensource

Over the last few weeks I have been looking for a more advanced self-hosted monitoring system. One that gives me more than just a simple up and down status and one that is config based. I think I found it!

Check it out!

https://www.youtube.com/watch?v=LeZQjWlDUHs

#uptime #opensource #monitoring #dashboard

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

resingm, 8 months ago to devops

To all you #devops and #sysadmin folks out there - what would you recommend for a centralized #monitoring and #logging solution? I am using a central #syslog-ng service with a #postgresql backend these days. But I am looking for a more sophisticated setup. Preferrably something that can be #selfhosted

If you can point me to some more resources, that would be great. The Internet suggests a whole bunch of different rabbit holes. I want to understand, how companies actually manage their log and monitoring stack without outsourcing to a 3rd party.

Cheers!

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

beandev, 11 months ago to random German

Ich habe nun ein #Shelly 3EM.

#HomeAssistant #energy #monitoring

Animiertes "Monitoring" mit blauen Wellenformen von irgendwas, was nicht real ist

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

GregCocks, 28 days ago to geopolitics

The Widest-Ever Global Coral Crisis Will Hit Within Weeks, Scientists Say [remote sensing]

https://www.nytimes.com/2024/04/15/climate/coral-reefs-bleaching.html <-- shared media article

https://www.noaa.gov/news-release/noaa-confirms-4th-global-coral-bleaching-event <-- NOAA technical announcement

https://coralreefwatch.noaa.gov/ <-- NOAA Coral Reef Watch home page

#GIS #spatial #mapping #remotesensing #earthobservation #global #satellite #coralreefwatch #coral #coralreef #coralbleaching #bleaching #temperatures #climatechange #algae #ecosystems #monitoring #fish #fisheries #foodsecurity #economicbenefits #marine #coast #coastal
@NOAA

photo - bleached coral
photos - bleached coral
global map - NOAA Coral Reef Watch 5km Bleaching Alert Area Maximum, 01/01/23 --> 04/10/24

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ pinskal, ai6yr

mboelen, 3 days ago to linux

The command iftop shows ongoing bandwidth usage on one or more network interfaces and is a great tool for troubleshooting network issues.

Doing some tool ⚙️ and network analysis, so great option to combine things: https://linux-audit.com/system-administration/commands/iftop/

Feedback and boosts welcome 🚀

#linux #devops #monitoring #performance

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ ncrav