Hi, I have a bunch of Raspberry Pis hosting all kinds of stuff, and I want a monitoring solution for all of them. What would you recommend?

My goal is to have an overview of CPU load, network load, and CPU temperature, and to see what’s going on inside my Docker containers, as I have everything dockerized. I’d like the solution to be open source and accessible from a web browser, with nice load graphs that keep history. I don’t want to spend too much time setting it up.

All my Pis are running Raspberry Pi OS, which is Debian-based.

  • @johntash@eviltoast.org
    11 months ago

    I didn’t see it recommended yet: Uptime Kuma is really simple if you just want to monitor the basics without an agent, such as whether a URL responds, ping, TCP, etc.

    It doesn’t do CPU/memory style metrics, but I find myself checking it more often because of how simple it is.
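To illustrate what an agentless TCP check amounts to, here is a minimal Python sketch. It is not how Uptime Kuma is implemented; the function name `tcp_port_open` and the example address are hypothetical, and only the standard library is assumed.

```python
import socket


def tcp_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper for illustration; nothing needs to be installed
    on the monitored machine for this kind of check.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

For example, `tcp_port_open("192.168.1.10", 80)` (a made-up address) would report whether a web server on that Pi accepts connections. It says nothing about CPU or memory, which is the limitation mentioned above.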

    • @Aux@lemmy.world (OP)
      11 months ago

      I need CPU and other metrics because one of my Docker containers recently got infected with DDoS software, and the CPU spike was the telltale sign.

      • TheMurphy
        11 months ago

        Omg, I have CPU spikes on my Raspberry Pi too. Maybe it’s infected as well. How would I ever find out?

        Is there some software I can run to check?

        • @Aux@lemmy.world (OP)
          11 months ago

          Are they small spikes spread across time, or large chunks of heavy load, like 80%+ for hours? If it’s the former, it’s probably just normal operation. Otherwise, check your running processes and start tracking what’s going on during high load.
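The distinction between short spikes and sustained load can be sketched in Python as below. This is only a rough illustration: the function names, the one-sample-per-minute interval, and the one-hour cutoff are assumptions (the 80% threshold comes from the comment above), and the samples would have to come from whatever monitoring tool you use.

```python
from typing import Sequence


def longest_run_above(samples: Sequence[float], threshold: float = 80.0) -> int:
    """Length of the longest run of consecutive samples at or above threshold."""
    longest = current = 0
    for value in samples:
        if value >= threshold:
            current += 1
            longest = max(longest, current)
        else:
            current = 0
    return longest


def classify_load(
    samples: Sequence[float],
    interval_s: int = 60,      # assumed: one CPU % sample per minute
    threshold: float = 80.0,   # "80%+" from the comment above
    sustained_s: int = 3600,   # assumed: "for hours" starts at one hour
) -> str:
    """Label a series of CPU % samples as 'sustained', 'spikes', or 'quiet'."""
    run_s = longest_run_above(samples, threshold) * interval_s
    if run_s >= sustained_s:
        return "sustained"
    if run_s > 0:
        return "spikes"
    return "quiet"
```

With these assumed defaults, three minutes at 100% would be labeled `"spikes"`, while two hours above 80% would be labeled `"sustained"` and is the case worth investigating.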

          • TheMurphy
            11 months ago

            I would say it’s 100% load for maybe 3 minutes, so maybe it’s normal.

            It overloads my system, so my Pi-hole stops processing requests.

            But it sounds like it might be normal, just a background service that sometimes uses too much CPU?