Closed Bug 1588596 Opened 5 years ago Closed 3 years ago

Increase host monitoring on posix workers

Categories

(Infrastructure & Operations :: RelOps: Posix OS, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: dhouse, Assigned: dhouse)

References

Details

Attachments

(1 file)

(deleted), text/x-github-pull-request
Details

In bug 1585750, we are starting to monitor the cpu/memory/disk/thermal+ on the macos mojave machines. Let's do that on the linux hardware also:

  1. in telegraf on the machines (already in place, need to turn on the cpu/mem/disk input plugins)
  2. through the moonshot ipmi interface, can we check temperature, power-usage, other for the chassis and cartridges? or route the alerts into telegraf instead of email(current)?
Attached file GitHub Pull Request (deleted) —

I've run telegraf cpu/disk/temp inputs on a handful of the linux64 and yosemite workers over the last week. I had to modify the temp collection for yosemite (from what I am using on mojave) because of older awk, but it is working the same now.

Status: NEW → RESOLVED
Closed: 3 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Creator:
Created:
Updated:
Size: