Ubuntu

Monitoring probes and alerting service (for UEC and more)

Registered by Mathias Gug on 2010-10-15

As a UEC admin I'm alerted by UEC when physical nodes go down or services are flaky.
As a UEC admin I can integrate my UEC deployement into my existing nagios system.

Notes: nagios probes, monitoring (munin, ganglia) probes.

Read the full specification

Blueprint information

Status:: Not started

Approver:: Robbie Williamson

Priority:: Undefined

Drafter:: None

Direction:: Needs approval

Assignee:: None

Definition:: Review

Series goal:: None

Implementation:: Deferred

Milestone target:: None

Related branches

Related bugs

Sprints

uds-n

Whiteboard

To be discusses:
Should monitoring (collectd) and logging (rsyslog) be using one network transport? If so, which one: collectd, relp syslog, reconnoiter?

Work Items:
Move collectd to main (MIR).
Refine relevant measures for UEC deployments.
Write collectd input plugins for each of them.
Refine monitoring probes for UEC deployments.
Provide nagios plugins for each of them.
Install collectd on every UEC components.
Install all monitoring and measuring probes on every UEC components.
Automatically setup collectd to send all monitoring data to central monitoring server (CLC) with puppet recipes.
Investigate graphing solutions (munin, graphite, reconnoiter (omniti - not packaged), visage, ganglia).

(?)

Work Items

This blueprint contains Public information

Everyone can see this information.

Subscribers

Andres Rodriguez

Boris Devouge

Clint Byrum

Dave Walker

Dustin Kirkland 

Eric Hammond

fosk

James Page

Joseph Salisbury

Mark Russell

Nick Barcet

Serge Hallyn

Soren Hansen

Tom Ellis

Torsten Spindler