Alarming refactoring
In order to enable several uses cases we must rework alarming framework
Blueprint information
- Status:
- Complete
- Approver:
- None
- Priority:
- Essential
- Drafter:
- Swann Croiset
- Direction:
- Approved
- Assignee:
- Swann Croiset
- Definition:
- Review
- Series goal:
- Accepted for 1.0
- Implementation:
- Implemented
- Milestone target:
- 1.0.0
- Started by
- Swann Croiset
- Completed by
- Swann Croiset
Related branches
Related bugs
Sprints
Whiteboard
Gerrit topic: https:/
Addressed by: https:/
Alarm definition refactoring
Addressed by: https:/
Do not send AFD to Nagios when activate_
Addressed by: https:/
Do not send GSE to Nagios when activate_
Addressed by: https:/
Removed old hiera data
Addressed by: https:/
Alarming refactoring
Addressed by: https:/
Support activate_alarming and enable_notification
Addressed by: https:/
Support activate_alerting and enable_notification properties for GSE
Addressed by: https:/
Add default AFD for unknown fuel roles
Addressed by: https:/
Simply AFD alarm field structure
Addressed by: https:/
Send GSE service clusters status to alerting
Gerrit topic: https:/
Addressed by: https:/
Avoid alarm flapping for Ceph OSD checks
Addressed by: https:/
Fix horizon alarm
Addressed by: https:/
Add alarm for Horizon HTTP 5xx errors
Addressed by: https:/
Monitor all partitions
Addressed by: https:/
Configure alarms for OSD disk(s)
Addressed by: https:/
Include Ceph OSD node to the storage cluster
Addressed by: https:/
Fix the InfluxDB VIP check to map the GSE configuration
Addressed by: https:/
Split top-level clusters health by (control|
Addressed by: https:/
Make Pacemaker global status dependent of controller cluster status
Addressed by: https:/
Decouple aggregator election from Pacemaker resource
Addressed by: https:/
[WIP] remove no_data_policy=skip
Addressed by: https:/
Support alerting attribute per AFD
Addressed by: https:/
Support alerting attribute per AFD
Addressed by: https:/
Enable notifications for HDD errors
Addressed by: https:/
Revert "Remove the no_data_policy=skip for AFD"
Addressed by: https:/
Add no_data_policy=skip for all workers alarms
Addressed by: https:/
Fix rabbitmq-pacemaker related alarms
Addressed by: https:/
Do not send cluster AFDs to Nagios