ibm-mq
Overview
Jsonnet source code is available at github.com/grafana/jsonnet-libs
Alerts
Complete list of pregenerated alerts is available here.
ibm-mq-alerts
IBMMQExpiredMessages
alert: IBMMQExpiredMessages
annotations:
description: The number of expired messages in the {{$labels.qmgr}} is {{$labels.value}}
which is above the threshold of 2.
summary: There are expired messages, which imply that application resilience is
failing.
expr: |
sum without (description,hostname,instance,job,platform) (ibmmq_qmgr_expired_message_count) > 2
for: 5m
labels:
severity: critical
IBMMQStaleMessages
alert: IBMMQStaleMessages
annotations:
description: A stale message with an age of {{$labels.value}} has been sitting in
the {{$labels.queue}} which is above the threshold of 300s.
summary: Stale messages have been detected.
expr: |
sum without (description,instance,job,platform) (ibmmq_queue_oldest_message_age) >= 300
for: 5m
labels:
severity: warning
IBMMQLowDiskSpace
alert: IBMMQLowDiskSpace
annotations:
description: The amount of disk space available for {{$labels.qmgr}} is at {{$labels.value}}%
which is below the threshold of 5%.
summary: There is limited disk available for a queue manager.
expr: |
sum without (description,hostname,instance,job,platform) (ibmmq_qmgr_queue_manager_file_system_free_space_percentage) <= 5
for: 5m
labels:
severity: critical
IBMMQHighQueueManagerCpuUsage
alert: IBMMQHighQueueManagerCpuUsage
annotations:
description: The amount of CPU usage for the queue manager {{$labels.qmgr}} is at
{{$labels.value}}% which is above the threshold of 85%.
summary: There is a high CPU usage estimate for a queue manager.
expr: |
sum without (description,hostname,instance,job,platform) (ibmmq_qmgr_user_cpu_time_estimate_for_queue_manager_percentage) >= 85
for: 5m
labels:
severity: critical
Dashboards
Following dashboards are generated from mixins and hosted on github: