varnish
Overview
Jsonnet source code is available at github.com/grafana/jsonnet-libs
Alerts
Complete list of pregenerated alerts is available here.
varnish-cache
VarnishCacheLowCacheHitRate
alert: VarnishCacheLowCacheHitRate
annotations:
description: The Cache hit rate is {{ printf "%.0f" $value }} percent over the last
5 minutes on {{$labels.instance}}, which is below the threshold of 80 percent.
summary: Cache is not answering a sufficient percentage of read requests.
expr: |
increase(varnish_main_cache_hit[10m]) / (clamp_min((increase(varnish_main_cache_hit[10m]) + increase(varnish_main_cache_miss[10m])), 1)) * 100 < 80 and (increase(varnish_main_cache_hit[10m]) + increase(varnish_main_cache_miss[10m]) > 0)
for: 10m
labels:
severity: warning
VarnishCacheHighMemoryUsage
alert: VarnishCacheHighMemoryUsage
annotations:
description: Current Memory Usage is {{ printf "%.0f" $value }} percent on {{$labels.instance}},
which is above the threshold of 90 percent.
summary: Varnish Cache is running low on available memory.
expr: |
(varnish_sma_g_bytes{type="s0"} / (varnish_sma_g_bytes{type="s0"} + varnish_sma_g_space{type="s0"})) * 100 > 90
for: 5m
labels:
severity: warning
VarnishCacheHighCacheEvictionRate
alert: VarnishCacheHighCacheEvictionRate
annotations:
description: The Cache has evicted {{ printf "%.0f" $value }} objects over the last
5 minutes on {{$labels.instance}}, which is above the threshold of 0.
summary: The cache is evicting too many objects.
expr: |
increase(varnish_main_n_lru_nuked[5m]) > 0
for: 5m
labels:
severity: critical
VarnishCacheHighSaturation
alert: VarnishCacheHighSaturation
annotations:
description: The thread queue length is {{ printf "%.0f" $value }} over the last
5 minutes on {{$labels.instance}}, which is above the threshold of 0.
summary: There are too many threads in queue, Varnish is saturated and responses
are slowed.
expr: |
varnish_main_thread_queue_len > 0
for: 5m
labels:
severity: warning
VarnishCacheSessionsDropping
alert: VarnishCacheSessionsDropping
annotations:
description: The amount of sessions dropped is {{ printf "%.0f" $value }} over the
last 5 minutes on {{$labels.instance}}, which is above the threshold of 0.
summary: Incoming requests are being dropped due to a lack of free worker threads.
expr: |
increase(varnish_main_sessions{type="dropped"}[5m]) > 0
for: 5m
labels:
severity: critical
VarnishCacheBackendUnhealthy
alert: VarnishCacheBackendUnhealthy
annotations:
description: The amount of unhealthy backend statuses detected is {{ printf "%.0f"
$value }} over the last 5 minutes on {{$labels.instance}}, which is above the
threshold of 0.
summary: Backend has been marked as unhealthy due to slow 200 responses.
expr: |
increase(varnish_main_backend_unhealthy[5m]) > 0
for: 5m
labels:
severity: critical
Dashboards
Following dashboards are generated from mixins and hosted on github: