Lili Cosic
01d30382aa
jsonnet/kube-prometheus/alerts: Add namespace to TargetDown msg
2019-10-29 14:36:14 +01:00
Simon Pasquier
b9504efef7
jsonnet/kube-prometheus/alerts: improve TargetDown message
...
Signed-off-by: Simon Pasquier <spasquie@redhat.com >
2019-09-25 09:11:03 +02:00
Kyle Byerly
fda5811975
Add labels for namespace and service to TargetDown
2019-08-28 15:29:26 -06:00
paulfantom
bcbf54805a
jsonnet/kube-prometheus/alerts: remove alerts provided by node_exporter mixin
2019-08-26 10:56:05 +02:00
Lili Cosic
909f51b3bd
jsonnet/kube-prometheus: Prevent many-to-many matching
...
If there is more than one prometheus-operator pod, which happens briefly
when we delete the prometheus-operator pod, we can see the errors of
many-to-many matching, this whitelists the labels matching, and excluded
the pod.
2019-08-16 15:55:17 +02:00
Frederic Branczyk
e7d1ada775
Merge pull request #141 from dominikhahn/master
...
Increase ntp clockskewdetected accuracy to 5ms
2019-08-05 10:31:31 +02:00
Lili Cosic
ccb138374e
jsonnet: Include node name in NodeDiskRunningFull
...
This makes it easier to indetify which node alert is firing for.
Currently only device namespace and pod name were included in
the alert.
2019-07-24 17:52:39 +02:00
Frederic Branczyk
c8c850ef2b
jsonnet: Use upstream prometheus-mixin
2019-07-11 15:35:48 +02:00
Hahn, Dominik
552b341bf6
Increase ntp clockskewdetected accuracy to 5ms
2019-07-05 11:40:07 +02:00
Frederic Branczyk
abd16f1b99
alerts: Replace ntp with timex metrics
...
A local ntp server is optional, while timex is in kernel and always
present. Some ntp daemons like chrony can also only be run in a client
mode, which synchronizes the kernel but doesn't expose an ntp server.
Using the timex metrics is a more reliable approach.
2019-04-24 10:29:13 +02:00
Frederic Branczyk
9524cbb406
alerts: Remove node interface down alert
...
This alert is not specific enough on the interfaces it selects which in
turn makes it too noisy.
2019-04-16 11:47:16 +02:00
Frederic Branczyk
30d2ef0f25
kube-prometheus: Add clock skew and node network interface alerts
2019-04-10 17:36:42 +02:00
Dalton Hubble
78bec4e48b
Fix PrometheusTSDBWALCorruptions alert
...
* `tsdb_wal_corruptions_total metric` name should be
`prometheus_tsdb_wal_corruptions_total`
2019-03-10 14:50:09 -07:00
Frederic Branczyk
31de04d69c
kube-prometheus: Rename DeadMansSwitch to Watchdog
2019-02-05 10:04:18 +01:00
Golubkov Igor
48ab110e09
Update unit tests
2018-11-23 09:39:14 +03:00
Golubkov Igor
95bb49aa4e
Fix AlertmanagerMembersInconsistent rule
2018-11-22 17:33:14 +03:00
Matthias Loibl
49835437f7
Add AlertmanagerMembersInconsistent alerting rule
2018-10-26 17:30:14 +02:00
kkc
012a60163e
contrib: fix typo in contrib prometheus rules
...
Change `Promehteus` to `Prometheus`
2018-10-20 00:15:02 +08:00
Matthias Loibl
e1ed50158b
contrib/kube-prometheus: Fix AlertmanagerConfigInconsistent alert expression
2018-10-12 14:43:20 +02:00
Matthias Loibl
acd8924d57
Add triggered_total metric to alertmanager controller
...
Update client_golang for wrappable registerer
2018-10-10 18:06:00 +02:00
Lucas Serven
3ff5d01277
contrib: remove duplicate alertmanager alert
2018-09-28 11:40:12 +02:00
Lucas Servén Marín
bb2a9e87bc
Merge pull request #1891 from metalmatze/prometheus-operator-alerts
...
kube-prometheus: Add Prometheus Operator alerts
2018-09-14 14:22:10 +02:00
Matthias Loibl
24141c464f
contrib/kube-prometheus: Improve consistency of Prometheus Operator alerts
2018-09-14 13:33:49 +02:00
Matthias Loibl
407aaa5e2f
contrib/kube-prometheus: Alert in 10% erros when reconciling Prom & Alertmanager
2018-09-14 11:08:15 +02:00
Matthias Loibl
8d38e81521
contrib/kube-prometheus: Create missing Prometheus operator alerts
2018-09-14 11:05:28 +02:00
Matthias Loibl
483a1c115f
contrib/kube-prometheus: Fix outdated alerts to use message annotation
2018-09-13 14:59:50 +02:00
Lucas Serven
34dd0a798a
contrib/.../alerts: improve alert messages
2018-09-07 15:23:51 +02:00
Frederic Branczyk
6536496716
kube-prometheus: Improve disk alert
...
Only trigger disk running full alerts when the disk capacity is over 85%
and use recording rule of the kubernetes-mixin to only consider real
physical filesystems.
2018-09-05 23:00:57 +02:00
Henrique Fernandes
25bb07fe7d
Fix typos
2018-08-08 10:14:23 -03:00
Ali Rizwan
d728ab5511
Only alert for nodes that currently exist ( #1661 )
2018-07-23 12:20:54 +02:00
Aleksandar Topuzovic
64c73eafe5
Ignore '/etc/*' mountpoints
...
* Ignores '/etc/resolv.conf', '/etc/hosts' and '/etc/hostname' mountpoints
* Otherwise 3 additional alerts are generated
2018-06-05 14:59:40 +01:00
Frederic Branczyk
64db049d3a
kube-prometheus: Migrate kube-prometheus alerts to jsonnet
2018-05-29 11:06:59 +02:00