Commit Graph

32 Commits

Author SHA1 Message Date
Lili Cosic
01d30382aa jsonnet/kube-prometheus/alerts: Add namespace to TargetDown msg 2019-10-29 14:36:14 +01:00
Simon Pasquier
b9504efef7 jsonnet/kube-prometheus/alerts: improve TargetDown message
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
2019-09-25 09:11:03 +02:00
Kyle Byerly
fda5811975 Add labels for namespace and service to TargetDown 2019-08-28 15:29:26 -06:00
paulfantom
bcbf54805a jsonnet/kube-prometheus/alerts: remove alerts provided by node_exporter mixin 2019-08-26 10:56:05 +02:00
Lili Cosic
909f51b3bd jsonnet/kube-prometheus: Prevent many-to-many matching
If there is more than one prometheus-operator pod, which happens briefly
when we delete the prometheus-operator pod, we see many-to-many
matching errors. This whitelists the labels used for matching and
excludes the pod.
2019-08-16 15:55:17 +02:00
Frederic Branczyk
e7d1ada775 Merge pull request #141 from dominikhahn/master
Increase ntp clockskewdetected accuracy to 5ms
2019-08-05 10:31:31 +02:00
Lili Cosic
ccb138374e jsonnet: Include node name in NodeDiskRunningFull
This makes it easier to identify which node the alert is firing for.
Currently only the device, namespace, and pod name are included in
the alert.
2019-07-24 17:52:39 +02:00
Frederic Branczyk
c8c850ef2b jsonnet: Use upstream prometheus-mixin 2019-07-11 15:35:48 +02:00
Hahn, Dominik
552b341bf6 Increase ntp clockskewdetected accuracy to 5ms 2019-07-05 11:40:07 +02:00
Frederic Branczyk
abd16f1b99 alerts: Replace ntp with timex metrics
A local ntp server is optional, while timex is in kernel and always
present. Some ntp daemons like chrony can also only be run in a client
mode, which synchronizes the kernel but doesn't expose an ntp server.
Using the timex metrics is a more reliable approach.
2019-04-24 10:29:13 +02:00
Frederic Branczyk
9524cbb406 alerts: Remove node interface down alert
This alert is not specific enough on the interfaces it selects which in
turn makes it too noisy.
2019-04-16 11:47:16 +02:00
Frederic Branczyk
30d2ef0f25 kube-prometheus: Add clock skew and node network interface alerts 2019-04-10 17:36:42 +02:00
Dalton Hubble
78bec4e48b Fix PrometheusTSDBWALCorruptions alert
* The `tsdb_wal_corruptions_total` metric name should be
`prometheus_tsdb_wal_corruptions_total`
2019-03-10 14:50:09 -07:00
Frederic Branczyk
31de04d69c kube-prometheus: Rename DeadMansSwitch to Watchdog 2019-02-05 10:04:18 +01:00
Golubkov Igor
48ab110e09 Update unit tests 2018-11-23 09:39:14 +03:00
Golubkov Igor
95bb49aa4e Fix AlertmanagerMembersInconsistent rule 2018-11-22 17:33:14 +03:00
Matthias Loibl
49835437f7 Add AlertmanagerMembersInconsistent alerting rule 2018-10-26 17:30:14 +02:00
kkc
012a60163e contrib: fix typo in contrib prometheus rules
Change `Promehteus` to `Prometheus`
2018-10-20 00:15:02 +08:00
Matthias Loibl
e1ed50158b contrib/kube-prometheus: Fix AlertmanagerConfigInconsistent alert expression 2018-10-12 14:43:20 +02:00
Matthias Loibl
acd8924d57 Add triggered_total metric to alertmanager controller
Update client_golang for wrappable registerer
2018-10-10 18:06:00 +02:00
Lucas Serven
3ff5d01277 contrib: remove duplicate alertmanager alert 2018-09-28 11:40:12 +02:00
Lucas Servén Marín
bb2a9e87bc Merge pull request #1891 from metalmatze/prometheus-operator-alerts
kube-prometheus: Add Prometheus Operator alerts
2018-09-14 14:22:10 +02:00
Matthias Loibl
24141c464f contrib/kube-prometheus: Improve consistency of Prometheus Operator alerts 2018-09-14 13:33:49 +02:00
Matthias Loibl
407aaa5e2f contrib/kube-prometheus: Alert on 10% errors when reconciling Prom & Alertmanager 2018-09-14 11:08:15 +02:00
Matthias Loibl
8d38e81521 contrib/kube-prometheus: Create missing Prometheus operator alerts 2018-09-14 11:05:28 +02:00
Matthias Loibl
483a1c115f contrib/kube-prometheus: Fix outdated alerts to use message annotation 2018-09-13 14:59:50 +02:00
Lucas Serven
34dd0a798a contrib/.../alerts: improve alert messages 2018-09-07 15:23:51 +02:00
Frederic Branczyk
6536496716 kube-prometheus: Improve disk alert
Only trigger disk running full alerts when the disk capacity is over 85%
and use recording rule of the kubernetes-mixin to only consider real
physical filesystems.
2018-09-05 23:00:57 +02:00
Henrique Fernandes
25bb07fe7d Fix typos 2018-08-08 10:14:23 -03:00
Ali Rizwan
d728ab5511 Only alert for nodes that currently exist (#1661) 2018-07-23 12:20:54 +02:00
Aleksandar Topuzovic
64c73eafe5 Ignore '/etc/*' mountpoints
* Ignores '/etc/resolv.conf', '/etc/hosts' and '/etc/hostname' mountpoints
* Otherwise 3 additional alerts are generated
2018-06-05 14:59:40 +01:00
Frederic Branczyk
64db049d3a kube-prometheus: Migrate kube-prometheus alerts to jsonnet 2018-05-29 11:06:59 +02:00