Commit Graph

57 Commits

Author SHA1 Message Date
Frederic Branczyk
7b9d97de7f Remove rules that have been migrated to kubernetes-mixins 2018-05-28 10:30:37 +02:00
piglei™
a9e667d24c kube-prometheus: fix alert rule K8SManyNodesNotReady (#1313)
* kube-prometheus: fix alert rule K8SManyNodesNotReady

* fix alert "K8SManyNodesNotReady" in helm templates & make generate

* Use sync_kube_prometheus.py to make rules in helm in sync
2018-05-11 10:59:34 +02:00
Frederic Branczyk
0d142fe9da Merge pull request #1205 from zuzzas/cpu-rules
Fixed CPU accounting in recording rules
2018-04-11 15:08:15 +02:00
Arslanbekov Denis
a2d273b11a In description is displayed correctly namespace (#1190)
* in description is displayed correctly namespace

* Bump kube state version

* Update Chart.yaml
2018-04-10 17:18:24 +02:00
Andrey Klimentyev
f9b03ddd9d kube-prometheus: fixed CPU accounting
Currently, node recording rules feature
an incorrect idle CPU accounting. This
change aims to fix that.
2018-04-10 15:14:48 +03:00
Max Leonard Inden
b10e343689 kube-prometheus: Fix minor typo 2018-04-10 10:27:54 +02:00
Sébastien GLON
2c10f81102 Add new alert for samples rejected due ti duplicate timestamp (#1148)
Signed-off-by: Sébastien GLON <sebastien.glon@akeneo.com>
2018-03-26 18:11:40 +02:00
Alexander Holte-Davidsen
4c77a9db1d Update Alert Manager rules for NodeDiskRunningFull with summary 2018-03-22 11:32:38 +01:00
Alexander Holte-Davidsen
8b6ee5c18b Add summary to Alertmanager rules where missing - updated accoring to guidelines 2018-03-05 09:52:51 +01:00
Akihito INOH
7fe4506ae4 Update alert rule for kubelet
Update alert rule check kubelet down ratio from 1% to 10%.
In #774 , it is changed to 1%, so returns to 10%.
2018-03-01 14:10:27 +09:00
Antoine Legrand
0ae6c98a48 Add alert if it no samples are ingested 2018-02-26 14:57:39 +01:00
Frederic Branczyk
85f88025f3 kube-prometheus: Upgrade to grafana v5 2018-02-09 13:21:37 +01:00
Antoine Legrand
bcb0ba9974 Add cert expiration rules 2018-01-16 17:20:00 +01:00
Frederic Branczyk
a74a54cd06 Merge pull request #893 from brancz/bump-versions
kube-prometheus: bump various versions
2018-01-16 10:12:36 +01:00
Frederic Branczyk
aacc95b74c kube-prometheus: bump various versions 2018-01-15 14:13:26 +01:00
Matthias Loibl
85f384876e Update kube-state-metrics rules to 1.2 (#884)
* Update kube-state-metrics rules to 1.2

* Run make generate to update all manifests

* Fix the helm chart kube-state-metrics rules
2018-01-13 11:39:31 +01:00
Frederic Branczyk
d05a3ac486 kube-prometheus: Make grafana dashboards non-editable 2018-01-11 10:56:07 +01:00
Frederic Branczyk
5392443721 kube-prometheus: Add etcd dashboard 2018-01-10 14:46:00 +01:00
Giancarlo Rubio
68517f63b5 Delete chart exporter-kube-api because it has been replaced by kube-controller-manager alerts 2017-12-27 09:36:12 +01:00
Giancarlo Rubio
22eef956af Add script to keep kube-prometheus rules in sync with helm charts
Bump prometheus to 2.0.0, prometheus-operator to 0.15.0, alertmanager to 0.12.0 and node-exporter to 0.15.1, grafana to 4.6.3
migrate prometheus alerts to yaml notation
2017-12-27 09:35:53 +01:00
Frederic Branczyk
a9fedc6343 kube-prometheus: Update etcd3 rules 2017-12-22 16:09:13 +01:00
Bradley
fb01fe91dc Adding requested and limit values to CPU and limit value to memory 2017-12-07 18:35:58 +00:00
Frederic Branczyk
2454f1054e Merge pull request #789 from slok/metric-minor-fix
Fix cluster:container_cpu_usage:ratio rule
2017-12-01 10:43:36 +01:00
Frederic Branczyk
3afc174fc5 kube-prometheus: Add Prometheus 2.0 rules 2017-11-29 10:59:44 +01:00
Xabier Larrakoetxea
d6a2b717d3 Fix cluster:container_cpu_usage:ratio rule on prometheus kubernetes files
Signed-off-by: Xabier Larrakoetxea <slok69@gmail.com>
2017-11-28 14:52:16 +01:00
Frederic Branczyk
a37ad3a270 kube-prometheus: sync rules 2017-11-21 16:43:28 +01:00
Frederic Branczyk
7615244a60 Merge pull request #756 from iJanki/fix_api_latency_rule
Fixing #751 K8SApiServerLatency always triggering
2017-11-16 14:12:38 +01:00
Cesarini, Daniele
727d053dd4 Fixing #751 K8SApiServerLatency always triggering 2017-11-14 15:48:14 +00:00
Aleksandar Topuzovic
598d6779cd Alert on daemonset problems
* If any of the rules is active > 10m
* If all daemonsets are not ready
* If all daemonsets are not scheduled
* If some are miss scheduled
2017-11-14 14:36:22 +00:00
Konstantinos Natsakis
d80eaea23a kube-prometheus: use StatefulSet for dashboard title 2017-11-09 18:33:25 +02:00
Konstantinos Natsakis
85ddb3137c kube-prometheus: add stateful sets dashboard 2017-11-07 16:44:05 +02:00
Arve Knudsen
d04cccc526 Use grafanalib to generate Grafana dashboards 2017-10-30 22:05:25 +01:00
Frederic Branczyk
1b7c8cdf21 *: bump Prometheus to v2.0.0-rc.1 2017-10-17 20:13:40 +02:00
Frederic Branczyk
6ed84502c8 kube-prometheus: fix multiple series error in grafana dashboard 2017-10-16 14:40:29 +02:00
Frederic Branczyk
40fa4ccd15 grafana-dashboards: various small improvements 2017-09-26 15:59:44 +02:00
Frederic Branczyk
c8cb2df928 kube-prometheus: exclude pod log subresource from latency alerts 2017-09-18 11:11:30 +02:00
Frederic Branczyk
dfd2ee2847 assets: modify and add grafana dashboards 2017-09-07 13:44:12 +02:00
crandl201
e48278f397 update kube-state rules for 1.0.0 2017-08-17 20:05:55 -04:00
Zachary Yonash
7010e32130 Added a few extra node rules (#478) 2017-07-27 09:49:25 +02:00
Frederic Branczyk
a5533a4f6c kube-prometheus: ensure triggering alerts on down targets 2017-06-28 14:11:05 +02:00
Frederic Branczyk
915677eaa2 Revert "alerting rules: replace severity with action" 2017-06-15 10:45:51 +02:00
Frederic Branczyk
a1afce8707 alerting rules: replace severity with action 2017-06-15 09:34:59 +02:00
chenxingyu
98cdf68a0c fix alert rule bug 2017-06-13 16:40:56 +08:00
Frederic Branczyk
4da7a872ba kube-prometheus: add comment on apiserver latency unit 2017-06-06 15:34:10 +02:00
Frederic Branczyk
0c35d73e2c kube-prometheus: drop conntrack alerts and direct up alerts 2017-06-06 15:22:28 +02:00
Frederic Branczyk
30cbd76944 kube-prometheus: add PROXY verb to latency alert exclusion 2017-05-31 06:39:35 -07:00
Frederic Branczyk
804f6c187b kube-prometheus: add dead man's switch 2017-05-30 17:15:59 -07:00
Frederic Branczyk
c4b382be6f kube-prometheus: add alerting rules 2017-05-30 17:15:34 -07:00
Gytis
e810357b8f Rename kube_pod_container_requested_memory_bytes -> kube_pod_container_resource_requests_memory_bytes in grafana dashboard 2017-05-09 12:15:59 +03:00
Frederic Branczyk
ce0a9caae7 kube-prometheus: fix deployment dashboard multiple values error 2017-04-26 16:09:15 +02:00