Commit Graph

56 Commits

Author SHA1 Message Date
piglei™
a9e667d24c kube-prometheus: fix alert rule K8SManyNodesNotReady (#1313)
* kube-prometheus: fix alert rule K8SManyNodesNotReady

* fix alert "K8SManyNodesNotReady" in helm templates & make generate

* Use sync_kube_prometheus.py to make rules in helm in sync
2018-05-11 10:59:34 +02:00
Frederic Branczyk
0d142fe9da Merge pull request #1205 from zuzzas/cpu-rules
Fixed CPU accounting in recording rules
2018-04-11 15:08:15 +02:00
Arslanbekov Denis
a2d273b11a In description is displayed correctly namespace (#1190)
* in description is displayed correctly namespace

* Bump kube state version

* Update Chart.yaml
2018-04-10 17:18:24 +02:00
Andrey Klimentyev
f9b03ddd9d kube-prometheus: fixed CPU accounting
Currently, node recording rules feature
an incorrect idle CPU accounting. This
change aims to fix that.
2018-04-10 15:14:48 +03:00
Max Leonard Inden
b10e343689 kube-prometheus: Fix minor typo 2018-04-10 10:27:54 +02:00
Sébastien GLON
2c10f81102 Add new alert for samples rejected due ti duplicate timestamp (#1148)
Signed-off-by: Sébastien GLON <sebastien.glon@akeneo.com>
2018-03-26 18:11:40 +02:00
Alexander Holte-Davidsen
4c77a9db1d Update Alert Manager rules for NodeDiskRunningFull with summary 2018-03-22 11:32:38 +01:00
Alexander Holte-Davidsen
8b6ee5c18b Add summary to Alertmanager rules where missing - updated accoring to guidelines 2018-03-05 09:52:51 +01:00
Akihito INOH
7fe4506ae4 Update alert rule for kubelet
Update alert rule check kubelet down ratio from 1% to 10%.
In #774 , it is changed to 1%, so returns to 10%.
2018-03-01 14:10:27 +09:00
Antoine Legrand
0ae6c98a48 Add alert if it no samples are ingested 2018-02-26 14:57:39 +01:00
Frederic Branczyk
85f88025f3 kube-prometheus: Upgrade to grafana v5 2018-02-09 13:21:37 +01:00
Antoine Legrand
bcb0ba9974 Add cert expiration rules 2018-01-16 17:20:00 +01:00
Frederic Branczyk
a74a54cd06 Merge pull request #893 from brancz/bump-versions
kube-prometheus: bump various versions
2018-01-16 10:12:36 +01:00
Frederic Branczyk
aacc95b74c kube-prometheus: bump various versions 2018-01-15 14:13:26 +01:00
Matthias Loibl
85f384876e Update kube-state-metrics rules to 1.2 (#884)
* Update kube-state-metrics rules to 1.2

* Run make generate to update all manifests

* Fix the helm chart kube-state-metrics rules
2018-01-13 11:39:31 +01:00
Frederic Branczyk
d05a3ac486 kube-prometheus: Make grafana dashboards non-editable 2018-01-11 10:56:07 +01:00
Frederic Branczyk
5392443721 kube-prometheus: Add etcd dashboard 2018-01-10 14:46:00 +01:00
Giancarlo Rubio
68517f63b5 Delete chart exporter-kube-api because it has been replaced by kube-controller-manager alerts 2017-12-27 09:36:12 +01:00
Giancarlo Rubio
22eef956af Add script to keep kube-prometheus rules in sync with helm charts
Bump prometheus to 2.0.0, prometheus-operator to 0.15.0, alertmanager to 0.12.0 and node-exporter to 0.15.1, grafana to 4.6.3
migrate prometheus alerts to yaml notation
2017-12-27 09:35:53 +01:00
Frederic Branczyk
a9fedc6343 kube-prometheus: Update etcd3 rules 2017-12-22 16:09:13 +01:00
Bradley
fb01fe91dc Adding requested and limit values to CPU and limit value to memory 2017-12-07 18:35:58 +00:00
Frederic Branczyk
2454f1054e Merge pull request #789 from slok/metric-minor-fix
Fix cluster:container_cpu_usage:ratio rule
2017-12-01 10:43:36 +01:00
Frederic Branczyk
3afc174fc5 kube-prometheus: Add Prometheus 2.0 rules 2017-11-29 10:59:44 +01:00
Xabier Larrakoetxea
d6a2b717d3 Fix cluster:container_cpu_usage:ratio rule on prometheus kubernetes files
Signed-off-by: Xabier Larrakoetxea <slok69@gmail.com>
2017-11-28 14:52:16 +01:00
Frederic Branczyk
a37ad3a270 kube-prometheus: sync rules 2017-11-21 16:43:28 +01:00
Frederic Branczyk
7615244a60 Merge pull request #756 from iJanki/fix_api_latency_rule
Fixing #751 K8SApiServerLatency always triggering
2017-11-16 14:12:38 +01:00
Cesarini, Daniele
727d053dd4 Fixing #751 K8SApiServerLatency always triggering 2017-11-14 15:48:14 +00:00
Aleksandar Topuzovic
598d6779cd Alert on daemonset problems
* If any of the rules is active > 10m
* If all daemonsets are not ready
* If all daemonsets are not scheduled
* If some are miss scheduled
2017-11-14 14:36:22 +00:00
Konstantinos Natsakis
d80eaea23a kube-prometheus: use StatefulSet for dashboard title 2017-11-09 18:33:25 +02:00
Konstantinos Natsakis
85ddb3137c kube-prometheus: add stateful sets dashboard 2017-11-07 16:44:05 +02:00
Arve Knudsen
d04cccc526 Use grafanalib to generate Grafana dashboards 2017-10-30 22:05:25 +01:00
Frederic Branczyk
1b7c8cdf21 *: bump Prometheus to v2.0.0-rc.1 2017-10-17 20:13:40 +02:00
Frederic Branczyk
6ed84502c8 kube-prometheus: fix multiple series error in grafana dashboard 2017-10-16 14:40:29 +02:00
Frederic Branczyk
40fa4ccd15 grafana-dashboards: various small improvements 2017-09-26 15:59:44 +02:00
Frederic Branczyk
c8cb2df928 kube-prometheus: exclude pod log subresource from latency alerts 2017-09-18 11:11:30 +02:00
Frederic Branczyk
dfd2ee2847 assets: modify and add grafana dashboards 2017-09-07 13:44:12 +02:00
crandl201
e48278f397 update kube-state rules for 1.0.0 2017-08-17 20:05:55 -04:00
Zachary Yonash
7010e32130 Added a few extra node rules (#478) 2017-07-27 09:49:25 +02:00
Frederic Branczyk
a5533a4f6c kube-prometheus: ensure triggering alerts on down targets 2017-06-28 14:11:05 +02:00
Frederic Branczyk
915677eaa2 Revert "alerting rules: replace severity with action" 2017-06-15 10:45:51 +02:00
Frederic Branczyk
a1afce8707 alerting rules: replace severity with action 2017-06-15 09:34:59 +02:00
chenxingyu
98cdf68a0c fix alert rule bug 2017-06-13 16:40:56 +08:00
Frederic Branczyk
4da7a872ba kube-prometheus: add comment on apiserver latency unit 2017-06-06 15:34:10 +02:00
Frederic Branczyk
0c35d73e2c kube-prometheus: drop conntrack alerts and direct up alerts 2017-06-06 15:22:28 +02:00
Frederic Branczyk
30cbd76944 kube-prometheus: add PROXY verb to latency alert exclusion 2017-05-31 06:39:35 -07:00
Frederic Branczyk
804f6c187b kube-prometheus: add dead man's switch 2017-05-30 17:15:59 -07:00
Frederic Branczyk
c4b382be6f kube-prometheus: add alerting rules 2017-05-30 17:15:34 -07:00
Gytis
e810357b8f Rename kube_pod_container_requested_memory_bytes -> kube_pod_container_resource_requests_memory_bytes in grafana dashboard 2017-05-09 12:15:59 +03:00
Frederic Branczyk
ce0a9caae7 kube-prometheus: fix deployment dashboard multiple values error 2017-04-26 16:09:15 +02:00
Frederic Branczyk
d9086e9875 kube-prometheus: remove duplication in grafana dashboards
Datasource links were duplicated in the grafana dashboads. This now also
allows exporting grafana dashboards from the UI and just dropping them
into the assets directory and they will be wrapped by the manifest
generation script.
2017-03-13 12:08:30 +01:00