piglei™
a9e667d24c
kube-prometheus: fix alert rule K8SManyNodesNotReady (#1313)
...
* kube-prometheus: fix alert rule K8SManyNodesNotReady
* fix alert "K8SManyNodesNotReady" in helm templates & make generate
* Use sync_kube_prometheus.py to make rules in helm in sync
2018-05-11 10:59:34 +02:00
Frederic Branczyk
0d142fe9da
Merge pull request #1205 from zuzzas/cpu-rules
...
Fixed CPU accounting in recording rules
2018-04-11 15:08:15 +02:00
Arslanbekov Denis
a2d273b11a
Display namespace correctly in description (#1190)
...
* Display namespace correctly in description
* Bump kube state version
* Update Chart.yaml
2018-04-10 17:18:24 +02:00
Andrey Klimentyev
f9b03ddd9d
kube-prometheus: fixed CPU accounting
...
Currently, node recording rules feature
an incorrect idle CPU accounting. This
change aims to fix that.
2018-04-10 15:14:48 +03:00
Max Leonard Inden
b10e343689
kube-prometheus: Fix minor typo
2018-04-10 10:27:54 +02:00
Sébastien GLON
2c10f81102
Add new alert for samples rejected due to duplicate timestamp (#1148)
...
Signed-off-by: Sébastien GLON <sebastien.glon@akeneo.com>
2018-03-26 18:11:40 +02:00
Alexander Holte-Davidsen
4c77a9db1d
Update Alert Manager rules for NodeDiskRunningFull with summary
2018-03-22 11:32:38 +01:00
Alexander Holte-Davidsen
8b6ee5c18b
Add summary to Alertmanager rules where missing - updated according to guidelines
2018-03-05 09:52:51 +01:00
Akihito INOH
7fe4506ae4
Update alert rule for kubelet
...
Raise the kubelet-down ratio threshold in the alert rule from 1% to 10%.
It was changed to 1% in #774; this returns it to 10%.
2018-03-01 14:10:27 +09:00
Antoine Legrand
0ae6c98a48
Add alert if no samples are ingested
2018-02-26 14:57:39 +01:00
Frederic Branczyk
85f88025f3
kube-prometheus: Upgrade to grafana v5
2018-02-09 13:21:37 +01:00
Antoine Legrand
bcb0ba9974
Add cert expiration rules
2018-01-16 17:20:00 +01:00
Frederic Branczyk
a74a54cd06
Merge pull request #893 from brancz/bump-versions
...
kube-prometheus: bump various versions
2018-01-16 10:12:36 +01:00
Frederic Branczyk
aacc95b74c
kube-prometheus: bump various versions
2018-01-15 14:13:26 +01:00
Matthias Loibl
85f384876e
Update kube-state-metrics rules to 1.2 (#884)
...
* Update kube-state-metrics rules to 1.2
* Run make generate to update all manifests
* Fix the helm chart kube-state-metrics rules
2018-01-13 11:39:31 +01:00
Frederic Branczyk
d05a3ac486
kube-prometheus: Make grafana dashboards non-editable
2018-01-11 10:56:07 +01:00
Frederic Branczyk
5392443721
kube-prometheus: Add etcd dashboard
2018-01-10 14:46:00 +01:00
Giancarlo Rubio
68517f63b5
Delete chart exporter-kube-api because it has been replaced by kube-controller-manager alerts
2017-12-27 09:36:12 +01:00
Giancarlo Rubio
22eef956af
Add script to keep kube-prometheus rules in sync with helm charts
...
Bump prometheus to 2.0.0, prometheus-operator to 0.15.0, alertmanager to 0.12.0 and node-exporter to 0.15.1, grafana to 4.6.3
migrate prometheus alerts to yaml notation
2017-12-27 09:35:53 +01:00
Frederic Branczyk
a9fedc6343
kube-prometheus: Update etcd3 rules
2017-12-22 16:09:13 +01:00
Bradley
fb01fe91dc
Adding requested and limit values to CPU and limit value to memory
2017-12-07 18:35:58 +00:00
Frederic Branczyk
2454f1054e
Merge pull request #789 from slok/metric-minor-fix
...
Fix cluster:container_cpu_usage:ratio rule
2017-12-01 10:43:36 +01:00
Frederic Branczyk
3afc174fc5
kube-prometheus: Add Prometheus 2.0 rules
2017-11-29 10:59:44 +01:00
Xabier Larrakoetxea
d6a2b717d3
Fix cluster:container_cpu_usage:ratio rule on prometheus kubernetes files
...
Signed-off-by: Xabier Larrakoetxea <slok69@gmail.com>
2017-11-28 14:52:16 +01:00
Frederic Branczyk
a37ad3a270
kube-prometheus: sync rules
2017-11-21 16:43:28 +01:00
Frederic Branczyk
7615244a60
Merge pull request #756 from iJanki/fix_api_latency_rule
...
Fixing #751 K8SApiServerLatency always triggering
2017-11-16 14:12:38 +01:00
Cesarini, Daniele
727d053dd4
Fixing #751 K8SApiServerLatency always triggering
2017-11-14 15:48:14 +00:00
Aleksandar Topuzovic
598d6779cd
Alert on daemonset problems
...
* If any of the rules is active > 10m
* If all daemonsets are not ready
* If all daemonsets are not scheduled
* If some are misscheduled
2017-11-14 14:36:22 +00:00
Konstantinos Natsakis
d80eaea23a
kube-prometheus: use StatefulSet for dashboard title
2017-11-09 18:33:25 +02:00
Konstantinos Natsakis
85ddb3137c
kube-prometheus: add stateful sets dashboard
2017-11-07 16:44:05 +02:00
Arve Knudsen
d04cccc526
Use grafanalib to generate Grafana dashboards
2017-10-30 22:05:25 +01:00
Frederic Branczyk
1b7c8cdf21
*: bump Prometheus to v2.0.0-rc.1
2017-10-17 20:13:40 +02:00
Frederic Branczyk
6ed84502c8
kube-prometheus: fix multiple series error in grafana dashboard
2017-10-16 14:40:29 +02:00
Frederic Branczyk
40fa4ccd15
grafana-dashboards: various small improvements
2017-09-26 15:59:44 +02:00
Frederic Branczyk
c8cb2df928
kube-prometheus: exclude pod log subresource from latency alerts
2017-09-18 11:11:30 +02:00
Frederic Branczyk
dfd2ee2847
assets: modify and add grafana dashboards
2017-09-07 13:44:12 +02:00
crandl201
e48278f397
update kube-state rules for 1.0.0
2017-08-17 20:05:55 -04:00
Zachary Yonash
7010e32130
Added a few extra node rules (#478)
2017-07-27 09:49:25 +02:00
Frederic Branczyk
a5533a4f6c
kube-prometheus: ensure triggering alerts on down targets
2017-06-28 14:11:05 +02:00
Frederic Branczyk
915677eaa2
Revert "alerting rules: replace severity with action"
2017-06-15 10:45:51 +02:00
Frederic Branczyk
a1afce8707
alerting rules: replace severity with action
2017-06-15 09:34:59 +02:00
chenxingyu
98cdf68a0c
fix alert rule bug
2017-06-13 16:40:56 +08:00
Frederic Branczyk
4da7a872ba
kube-prometheus: add comment on apiserver latency unit
2017-06-06 15:34:10 +02:00
Frederic Branczyk
0c35d73e2c
kube-prometheus: drop conntrack alerts and direct up alerts
2017-06-06 15:22:28 +02:00
Frederic Branczyk
30cbd76944
kube-prometheus: add PROXY verb to latency alert exclusion
2017-05-31 06:39:35 -07:00
Frederic Branczyk
804f6c187b
kube-prometheus: add dead man's switch
2017-05-30 17:15:59 -07:00
Frederic Branczyk
c4b382be6f
kube-prometheus: add alerting rules
2017-05-30 17:15:34 -07:00
Gytis
e810357b8f
Rename kube_pod_container_requested_memory_bytes -> kube_pod_container_resource_requests_memory_bytes in grafana dashboard
2017-05-09 12:15:59 +03:00
Frederic Branczyk
ce0a9caae7
kube-prometheus: fix deployment dashboard multiple values error
2017-04-26 16:09:15 +02:00
Frederic Branczyk
d9086e9875
kube-prometheus: remove duplication in grafana dashboards
...
Datasource links were duplicated in the grafana dashboards. This now also
allows exporting grafana dashboards from the UI and just dropping them
into the assets directory and they will be wrapped by the manifest
generation script.
2017-03-13 12:08:30 +01:00