Antoine Legrand
0ae6c98a48
Add alert if it no samples are ingested
2018-02-26 14:57:39 +01:00
Frederic Branczyk
85f88025f3
kube-prometheus: Upgrade to grafana v5
2018-02-09 13:21:37 +01:00
Antoine Legrand
bcb0ba9974
Add cert expiration rules
2018-01-16 17:20:00 +01:00
Frederic Branczyk
a74a54cd06
Merge pull request #893 from brancz/bump-versions
...
kube-prometheus: bump various versions
2018-01-16 10:12:36 +01:00
Frederic Branczyk
aacc95b74c
kube-prometheus: bump various versions
2018-01-15 14:13:26 +01:00
Matthias Loibl
85f384876e
Update kube-state-metrics rules to 1.2 ( #884 )
...
* Update kube-state-metrics rules to 1.2
* Run make generate to update all manifests
* Fix the helm chart kube-state-metrics rules
2018-01-13 11:39:31 +01:00
Frederic Branczyk
d05a3ac486
kube-prometheus: Make grafana dashboards non-editable
2018-01-11 10:56:07 +01:00
Frederic Branczyk
5392443721
kube-prometheus: Add etcd dashboard
2018-01-10 14:46:00 +01:00
Giancarlo Rubio
68517f63b5
Delete chart exporter-kube-api because it has been replaced by kube-controller-manager alerts
2017-12-27 09:36:12 +01:00
Giancarlo Rubio
22eef956af
Add script to keep kube-prometheus rules in sync with helm charts
...
Bump prometheus to 2.0.0, prometheus-operator to 0.15.0, alertmanager to 0.12.0 and node-exporter to 0.15.1, grafana to 4.6.3
migrate prometheus alerts to yaml notation
2017-12-27 09:35:53 +01:00
Frederic Branczyk
a9fedc6343
kube-prometheus: Update etcd3 rules
2017-12-22 16:09:13 +01:00
Bradley
fb01fe91dc
Adding requested and limit values to CPU and limit value to memory
2017-12-07 18:35:58 +00:00
Frederic Branczyk
2454f1054e
Merge pull request #789 from slok/metric-minor-fix
...
Fix cluster:container_cpu_usage:ratio rule
2017-12-01 10:43:36 +01:00
Frederic Branczyk
3afc174fc5
kube-prometheus: Add Prometheus 2.0 rules
2017-11-29 10:59:44 +01:00
Xabier Larrakoetxea
d6a2b717d3
Fix cluster:container_cpu_usage:ratio rule on prometheus kubernetes files
...
Signed-off-by: Xabier Larrakoetxea <slok69@gmail.com >
2017-11-28 14:52:16 +01:00
Frederic Branczyk
a37ad3a270
kube-prometheus: sync rules
2017-11-21 16:43:28 +01:00
Frederic Branczyk
7615244a60
Merge pull request #756 from iJanki/fix_api_latency_rule
...
Fixing #751 K8SApiServerLatency always triggering
2017-11-16 14:12:38 +01:00
Cesarini, Daniele
727d053dd4
Fixing #751 K8SApiServerLatency always triggering
2017-11-14 15:48:14 +00:00
Aleksandar Topuzovic
598d6779cd
Alert on daemonset problems
...
* If any of the rules is active > 10m
* If all daemonsets are not ready
* If all daemonsets are not scheduled
* If some are miss scheduled
2017-11-14 14:36:22 +00:00
Konstantinos Natsakis
d80eaea23a
kube-prometheus: use StatefulSet for dashboard title
2017-11-09 18:33:25 +02:00
Konstantinos Natsakis
85ddb3137c
kube-prometheus: add stateful sets dashboard
2017-11-07 16:44:05 +02:00
Arve Knudsen
d04cccc526
Use grafanalib to generate Grafana dashboards
2017-10-30 22:05:25 +01:00
Frederic Branczyk
1b7c8cdf21
*: bump Prometheus to v2.0.0-rc.1
2017-10-17 20:13:40 +02:00
Frederic Branczyk
6ed84502c8
kube-prometheus: fix multiple series error in grafana dashboard
2017-10-16 14:40:29 +02:00
Frederic Branczyk
40fa4ccd15
grafana-dashboards: various small improvements
2017-09-26 15:59:44 +02:00
Frederic Branczyk
c8cb2df928
kube-prometheus: exclude pod log subresource from latency alerts
2017-09-18 11:11:30 +02:00
Frederic Branczyk
dfd2ee2847
assets: modify and add grafana dashboards
2017-09-07 13:44:12 +02:00
crandl201
e48278f397
update kube-state rules for 1.0.0
2017-08-17 20:05:55 -04:00
Zachary Yonash
7010e32130
Added a few extra node rules ( #478 )
2017-07-27 09:49:25 +02:00
Frederic Branczyk
a5533a4f6c
kube-prometheus: ensure triggering alerts on down targets
2017-06-28 14:11:05 +02:00
Frederic Branczyk
915677eaa2
Revert "alerting rules: replace severity with action"
2017-06-15 10:45:51 +02:00
Frederic Branczyk
a1afce8707
alerting rules: replace severity with action
2017-06-15 09:34:59 +02:00
chenxingyu
98cdf68a0c
fix alert rule bug
2017-06-13 16:40:56 +08:00
Frederic Branczyk
4da7a872ba
kube-prometheus: add comment on apiserver latency unit
2017-06-06 15:34:10 +02:00
Frederic Branczyk
0c35d73e2c
kube-prometheus: drop conntrack alerts and direct up alerts
2017-06-06 15:22:28 +02:00
Frederic Branczyk
30cbd76944
kube-prometheus: add PROXY verb to latency alert exclusion
2017-05-31 06:39:35 -07:00
Frederic Branczyk
804f6c187b
kube-prometheus: add dead man's switch
2017-05-30 17:15:59 -07:00
Frederic Branczyk
c4b382be6f
kube-prometheus: add alerting rules
2017-05-30 17:15:34 -07:00
Gytis
e810357b8f
Rename kube_pod_container_requested_memory_bytes -> kube_pod_container_resource_requests_memory_bytes in grafana dashboard
2017-05-09 12:15:59 +03:00
Frederic Branczyk
ce0a9caae7
kube-prometheus: fix deployment dashboard multiple values error
2017-04-26 16:09:15 +02:00
Frederic Branczyk
d9086e9875
kube-prometheus: remove duplication in grafana dashboards
...
Datasource links were duplicated in the grafana dashboads. This now also
allows exporting grafana dashboards from the UI and just dropping them
into the assets directory and they will be wrapped by the manifest
generation script.
2017-03-13 12:08:30 +01:00
Frederic Branczyk
9ed63f191f
kube-prometheus: generate manifests without kubectl
...
For `--dry-run` to work with kubectl a Kubernetes cluster's apiserver is
actually used, which is unnecessary for generating these manifests. This
approach also allows further customization, such as adding labels to the
generated manifests.
2017-03-13 11:17:23 +01:00
Mike Bryant
51778eb36e
kube-prometheus: add resource requests dashboard
...
This presents the resource requests vs the allocatable capacity in the cluster.
2017-03-10 20:04:16 +00:00
Frederic Branczyk
83f19b0dd1
Merge pull request #205 from ocadotechnology/fix-kube-state-metrics-redeployment
...
Allow for multiple kube-state-metrics series
2017-03-10 20:00:11 +01:00
Mike Bryant
b85b5b6bcf
Account for multiple copies of kube-state-metrics
...
This can happen if you run multiple replicas, or if you redeploy kube-state-metrics.
In either case, prometheus records multiple metrics with the instance ip in, and the dashboard fails.
Use aggregation functions to get sensible output in either case
2017-03-09 21:23:55 +00:00
Frederic Branczyk
e69a6f69ec
alertmanager: use a secret for the config
2017-03-09 10:04:47 +01:00
Frederic Branczyk
89ed6773e7
Add 'contrib/kube-prometheus/' from commit '81c0d2f4d30f63a4e274c2870c5afc89241827b0'
...
git-subtree-dir: contrib/kube-prometheus
git-subtree-mainline: 050ca21276696c8603375c699513ec487301ed62
git-subtree-split: 81c0d2f4d3
2017-03-06 09:55:36 +01:00