Commit Graph

238 Commits

Author SHA1 Message Date
Prune Sebastien THOMAS
2511a6c529 use real alertmanager name
`alertmanager-main` is hardcoded where it should derive from the real alertmanager name as defined by the user
2019-08-27 11:53:12 -04:00
paulfantom
2396fa9483 jsonnet/kube-prometheus: add node-mixin 2019-08-26 10:56:30 +02:00
paulfantom
bcbf54805a jsonnet/kube-prometheus/alerts: remove alerts provided by node_exporter mixin 2019-08-26 10:56:05 +02:00
Matthias Loibl
da959c6436 Merge pull request #185 from bgagnon/fix-72
Fix wrong config used for node-exporter container limits
2019-08-19 11:11:22 +02:00
Lili Cosic
909f51b3bd jsonnet/kube-prometheus: Prevent many-to-many matching
If there is more than one prometheus-operator pod, which happens briefly
when we delete the prometheus-operator pod, we can see the errors of
many-to-many matching, this whitelists the labels matching, and excluded
the pod.
2019-08-16 15:55:17 +02:00
Benoit Gagnon
04309dcdb6 fix wrong config used for node-exporter container limits 2019-08-12 13:23:10 -04:00
Frederic Branczyk
636f2b6e81 Merge pull request #132 from bgagnon/fix-72
Add mixin to strip cpu/memory limits (issue #72)
2019-08-12 10:29:31 +02:00
Benoit Gagnon
da644166a3 use proper operator so requests are not stripped 2019-08-10 14:35:06 -04:00
Benoit Gagnon
16c22b7250 move resources requests/limits to _config and simplify mixin 2019-08-10 14:31:28 -04:00
Guus van Weelden
c793073789 set kubeStateMetrics version to 1.7.2
Signed-off-by: Guus van Weelden <guus.vanweelden@moia.io>
2019-08-09 16:09:03 +02:00
Frederic Branczyk
e7d1ada775 Merge pull request #141 from dominikhahn/master
Increase ntp clockskewdetected accuracy to 5ms
2019-08-05 10:31:31 +02:00
Lili Cosic
8b570f5fbb jsonnet/kube-state-metrics: Fix storageclass role
The resource is called storageclasses not storageclass.
2019-07-31 16:53:06 +02:00
Lili Cosic
62caa7adc9 jsonnet/kube-prometheus/kube-state-metrics: Bump 1.7.1 2019-07-31 16:52:03 +02:00
Matthias Loibl
6faecd63b7 Bump kube-state-metrics to 1.7.0-rc.1 and add storageclass rules 2019-07-31 16:51:27 +02:00
Matthias Loibl
7d6183a9ec jsonnet/kube-prometheus/kube-state-metrics: Move rules into array 2019-07-31 16:51:27 +02:00
Lili Cosic
ccb138374e jsonnet: Include node name in NodeDiskRunningFull
This makes it easier to indetify which node alert is firing for.
Currently only device namespace and pod name were included in
the alert.
2019-07-24 17:52:39 +02:00
Lili Cosic
44dab89344 jsonet: Relabel instance to node name
This allows us to display the node name in alerts, instead of the IP.
2019-07-24 17:34:54 +02:00
Ryan Phillips
176a187117 change to use GA kubelet OS label 2019-07-18 15:19:07 -05:00
Matthias Loibl
a535968c33 Add empty podMonitorSelector to Prometheus resource 2019-07-18 16:35:01 +02:00
Frederic Branczyk
c8c850ef2b jsonnet: Use upstream prometheus-mixin 2019-07-11 15:35:48 +02:00
Matthias Loibl
f1cd88fdbb Remove Thanos components from kube-prometheus except sidecar 2019-07-10 16:39:41 +02:00
Frederic Branczyk
daf2e76bc6 Merge pull request #144 from zachaller/patch-1
Update kube-state-metrics.libsonnet
2019-07-10 09:45:31 +02:00
Rajat Vig
0bc30832d0 Bump to newer release for Prometheus and Alertmanager 2019-07-09 23:02:26 -07:00
Zach Aller
48651090a6 Update kube-state-metrics.libsonnet
With the bump to kube-state-metrics v1.6 they added ingress and certificates but this updates the rbac rules so that those work with the new version.
2019-07-09 16:28:17 -05:00
Hahn, Dominik
552b341bf6 Increase ntp clockskewdetected accuracy to 5ms 2019-07-05 11:40:07 +02:00
Benjamin
9a681fad7d Add thanos v0.5.0 support & remove deprecated thanos gossip
Signed-off-by: Benjamin <benjamin@yunify.com>
2019-07-05 14:26:14 +08:00
Lili Cosic
6c34ff2d72 jsonnet: Bump kube-state-metrics version 2019-07-01 15:16:31 +02:00
Matthias Loibl
90b8632fb3 Set kubernetes-mixin on master branch to master 2019-06-25 01:21:24 +02:00
Benoit Gagnon
ffc8832ef9 use std.setMember instead of std.count 2019-06-24 11:43:22 -04:00
Benoit Gagnon
8e7d55d795 Add mixin to strip cpu/memory limits (issue #72) 2019-06-24 11:39:47 -04:00
Frederic Branczyk
bdf84bf186 jsonnet: Bump prometheus operator to release-0.31 2019-06-20 11:27:02 +02:00
Frederic Branczyk
3f6d0c6dd3 node-exporter: Fix missing semicolon 2019-06-18 17:46:12 +02:00
paulfantom
272ff23cb6 make node_exporter pods tolerate every taint 2019-06-17 21:28:35 +02:00
Israel Sotomayor
6e65c508f8 fix: typo 2019-06-15 23:59:30 +00:00
Rajat Vig
ef8bb5aac0 Fix Thanos deployment and service for ksonnet.beta.4 2019-06-13 14:46:36 -07:00
Rajat Vig
fa93506941 Update Prometheus, Node Exporter & Grafana (#119)
* Update Prometheus, Node Exporter & Grafana

* Update Prometheus to 2.10.0

* Add generated manifests

* Update lock reference to git sha

* Retrigger Travis
2019-06-13 08:22:01 +02:00
Josh Tilles
df8a5b51b4 Merge branch 'release-0.1' 2019-06-10 14:47:21 -04:00
Josh Tilles
e85d2f3b64 Depend on operator’s release-0.30 branch
Note that, in effect, this also leads to kube-prometheus using
the latest patch-release of prometheus-operator, 0.30.1.
2019-06-10 14:16:41 -04:00
Matthias Loibl
107028fff3 Remove extra ksonnet4 dependency 2019-05-29 17:58:29 +02:00
Matthias Loibl
bedeadb7f5 Add ksonnet4 with ksonnet.beta.4 and update all imports 2019-05-29 17:57:52 +02:00
zzg
6562f95c14 update mem limit of kube-rbac-proxy container in pod of node-exporter to 60M 2019-05-18 04:13:35 +08:00
Matthias Loibl
3ecedc6451 Update Prometheus Operator dependency to v0.30.0 2019-05-15 16:03:03 +02:00
Frederic Branczyk
93a493bda1 Merge pull request #89 from metalmatze/kubernetes-mixin-v0.1.0
Pin kubernetes-mixin v0.1.0
2019-05-07 12:36:27 +02:00
Matthias Loibl
9f4176dcdf Pin release-0.1 branch for kubernetes-mixin 2019-05-06 19:47:50 +02:00
Max Leonard Inden
aa6e048e64 jsonnet/kube-prometheus/alertmanager: Update to v0.17.0 2019-05-06 12:01:49 +02:00
Matthias Loibl
701f32914b Use v0.1.0 of kubernetes-mixin explicitly 2019-05-03 18:24:19 +02:00
Alan Hollis
44347261db Update kubespray libsonnet for kubespray v2.9.0
These changes have been tested against kubespray v2.9.0 on a baremetal
cluster created from scratch. They have not been tested against previous
versions of kubespray.

Fixes:

https://github.com/coreos/kube-prometheus/issues/79
2019-05-02 13:27:34 +01:00
Frederic Branczyk
abd16f1b99 alerts: Replace ntp with timex metrics
A local ntp server is optional, while timex is in kernel and always
present. Some ntp daemons like chrony can also only be run in a client
mode, which synchronizes the kernel but doesn't expose an ntp server.
Using the timex metrics is a more reliable approach.
2019-04-24 10:29:13 +02:00
Sergiusz Urbaniak
1fbf75767a jsonnet/prometheus-adapter: add metrics to default user-facing roles
Currently, only cluster wide admins have the permissions to view
metrics resources. This fixes it by adding a read-only cluster role
which includes aggregation labels to synthesize permission rules for
standard user-facing roles according to [1].

Note that only the "pods" resource is granted as reading "nodes"
metrics requires a cluster wide permission.

[1] https://kubernetes.io/docs/reference/access-authn-authz/rbac/#user-facing-roles
2019-04-23 13:51:26 +02:00
Max Leonard Inden
cb4b913c32 jsonnet: Update Alertmanager to v0.16.2 2019-04-16 14:44:09 +02:00