233 Commits

Author SHA1 Message Date
Lili Cosic 909f51b3bd jsonnet/kube-prometheus: Prevent many-to-many matching
If there is more than one prometheus-operator pod, which happens briefly
when we delete the prometheus-operator pod, we can see many-to-many
matching errors. This whitelists the labels used for matching and excludes
the pod.
2019-08-16 15:55:17 +02:00
Frederic Branczyk 636f2b6e81 Merge pull request #132 from bgagnon/fix-72
Add mixin to strip cpu/memory limits (issue #72)
2019-08-12 10:29:31 +02:00
Benoit Gagnon da644166a3 use proper operator so requests are not stripped 2019-08-10 14:35:06 -04:00
Benoit Gagnon 16c22b7250 move resources requests/limits to _config and simplify mixin 2019-08-10 14:31:28 -04:00
Guus van Weelden c793073789 set kubeStateMetrics version to 1.7.2
Signed-off-by: Guus van Weelden <guus.vanweelden@moia.io>
2019-08-09 16:09:03 +02:00
Frederic Branczyk e7d1ada775 Merge pull request #141 from dominikhahn/master
Increase ntp clockskewdetected accuracy to 5ms
2019-08-05 10:31:31 +02:00
Lili Cosic 8b570f5fbb jsonnet/kube-state-metrics: Fix storageclass role
The resource is called storageclasses not storageclass.
2019-07-31 16:53:06 +02:00
Lili Cosic 62caa7adc9 jsonnet/kube-prometheus/kube-state-metrics: Bump 1.7.1 2019-07-31 16:52:03 +02:00
Matthias Loibl 6faecd63b7 Bump kube-state-metrics to 1.7.0-rc.1 and add storageclass rules 2019-07-31 16:51:27 +02:00
Matthias Loibl 7d6183a9ec jsonnet/kube-prometheus/kube-state-metrics: Move rules into array 2019-07-31 16:51:27 +02:00
Lili Cosic ccb138374e jsonnet: Include node name in NodeDiskRunningFull
This makes it easier to identify which node the alert is firing for.
Currently only the device, namespace and pod name are included in
the alert.
2019-07-24 17:52:39 +02:00
Lili Cosic 44dab89344 jsonnet: Relabel instance to node name
This allows us to display the node name in alerts, instead of the IP.
2019-07-24 17:34:54 +02:00
Ryan Phillips 176a187117 change to use GA kubelet OS label 2019-07-18 15:19:07 -05:00
Matthias Loibl a535968c33 Add empty podMonitorSelector to Prometheus resource 2019-07-18 16:35:01 +02:00
Frederic Branczyk c8c850ef2b jsonnet: Use upstream prometheus-mixin 2019-07-11 15:35:48 +02:00
Matthias Loibl f1cd88fdbb Remove Thanos components from kube-prometheus except sidecar 2019-07-10 16:39:41 +02:00
Frederic Branczyk daf2e76bc6 Merge pull request #144 from zachaller/patch-1
Update kube-state-metrics.libsonnet
2019-07-10 09:45:31 +02:00
Rajat Vig 0bc30832d0 Bump to newer release for Prometheus and Alertmanager 2019-07-09 23:02:26 -07:00
Zach Aller 48651090a6 Update kube-state-metrics.libsonnet
The bump to kube-state-metrics v1.6 added ingress and certificates; this updates the RBAC rules so that those work with the new version.
2019-07-09 16:28:17 -05:00
Hahn, Dominik 552b341bf6 Increase ntp clockskewdetected accuracy to 5ms 2019-07-05 11:40:07 +02:00
Benjamin 9a681fad7d Add thanos v0.5.0 support & remove deprecated thanos gossip
Signed-off-by: Benjamin <benjamin@yunify.com>
2019-07-05 14:26:14 +08:00
Lili Cosic 6c34ff2d72 jsonnet: Bump kube-state-metrics version 2019-07-01 15:16:31 +02:00
Matthias Loibl 90b8632fb3 Set kubernetes-mixin on master branch to master 2019-06-25 01:21:24 +02:00
Benoit Gagnon ffc8832ef9 use std.setMember instead of std.count 2019-06-24 11:43:22 -04:00
Benoit Gagnon 8e7d55d795 Add mixin to strip cpu/memory limits (issue #72) 2019-06-24 11:39:47 -04:00
Frederic Branczyk bdf84bf186 jsonnet: Bump prometheus operator to release-0.31 2019-06-20 11:27:02 +02:00
Frederic Branczyk 3f6d0c6dd3 node-exporter: Fix missing semicolon 2019-06-18 17:46:12 +02:00
paulfantom 272ff23cb6 make node_exporter pods tolerate every taint 2019-06-17 21:28:35 +02:00
Israel Sotomayor 6e65c508f8 fix: typo 2019-06-15 23:59:30 +00:00
Rajat Vig ef8bb5aac0 Fix Thanos deployment and service for ksonnet.beta.4 2019-06-13 14:46:36 -07:00
Rajat Vig fa93506941 Update Prometheus, Node Exporter & Grafana (#119)
* Update Prometheus, Node Exporter & Grafana

* Update Prometheus to 2.10.0

* Add generated manifests

* Update lock reference to git sha

* Retrigger Travis
2019-06-13 08:22:01 +02:00
Josh Tilles df8a5b51b4 Merge branch 'release-0.1' 2019-06-10 14:47:21 -04:00
Josh Tilles e85d2f3b64 Depend on operator’s release-0.30 branch
Note that, in effect, this also leads to kube-prometheus using
the latest patch-release of prometheus-operator, 0.30.1.
2019-06-10 14:16:41 -04:00
Matthias Loibl 107028fff3 Remove extra ksonnet4 dependency 2019-05-29 17:58:29 +02:00
Matthias Loibl bedeadb7f5 Add ksonnet4 with ksonnet.beta.4 and update all imports 2019-05-29 17:57:52 +02:00
zzg 6562f95c14 update mem limit of kube-rbac-proxy container in pod of node-exporter to 60M 2019-05-18 04:13:35 +08:00
Matthias Loibl 3ecedc6451 Update Prometheus Operator dependency to v0.30.0 2019-05-15 16:03:03 +02:00
Frederic Branczyk 93a493bda1 Merge pull request #89 from metalmatze/kubernetes-mixin-v0.1.0
Pin kubernetes-mixin v0.1.0
2019-05-07 12:36:27 +02:00
Matthias Loibl 9f4176dcdf Pin release-0.1 branch for kubernetes-mixin 2019-05-06 19:47:50 +02:00
Max Leonard Inden aa6e048e64 jsonnet/kube-prometheus/alertmanager: Update to v0.17.0 2019-05-06 12:01:49 +02:00
Matthias Loibl 701f32914b Use v0.1.0 of kubernetes-mixin explicitly 2019-05-03 18:24:19 +02:00
Alan Hollis 44347261db Update kubespray libsonnet for kubespray v2.9.0
These changes have been tested against kubespray v2.9.0 on a baremetal
cluster created from scratch. They have not been tested against previous
versions of kubespray.

Fixes:

https://github.com/coreos/kube-prometheus/issues/79
2019-05-02 13:27:34 +01:00
Frederic Branczyk abd16f1b99 alerts: Replace ntp with timex metrics
A local ntp server is optional, while timex is in kernel and always
present. Some ntp daemons like chrony can also only be run in a client
mode, which synchronizes the kernel but doesn't expose an ntp server.
Using the timex metrics is a more reliable approach.
2019-04-24 10:29:13 +02:00
Sergiusz Urbaniak 1fbf75767a jsonnet/prometheus-adapter: add metrics to default user-facing roles
Currently, only cluster wide admins have the permissions to view
metrics resources. This fixes it by adding a read-only cluster role
which includes aggregation labels to synthesize permission rules for
standard user-facing roles according to [1].

Note that only the "pods" resource is granted as reading "nodes"
metrics requires a cluster wide permission.

[1] https://kubernetes.io/docs/reference/access-authn-authz/rbac/#user-facing-roles
2019-04-23 13:51:26 +02:00
Max Leonard Inden cb4b913c32 jsonnet: Update Alertmanager to v0.16.2 2019-04-16 14:44:09 +02:00
Frederic Branczyk 9524cbb406 alerts: Remove node interface down alert
This alert is not specific enough on the interfaces it selects which in
turn makes it too noisy.
2019-04-16 11:47:16 +02:00
Frederic Branczyk 30d2ef0f25 kube-prometheus: Add clock skew and node network interface alerts 2019-04-10 17:36:42 +02:00
Frederic Branczyk f8bd9b89f7 kube-prometheus: Don't drop non-pod metrics
These metrics can still be valuable to troubleshoot components running
in cgroups on the host, outside the scope of Kubernetes pods.
2019-04-09 15:28:44 +02:00
Frederic Branczyk 438381c3e3 kube-prometheus: Set Alertmanager sessionAffinity to ClientIP 2019-04-08 15:56:39 +02:00
Frederic Branczyk df17b1ca38 kube-prometheus: Fix double accounting of pod usage
cAdvisor exposes metrics for each cgroup hierarchy step, and containers
are part of the respective pod's hierarchy, causing double accounting
when not filtered appropriately.
2019-04-02 14:53:18 +02:00