Philip Gough
bbdb94a23e
Merge pull request #1675 from sikachu/fix-eks-platform
...
Fix problems when enabling eks platform patch
2022-04-14 12:49:30 +01:00
Arunprasad Rajkumar
6ff8bfbb02
Adjust NodeFilesystemSpaceFillingUp thresholds according default kubelet GC behavior
...
Previously[1] we attempted to do the same, but there was a
misunderstanding about the GC behavior and it caused the alert to be
fired even before GC comes into play.
According to[2][3] kubelet GC kicks in only when `imageGCHighThresholdPercent` is hit which is set to 85% by default. However `NodeFilesystemSpaceFillingUp` is set to fire as soon as 80% usage is hit.
This commit changes the `fsSpaceFillingUpWarningThreshold` to 15% so
that we give ample time to GC to reclaim unwanted images. This commit
also changes `fsSpaceFillingUpCriticalThreshold` to 10% which gives more time to admins to react to warning before sending critical alert.
[1] https://github.com/prometheus-operator/kube-prometheus/pull/1357
[2] https://docs.openshift.com/container-platform/4.10/nodes/nodes/nodes-nodes-garbage-collection.html#nodes-nodes-garbage-collection-images_nodes-nodes-configuring
[3] https://kubernetes.io/docs/reference/config-api/kubelet-config.v1beta1/
Signed-off-by: Arunprasad Rajkumar <arajkuma@redhat.com >
2022-04-13 12:01:06 +05:30
ArthurSens
df3de0ed7c
Fix networkpolicies-disabled addon
...
Signed-off-by: ArthurSens <arthursens2005@gmail.com >
2022-04-12 07:28:14 +00:00
Paweł Krupa
6053258fc3
Merge pull request #1721 from ArthurSens/graf-prom-network-fix
2022-04-11 16:02:25 +02:00
Prometheus Operator Bot
14f0a03028
[bot] [main] Automated version update
2022-04-11 07:43:32 +00:00
ArthurSens
8bdd526039
jsonnet/components/prometheus: Fix grafana network access
...
Signed-off-by: ArthurSens <arthursens2005@gmail.com >
2022-04-11 07:23:09 +00:00
ArthurSens
05d996495e
Add pyrra to 'update versions' automation
...
Signed-off-by: ArthurSens <arthursens2005@gmail.com >
2022-04-07 18:50:48 +00:00
ArthurSens
3da9bcd152
jsonnet/components/grafana: Address FIXME
...
Signed-off-by: ArthurSens <arthursens2005@gmail.com >
2022-04-05 09:28:43 +00:00
Arthur Silva Sens
01004de76c
Merge pull request #1650 from ArthurSens/as/network-policies
...
Adds NetworkPolicies to all components of Kube-prometheus
2022-04-05 09:47:05 +01:00
Prometheus Operator Bot
a16675bc0e
[bot] [main] Automated version update
2022-04-04 07:42:34 +00:00
Philip Gough
eb0fafd789
Merge pull request #1667 from metalmatze/pyrra
...
Add Pyrra as (optional) component
2022-03-31 14:05:57 +01:00
Matthias Loibl
2dad999332
Add Pyrra as addon and commented out example.jsonnet parts
2022-03-30 21:20:13 +02:00
Joao Marcal
1d46f7ece9
Adds port name to prometheus-adapter jsonnet
2022-03-30 15:34:40 +01:00
Philip Gough
6f7a799b78
Merge pull request #1696 from JoaoBraveCoding/2048333
...
Adds readinessProbe and livenessProbe to prometheus-adapter jsonnet
2022-03-30 09:18:11 +01:00
Joao Marcal
f6190e200a
Adds readinessProbe and livenessProbe to prometheus-adapter jsonnet
...
Problem: Currently the prometheus-adapter pods are restarted at the same
time even though the deployment is configured with strategy RollingUpdate.
This happens because the kubelet does not know when the prometheus-adapter
pods are ready to start receiving requests.
Solution: Add both readinessProbe and livenessProbe to the
prometheus-adapter, this way the kubelet will know when either the pod
stoped working and should be restarted or simply when it ready to start
receiving requests.
Issue: https://bugzilla.redhat.com/show_bug.cgi?id=2048333
2022-03-30 07:22:55 +01:00
Prometheus Operator Bot
c30d8c1ef6
[bot] [main] Automated version update
2022-03-28 07:41:44 +00:00
Prometheus Operator Bot
c8361720f9
[bot] [main] Automated version update
2022-03-21 07:41:58 +00:00
Prometheus Operator Bot
1f74cb4148
[bot] [main] Automated version update
2022-03-14 07:41:15 +00:00
Prem Sichanugrist
257031ec49
Fix problems when enabling eks platform patch
2022-03-10 13:44:08 +09:00
paulfantom
3ad08674b3
manifests: regenerate
...
Signed-off-by: paulfantom <pawel@krupa.net.pl >
Signed-off-by: Paweł Krupa (paulfantom) <pawel@krupa.net.pl >
(cherry picked from commit d3ea3147a8 )
(cherry picked from commit d24c347b2742d9474c8f441f2831262c63b8c79b)
2022-03-09 07:48:01 +00:00
Arthur Silva Sens
3f3b56e247
alertmanager/networkPolicy: Allow cluster peer-to-peer communication
...
Signed-off-by: GitHub <noreply@github.com >
(cherry picked from commit df68b8d1da5d2d91b9502d4be67063c2c497e0cb)
2022-03-09 07:47:28 +00:00
Arthur Silva Sens
ea158da23f
Add networkPolicies for alertmanager, grafana, prometheus-operator and prometheus
...
Signed-off-by: GitHub <noreply@github.com >
(cherry picked from commit 86e16b539cc57710b50f4692848cab5645e3d2bc)
2022-03-09 07:47:25 +00:00
Paweł Krupa (paulfantom)
030dec7656
*: add example for disabling NetworkPolicies
...
(cherry picked from commit b4bf38ba6c )
(cherry picked from commit c21bf4fbfa478fd163c091054d9fcc98836d7045)
2022-03-09 07:46:44 +00:00
paulfantom
fddf642de7
jsonnet: add networkpolicies for components accessed by prometheus
...
(cherry picked from commit f8c00b9963 )
(cherry picked from commit f09b8e5de2e46db85f090549d37eeb878a81842f)
2022-03-09 07:42:09 +00:00
Prometheus Operator Bot
4a9f97b41f
[bot] [main] Automated version update
2022-03-07 07:41:44 +00:00
Prometheus Operator Bot
61c007ef56
[bot] [main] Automated version update
2022-02-28 07:40:53 +00:00
Prometheus Operator Bot
36ace1ce7a
[bot] [main] Automated version update
2022-02-21 07:40:44 +00:00
Philip Gough
53542d5cce
Merge pull request #1637 from ahysing/custom-requests-kube-rbacc-proxy-self
...
Added custom overrides for kube-rbac-proxy-self.
2022-02-18 16:32:30 +00:00
Philip Gough
2918583c81
node_exporter: Add priorityClassName as system-cluster-critical
2022-02-17 12:11:31 +00:00
Paweł Krupa
5ff20b1718
Merge pull request #1630 from PhilipGough/fix-1603
2022-02-16 21:46:08 +01:00
Wulf Thimm
30bfd69c3e
removed CAP_ from node-exporter daemonset
2022-02-16 14:41:20 +00:00
ArthurSens
78ca6d9579
Address FIXME
...
Signed-off-by: ArthurSens <arthursens2005@gmail.com >
2022-02-15 15:20:34 +00:00
Arthur Silva Sens
d270540308
Merge pull request #1628 from ravilr/fix_cluster_cpu_query
...
fix cluster:node_cpu:ratio query
2022-02-15 06:28:22 +00:00
Wulf Thimm
a5c5adba68
added allowedCapabilities to node-exporter psp
2022-02-14 14:38:13 +00:00
Philip Gough
7c715c8126
Merge pull request #1624 from ArthurSens/grafana-volume
...
Add extra-volume mount for plugins downloads
2022-02-14 11:15:14 +00:00
Prometheus Operator Bot
ac7f6b42ab
[bot] [main] Automated version update
2022-02-14 07:41:12 +00:00
Andreas Dreyer Hysing
154019ad82
Added custom overrides for kube-rbac-proxy-self.
2022-02-11 15:29:58 +01:00
Prometheus Operator Bot
2de3d810b1
[bot] [main] Automated version update
2022-02-10 15:40:58 +00:00
Philip Gough
58fa27a40e
Update PodMonitor for kube-proxy
2022-02-10 09:45:27 +00:00
ravilr
1d1ca283b1
fix cluster:node_cpu:ratio query
2022-02-09 20:21:05 -08:00
Arthur Silva Sens
43879b5217
Add extra-volume mount for plugins downloads
...
Signed-off-by: GitHub <noreply@github.com >
2022-02-07 20:31:58 +00:00
Prometheus Operator Bot
7b602e1372
[bot] [main] Automated version update
2022-02-07 07:40:53 +00:00
Arthur Silva Sens
db61b3e18e
Merge pull request #1591 from paulfantom/automountServiceAccountToken
...
disable injecting unnecessary variables allowing access to k8s API
2022-02-05 18:28:33 +00:00
Paweł Krupa (paulfantom)
3429bc77a4
disable injecting unnecessary variables allowing access to k8s API
2022-02-04 14:08:52 +01:00
Paweł Krupa
3436e1a92e
Merge pull request #1612 from ArthurSens/remove-hostport
2022-02-04 12:58:13 +01:00
ArthurSens
fb92a6dbe0
Document better why we use hostPort on node-exporter
...
Signed-off-by: ArthurSens <arthursens2005@gmail.com >
2022-02-03 15:47:43 +00:00
Arunprasad Rajkumar
0eabbb5d0c
Sanitize all regex denylist in ksm-lite addon
...
This is a follow up fix of https://github.com/prometheus-operator/kube-prometheus/pull/1613 . @simonpasquier recommended to sanitize all denylist metrics.
Signed-off-by: Arunprasad Rajkumar <arajkuma@redhat.com >
2022-02-03 19:52:44 +05:30
Arunprasad Rajkumar
f16277e8e0
Sanitize regex denylist in ksm-lite addon
...
The following metrics are missing from kube-state-metrics:
- kube_pod_container_status_terminated_reason
- kube_pod_init_container_status_terminated_reason
- kube_pod_status_scheduled_time
Previously, some metrics were removed from kube-state-metrics by adding the following --metric-denylist argument to the kube-state-metrics container
```
--metric-denylist=
kube_.+_created,
kube_.+_metadata_resource_version,
kube_replicaset_metadata_generation,
kube_replicaset_status_observed_generation,
kube_pod_restart_policy,
kube_pod_init_container_status_terminated,
kube_pod_init_container_status_running,
kube_pod_container_status_terminated,
kube_pod_container_status_running,
kube_pod_completion_time,
kube_pod_status_scheduled
```
--metric-denylist: Comma-separated list of metrics not to be enabled. This list comprises of exact metric names and/or regex patterns. The allowlist and denylist are mutually exclusive.
However, all the list of metrics is managed as RegEx, thus "kube_pod_container_status_terminated" denies .*kube_pod_container_status_terminated.*, that's why kube_pod_init_container_status_terminated_reason is missing
Co-authored-by: Florian Gleizes <fgleizes@redhat.com >
Signed-off-by: Arunprasad Rajkumar <arajkuma@redhat.com >
2022-02-03 17:50:22 +05:30
Arthur Silva Sens
755d27bb46
Merge pull request #1610 from ArthurSens/as/linux-hardening
...
Drop Linux capabilities
2022-02-02 12:56:21 +00:00
Paweł Krupa (paulfantom)
86ac6f79b1
jsonnet: filter out kube-proxy alerts when kube-proxy is disabled
...
Signed-off-by: Paweł Krupa (paulfantom) <pawel@krupa.net.pl >
2022-02-01 16:22:48 +01:00