Commit Graph

704 Commits

Author SHA1 Message Date
Haoyu Sun 1e632d2ec8 update go-jsonnet for linter bugfix 2021-10-21 11:59:44 +02:00
dgrisonnet fbf9ed0f98 [bot] [main] Automated version update 2021-10-18 07:39:30 +00:00
Markus Heinemann f1cff459de fix: upgrade deprecated policy/v1beta1 kubernetes api 2021-10-12 09:00:09 +00:00
dgrisonnet be3bd2c97b [bot] [main] Automated version update 2021-10-11 07:39:24 +00:00
Haoyu Sun b6c97fc6c0 remove "app" label selector deprecated by prometheus-operator 2021-10-05 19:59:39 +02:00
dgrisonnet fe374485a1 [bot] [main] Automated version update 2021-10-04 07:39:27 +00:00
Damien Grisonnet 374413f10a Merge pull request #1409 from dgrisonnet/drop-pa-metrics
Drop some of the metrics exposed by prometheus-adapter
2021-09-30 17:45:15 +02:00
Damien Grisonnet 5ebbb65276 jsonnet: drop some of prometheus-adapter metrics
The current implementation of prometheus-adapter exposes a lot of
metrics about the health of its aggregated apiserver. The issue is that
some of these metrics are not very useful in the context of
prometheus-adapter, and we currently can't avoid exposing them since
they are registered to the Kubernetes global Prometheus registry. Until
this is improved in upstream Kubernetes, we could benefit from dropping
some of the metrics that are not very useful.

Before this change, a default kube-prometheus installation had 800+
series for prometheus-adapter, against 400+ after it, so we divided the
number of series by two while focusing on the most valuable metrics for
prometheus-adapter.

Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>
2021-09-29 13:02:00 +02:00
Jan Fajerski 6fa097c0ed jsonnet/node-exporter: adjust to node-exporter v1.2.0 arg name change
In version [node-exporter v1.2.0](https://github.com/prometheus/node_exporter/releases/tag/v1.2.0)
two argument name changes were introduced. While the old names still
work (with a deprecation warning), let's use the new names.

Signed-off-by: Jan Fajerski <jfajersk@redhat.com>
2021-09-29 11:15:10 +02:00
Damien Grisonnet a2eee1803a Merge pull request #1404 from prometheus-operator/automated-updates-main
[bot] [main] Automated version update
2021-09-28 11:56:30 +02:00
simonpasquier 9a3d5d42e1 [bot] [main] Automated version update 2021-09-28 08:41:28 +00:00
Philip Gough 58e2c131c2 Keep 'container_fs_.*' metrics from cAdvisor 2021-09-27 17:13:00 +01:00
dgrisonnet 02776a1d37 [bot] [main] Automated version update 2021-09-27 09:53:31 +00:00
Philip Gough 56f96e6389 Adjust dropped metrics from cAdvisor
This change drops pod-centric metrics without a non-empty 'container' label.

Previously we dropped pod-centric metrics without a (pod, namespace) label set;
however, these can be critical for debugging.
2021-09-24 17:24:01 +01:00
Damien Grisonnet 7f1092cdde Merge pull request #1344 from PhilipGough/MON-1085
jsonnet: Support scraping the config-reloader for AlertManager and Pr…
2021-09-22 16:16:48 +02:00
Philip Gough 7b32afb8aa jsonnet: Support scraping the config-reloader for AlertManager and Prometheus 2021-09-22 14:54:12 +01:00
dgrisonnet a232cca3b6 [bot] [main] Automated version update 2021-09-20 07:39:09 +00:00
Sylvain Pasche 6d5c1b793c Always generate grafana-config secret
Since https://github.com/brancz/kubernetes-grafana/pull/115, upstream
grafana contains a non-empty config. Generate the grafana-config secret
unconditionally even if no user config is passed.
2021-09-16 14:25:53 +02:00
dgrisonnet 6654c13142 [bot] [main] Automated version update 2021-09-13 07:39:05 +00:00
dgrisonnet b663558924 [bot] [main] Automated version update 2021-09-07 07:22:35 +00:00
Damien Grisonnet 6f744e24a5 Merge pull request #1357 from arajkumar/adjust-NodeFilesystemSpaceFillingUp-warning-threshold
Adjust node filesystem space filling up warning threshold to 20%
2021-09-06 19:04:29 +02:00
Arunprasad Rajkumar 16ef18ec54 assets: regenerate
Signed-off-by: Arunprasad Rajkumar <arajkuma@redhat.com>
2021-09-01 13:33:41 +05:30
dgrisonnet a1c6a4e21d [bot] [main] Automated version update 2021-08-30 07:39:09 +00:00
simonpasquier eb52023db2 [bot] [main] Automated version update 2021-08-25 09:37:24 +00:00
Damien Grisonnet 46ba787a94 manifests: regenerate
Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>
2021-08-20 13:52:12 +02:00
Damien Grisonnet b5ec93208b jsonnet: drop deprecated etcd metric
Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>
2021-08-18 17:27:50 +02:00
Damien Grisonnet 45adc03cfb jsonnet: update prometheus-adapter to v0.9.0
Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>
2021-08-17 18:05:45 +02:00
dgrisonnet 6ade9e5c7d [bot] [main] Automated version update 2021-08-17 08:05:49 +00:00
paulfantom ad3fc8920e [bot] [main] Automated version update 2021-08-16 08:04:51 +00:00
Dimitrije Manic 12cd7fd9ce Prometheus ruleSelector defaults to all rules 2021-08-11 10:16:24 -04:00
dgrisonnet e97eb0fbe9 [bot] [main] Automated version update 2021-08-02 13:37:08 +00:00
Paweł Krupa b9c73c7b29 Merge pull request #1283 from prashbnair/node-veth
changing node exporter ignore list
2021-07-28 09:17:03 +02:00
Prashant Balachandran 09fdac739d changing node exporter ignore list 2021-07-27 17:17:19 +05:30
lanmarti ed48391831 Add resource requests and limits to prometheus-adapter container 2021-07-27 12:19:51 +02:00
paulfantom 05c72f83ef [bot] Automated version update 2021-07-26 13:44:14 +00:00
Manuel Rüger acd1eeba4c node.libsonnet: Fix small typo
Signed-off-by: Manuel Rüger <manuel@rueg.eu>
2021-07-22 19:14:24 +02:00
paulfantom 755d2fe5c1 manifests: regenerate 2021-07-22 17:31:30 +02:00
Paweł Krupa acea5efd85 Merge pull request #1268 from paulfantom/alerts-best-practices
Alerts best practices
2021-07-21 09:32:32 +02:00
Philip Gough 463ad065d3 jsonnet: Drop cAdvisor metrics with no (pod, namespace) labels while preserving ability to monitor system services resource usage
The following provides a description and cardinality estimation based on the tests in a local cluster:

container_blkio_device_usage_total - useful for containers, but not for system services (nodes*disks*services*operations*2)
container_fs_.*                    - add filesystem read/write data (nodes*disks*services*4)
container_file_descriptors         - file descriptors limits and global numbers are exposed via (nodes*services)
container_threads_max              - max number of threads in cgroup. Usually for system services it is not limited (nodes*services)
container_threads                  - used threads in cgroup. Usually not important for system services (nodes*services)
container_sockets                  - used sockets in cgroup. Usually not important for system services (nodes*services)
container_start_time_seconds       - container start. Possibly not needed for system services (nodes*services)
container_last_seen                - Not needed as system services are always running (nodes*services)
container_spec_.*                  - Everything related to cgroup specification and thus static data (nodes*services*5)
2021-07-20 12:50:02 +01:00
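The cardinality formulas in the commit message above can be turned into a quick estimate. The following is a minimal sketch, not part of the commit: the function name and the cluster sizes (3 nodes, 2 disks, 10 system services, 4 blkio operations) are hypothetical values chosen only for illustration.

```python
def estimate_series(nodes, disks, services, operations):
    """Rough per-metric series counts, using the formulas quoted in the commit message."""
    per_service = nodes * services  # the (nodes*services) term shared by several metrics
    return {
        "container_blkio_device_usage_total": nodes * disks * services * operations * 2,
        "container_fs_.*": nodes * disks * services * 4,
        "container_file_descriptors": per_service,
        "container_threads_max": per_service,
        "container_threads": per_service,
        "container_sockets": per_service,
        "container_start_time_seconds": per_service,
        "container_last_seen": per_service,
        "container_spec_.*": per_service * 5,
    }


# Hypothetical cluster sizes, for illustration only.
estimate = estimate_series(nodes=3, disks=2, services=10, operations=4)
total = sum(estimate.values())

for metric, count in estimate.items():
    print(f"{metric:40s} {count}")
print(f"{'total':40s} {total}")
```

Because every term scales with the number of nodes and services, the estimate grows quickly on larger clusters, which is the motivation the commit gives for dropping these series for system services.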
paulfantom 02454b3f53 manifests: regenerate 2021-07-20 11:14:28 +02:00
paulfantom 1a3c610c61 [bot] Automated version update 2021-07-19 13:44:23 +00:00
Yury Gargay 9b08b941f8 Update kubernetes-mixin
From https://github.com/kubernetes-monitoring/kubernetes-mixin/commit/b710a868a95621aa93e0b661954f63f4db82aaea
2021-07-14 18:51:36 +02:00
Damien Grisonnet 97e77e9996 Merge pull request #1231 from dgrisonnet/fix-adapter-queries
Consolidate intervals used in prometheus-adapter CPU queries
2021-07-07 13:48:02 +02:00
Philip 3e6865d776 Generate kubernetes-mixin 2021-07-06 17:49:32 +02:00
Damien Grisonnet b9563b9c2d jsonnet: improve adapter queries readability
Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>
2021-07-05 15:29:45 +02:00
Damien Grisonnet 8812e45501 jsonnet: readjust prometheus-adapter intervals
Previously, the prometheus-adapter configuration did not take into account
the scrape interval of kubelet, node-exporter and windows-exporter,
which led to stale results, and even negative results from the
CPU queries when the irate() function was extrapolating data.
To fix that, we set the interval used in the irate() function in
the CPU queries to 4x the scrape interval in order to extrapolate data
between the last two scrapes. This improves the freshness of the CPU
usage exposed and prevents incorrect extrapolations.

Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>
2021-07-05 15:28:25 +02:00
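The "4x the scrape interval" rule described in the commit above can be sketched as a small helper. This is an illustration under stated assumptions, not the adapter's real configuration: the helper names, the 30-second scrape interval, and the simplified query shape are all hypothetical.

```python
def irate_window(scrape_interval_seconds: int, factor: int = 4) -> str:
    """Return the range-vector window as a Prometheus duration string.

    The factor of 4 follows the commit message; the helper itself is hypothetical.
    """
    return f"{scrape_interval_seconds * factor}s"


def cpu_query(metric: str, scrape_interval_seconds: int) -> str:
    """Build a simplified irate() query (not the adapter's actual query)."""
    window = irate_window(scrape_interval_seconds)
    return f"sum(irate({metric}[{window}]))"


# Assumed scrape interval of 30 seconds, for illustration only.
query = cpu_query("container_cpu_usage_seconds_total", 30)
print(query)
```

Since irate() only uses the last two samples inside the window, a window tied to the scrape interval keeps those two samples recent while still tolerating a missed scrape.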
Sunil Thaha 0280f4ddf9 jsonnet: kube-prometheus adapt to changes to veth interfaces names
With OVN, the container veth network interface names that used to start
with `veth` have now changed to `<rand-hex>{15}@if<number>` (see Related
Links below).

This patch adapts to the change introduced in OVN and ignores the network
interfaces that match `[a-z0-9]{15}@if\d+` in addition to those starting
with `veth`.

Related Links:
  - https://github.com/openshift/ovn-kubernetes/blob/master/go-controller/vendor/github.com/containernetworking/plugins/pkg/ip/link_linux.go#L107
  - https://github.com/openshift/ovn-kubernetes/blob/master/go-controller/pkg/cni/helper_linux.go#L148

Signed-off-by: Sunil Thaha <sthaha@redhat.com>
2021-07-01 12:01:19 +10:00
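The interface-name matching described in the commit above can be checked with a short script. This is a sketch only: the pattern `[a-z0-9]{15}@if\d+` comes from the commit message, but combining it with a `veth` prefix in a single anchored regex, and the sample interface names, are assumptions made for illustration rather than the exact expression used in the jsonnet.

```python
import re

# Assumed combination of the two cases named in the commit message:
# names starting with "veth", or 15 lowercase alphanumerics followed by "@if<number>".
IGNORED_DEVICE = re.compile(r"^(veth.*|[a-z0-9]{15}@if\d+)$")


def is_ignored(interface_name: str) -> bool:
    """Return True if the interface name would be excluded by the assumed pattern."""
    return IGNORED_DEVICE.match(interface_name) is not None


# Hypothetical interface names, for illustration only.
for name in ["veth1a2b3c", "0123456789abcde@if4", "eth0", "abc@if4"]:
    print(f"{name:22s} ignored={is_ignored(name)}")
```

Anchoring the pattern on both ends keeps ordinary interfaces such as `eth0` from being excluded by a partial match.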
Damien Grisonnet 2c5c20cfff Merge pull request #1216 from fpetkovski/prometheus-adapter-cipher-suites
jsonnet: disable insecure cipher suites for prometheus-adapter
2021-06-23 21:19:24 +02:00
paulfantom d0e21f34e5 [bot] Automated version update 2021-06-23 13:41:46 +00:00
fpetkovski 0959155a1c jsonnet: update downstream dependencies
This commit updates all downstream dependencies

Signed-off-by: fpetkovski <filip.petkovsky@gmail.com>
2021-06-22 16:27:29 +02:00