Previously we dropped pod-centric metrics without a (pod, namespace)
label set; however, these can be critical for debugging.
Keep 'container_fs_.*' metrics from cAdvisor
The following provides a description and a cardinality estimate for each metric, based on tests in a local cluster:
container_blkio_device_usage_total - useful for containers, but not for system services (nodes*disks*services*operations*2)
container_fs_.* - adds filesystem read/write data (nodes*disks*services*4)
container_file_descriptors - file descriptor limits and global numbers are exposed elsewhere (nodes*services)
container_threads_max - max number of threads in cgroup. Usually for system services it is not limited (nodes*services)
container_threads - used threads in cgroup. Usually not important for system services (nodes*services)
container_sockets - used sockets in cgroup. Usually not important for system services (nodes*services)
container_start_time_seconds - container start. Possibly not needed for system services (nodes*services)
container_last_seen - Not needed as system services are always running (nodes*services)
container_spec_.* - Everything related to cgroup specification and thus static data (nodes*services*5)
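A minimal sketch of how this could look as a metric_relabel_configs drop rule on the cAdvisor endpoint, assuming the affected series are identified by empty pod/namespace labels and that container_fs_.* is simply left out of the drop regex (the actual rule in the repo may differ):

metric_relabel_configs:
  # Drop the cAdvisor series listed above when both pod and namespace are empty;
  # container_fs_.* is intentionally absent from the regex, so it is kept.
  - source_labels: [__name__, pod, namespace]
    regex: 'container_(blkio_device_usage_total|file_descriptors|threads|threads_max|sockets|start_time_seconds|last_seen|spec_.*);;'
    action: drop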
etcd refactored their repo, moving and renaming etcd-mixin. The
jsonnetfile depended on "master" even though the lock file pinned an
older version. Checking out the last commit before the move works.
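As a sketch, the workaround amounts to pinning the dependency in jsonnetfile.json to that commit instead of "master"; the remote/subdir shown here are assumptions and the version is a placeholder for the actual commit SHA:

{
  "dependencies": [
    {
      "name": "etcd-mixin",
      "source": {
        "git": {
          "remote": "https://github.com/etcd-io/etcd",
          "subdir": "Documentation/etcd-mixin"
        }
      },
      "version": "<sha-of-last-commit-before-the-move>"
    }
  ]
}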
kube-apiserver has a histogram etcd_request_duration_seconds that
measures latency between the kube-apiserver and the etcd instance.
This metric is currently dropped by cluster-prometheus. Enable
this metric so we have visibility into etcd latency.
We ensured that this does not enable other unwanted metrics:
count by(name) ({name=~"etcd_request.+"})
etcd_request_duration_seconds_bucket
etcd_request_duration_seconds_count
etcd_request_duration_seconds_sum
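With the histogram kept, etcd latency as seen from the apiserver can be inspected with a query along these lines (the quantile, rate window, and the operation grouping label are illustrative choices):

histogram_quantile(0.99, sum by (le, operation) (rate(etcd_request_duration_seconds_bucket[5m])))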
Previously the alert would fire when the number of Alertmanager pods
didn't match the number of replicas defined in the Alertmanager spec
even though all the running pods had the same configuration hash. This
type of issue is already covered by KubeStatefulSetUpdateNotRolledOut
(and possibly KubePodNotReady); having AlertmanagerConfigInconsistent
also active in this situation creates unnecessary noise.
With this change, the alert expression only returns results when
Alertmanager pods have different configuration hash values,
irrespective of the number of pod replicas. The message annotation
has also been enhanced to report
the configuration hash for each pod.
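A sketch of what such an expression can look like, using count_values to group pods by their alertmanager_config_hash value and flagging services that see more than one distinct hash (the job selector is a placeholder):

count by (namespace, service) (
  count_values by (namespace, service) ("config_hash", alertmanager_config_hash{job="alertmanager-main"})
) != 1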
Signed-off-by: Simon Pasquier <spasquie@redhat.com>
Ignore kubelet pod filesystem mounts of the form:
/var/lib/kubelet/pods/1b260ce7-e75d-44d4-8409-922d2bd0851f/volumes...
Usage data for these volumes is already available via the
kubelet_volume_stats* metrics.
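A minimal sketch of how these mounts could be filtered, assuming the series expose the mount path in a mountpoint label (some collectors use device instead) and that the filtering happens via metric_relabel_configs rather than a collector flag:

metric_relabel_configs:
  # Drop filesystem series whose mount path is a per-pod kubelet volume;
  # kubelet_volume_stats* already covers these volumes.
  # 'mountpoint' is an assumed label name here.
  - source_labels: [mountpoint]
    regex: '/var/lib/kubelet/pods/.+'
    action: drop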