Commit Graph

284 Commits

Author SHA1 Message Date
Max Inden
73e9a9882f Merge pull request #1206 from brancz/jsonnet
Convert kube-prometheus to jsonnet
2018-04-17 09:52:49 +02:00
naseemkullah
73b3259f62 Update kube-prometheus-on-kubeadm.md
In kubeadm 1.10 (and perhaps earlier versions, not sure), the label key of the ctrl-mgr and scheduler pods is `component=` rather than `k8s-app=`, which is the selector key that the kube-prometheus exporters use for these components.
2018-04-15 02:01:30 -04:00
Max Inden
b1aae67d99 Merge pull request #1219 from coreos/thanos4
Enable Thanos sidecar
2018-04-13 17:05:38 +02:00
Max Leonard Inden
3b13afb562 kube-prometheus/thanos: Move to experimental folder
In preparation for PR #1206 [1], this patch moves the Thanos manifests to the
`experimental` folder.

[1] https://github.com/coreos/prometheus-operator/pull/1206
2018-04-13 15:10:18 +02:00
Frederic Branczyk
507617e150 Remove old manifests and replace with jsonnet build 2018-04-13 13:36:34 +02:00
Fabian Reinartz
e7e23cd81f contrib: add Thanos example manifests 2018-04-13 11:53:43 +02:00
Frederic Branczyk
d8692794a9 kube-prometheus: Convert to jsonnet 2018-04-11 18:51:35 +02:00
Frederic Branczyk
0d142fe9da Merge pull request #1205 from zuzzas/cpu-rules
Fixed CPU accounting in recording rules
2018-04-11 15:08:15 +02:00
Arslanbekov Denis
a2d273b11a Display the correct namespace in the description (#1190)
* display the correct namespace in the description

* Bump kube state version

* Update Chart.yaml
2018-04-10 17:18:24 +02:00
Andrey Klimentyev
f9b03ddd9d kube-prometheus: fixed CPU accounting
Currently, node recording rules feature
an incorrect idle CPU accounting. This
change aims to fix that.
2018-04-10 15:14:48 +03:00
Max Leonard Inden
b10e343689 kube-prometheus: Fix minor typo 2018-04-10 10:27:54 +02:00
Frederic Branczyk
63ae4ee19e Merge pull request #1200 from coreos/release-0.18
Cut 0.18.1 back to master
2018-04-09 18:32:57 +02:00
Max Leonard Inden
46c49616e0 *: Bump version to v0.18.1 2018-04-09 14:47:34 +02:00
Keyvan Hedayati
b6e2f2ae31 Minor typo fix 2018-04-07 17:09:22 +04:30
Richard Maynard
889eca2cd4 charts: Add Core DNS Support (#1176)
* charts: Add Core DNS Support

With CoreDNS becoming a first class citizen it should be an optional part
of the kube-prometheus deployment.

Fixes: #1174

* added kube-prometheus service and service monitor

In order to work directly with a cluster created by bootkube, used port 9153
and created a service exposing metrics, since bootkube does not. Also changed
the default port in the helm chart to 9153, since that is the CoreDNS plugin's
default port, to be consistent throughout the repo.
2018-04-05 09:57:56 +02:00
Frederic Branczyk
b566db5d4b *: Bump version to v0.18.0 2018-04-04 10:06:38 +02:00
Sébastien GLON
2c10f81102 Add new alert for samples rejected due to duplicate timestamp (#1148)
Signed-off-by: Sébastien GLON <sebastien.glon@akeneo.com>
2018-03-26 18:11:40 +02:00
Frederic Branczyk
22513e0475 Merge pull request #1145 from brancz/targetlabels
prometheus: Add ability to whitelist Kubernetes labels
2018-03-26 05:49:39 -04:00
Michael Pietzsch
e50adc0926 Grafana Update to 5.0.3 (#1149) 2018-03-26 05:25:41 -04:00
Frederic Branczyk
3246c48913 prometheus: Add ability to whitelist Kubernetes labels 2018-03-23 16:13:28 +01:00
Alexander Holte-Davidsen
4c77a9db1d Update Alert Manager rules for NodeDiskRunningFull with summary 2018-03-22 11:32:38 +01:00
Frederic Branczyk
07263e98c0 *: upgrade client-go 2018-03-21 14:24:45 +01:00
Frederic Branczyk
3c278a202f Merge branch 'master' into restrict-service-monitor 2018-03-20 08:47:43 -04:00
Max Leonard Inden
310f471cf6 contrib/kube: Add rbac role to discover prometheus
The current example-app setup in the kube-prometheus project is able to
discover scraping targets in the default namespace. It is not able to
discover the configured Alertmanager in the monitoring namespace.

This patch adds an alertmanager-discovery rbac role, to permit the
above described action. In addition it does the following cleanups:

- Remove kubeconfig configuration in deploy and teardown script. kubectl
defaults to .kube/config whenever KUBECONFIG is not set.

- Remove namespace specification option via NAMESPACE env var. In most
of the manifests the metadata/namespace was hardcoded anyway; in
addition, in the prometheus frontend role binding the service account
namespace is hardcoded to default as well.

- Instead of running `kubectl {apply,delete}` on individual manifests, the
deploy and teardown shell scripts {apply,delete} the entire folder.
2018-03-19 17:13:33 +01:00
Max Leonard Inden
2f302943fe Unify rule file label across repository
In some cases we have been using `alert-rules`, in others
`prometheus-rulefiles`. This led to confusion [1]. Instead, unify the
Prometheus rules configmap labels to:

```yaml
labels:
  role: alert-rules
```

[1] https://github.com/coreos/prometheus-operator/issues/1102
2018-03-19 14:24:48 +01:00
Max Leonard Inden
3a6d38ea0e contrib/kube: Restrict example-app servicemonitor to default ns
The `prometheus-frontend` role of the example app kubeprometheus section
is scoped to the default namespace. Thereby the frontend Prometheus
instance is not able to discover anything outside of the default
namespace. We might as well restrict the front end service monitor to
the default namespace too.
2018-03-19 13:34:18 +01:00
Laurent Godet
0bda22e380 Update Prometheus to 2.2.1 2018-03-14 19:34:35 +00:00
Riccardo Setti
e497e48e07 Add how to enable cAdvisor support on GKE 2018-03-12 19:41:03 +01:00
Jesse Stuart
f0a86796f0 docs: Add missing argument in kubeadm workaround script.
One of the `sed` commands for updating the kubeadm systemd
file (`/etc/systemd/system/kubelet.service.d/10-kubeadm.conf`)
was missing a `-i` reference to the file itself, causing it
to hang indefinitely if run as declared.

I also wrapped this second `sed` in an `if grep ...` check,
in order to make the operation idempotent.
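
The guarded in-place edit described above can be sketched roughly as follows. This is a minimal illustration, not the actual workaround script: the helper name `ensure_flag`, the flag, and the temporary demo file are assumptions made for the example, and it assumes GNU `sed` (BSD `sed` needs `-i ''`).

```shell
# Hypothetical helper: append a flag to the ExecStart line of a systemd
# drop-in file, but only if the flag is not already present.
ensure_flag() {
    conf_file="$1"
    flag="$2"
    # grep guard: skip the edit when the flag already exists, so running
    # the script repeatedly leaves the file unchanged (idempotent).
    if ! grep -qF -- "$flag" "$conf_file"; then
        # -i edits the named file in place. Without a file operand, sed
        # reads from stdin and appears to hang.
        sed -i -e "s|^ExecStart=\(..*\)\$|ExecStart=\1 $flag|" "$conf_file"
    fi
}

# Demo on a throwaway file (illustrative content, not the real kubeadm conf).
conf="$(mktemp)"
printf 'ExecStart=/usr/bin/kubelet --foo\n' > "$conf"
ensure_flag "$conf" "--authentication-token-webhook"
ensure_flag "$conf" "--authentication-token-webhook"   # second run is a no-op
cat "$conf"
```

The pattern requires a non-empty `ExecStart=` value, so an empty reset line (`ExecStart=`) in a drop-in file would be left untouched.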
2018-03-11 07:04:09 -04:00
Francisco Ripoli
5d33cbb166 example service monitoring updated with missing config
added service account, role and role binding for the prometheus frontend
example, also updated prometheus to use the correct service account

fixes #1049
2018-03-08 17:36:05 +00:00
Alexander Holte-Davidsen
8b6ee5c18b Add summary to Alertmanager rules where missing - updated according to guidelines 2018-03-05 09:52:51 +01:00
Jordan Jennings
c54468ab7b Update to grafana 5.0.0 2018-03-01 11:32:42 -05:00
Frederic Branczyk
89d6181e83 Merge pull request #1032 from atoato88/change-kubelet-alert-rule
Update alert rule for kubelet
2018-03-01 11:37:52 +01:00
Akihito INOH
7fe4506ae4 Update alert rule for kubelet
Update the alert rule's kubelet down ratio check from 1% to 10%.
It was changed to 1% in #774; this returns it to 10%.
2018-03-01 14:10:27 +09:00
Carlos Eduardo
14d57e1fec Added information on webhook token authentication
Added information on webhook token authentication and information on how to update the manifests if Kubernetes is already deployed with kubeadm.
2018-02-28 13:36:28 -05:00
Antoine Legrand
9764d157b1 Merge pull request #1010 from coreos/no_ingest_alert
Add alert if samples aren't ingested
2018-02-28 10:43:23 +01:00
Jordan Jennings
eb636277cb Update to Prometheus v2.2.0-rc.1 2018-02-26 10:11:56 -05:00
Frederic Branczyk
c131e54ad6 Merge pull request #995 from jordanjennings/patch-3
Update to grafana 5.0.0-beta5
2018-02-26 15:27:49 +01:00
Antoine Legrand
0ae6c98a48 Add alert if no samples are ingested 2018-02-26 14:57:39 +01:00
jordanjennings
1c3fb8462f Update to grafana 5.0.0-beta5 2018-02-26 08:24:11 -05:00
Dan Mace
80b2a511be Give operator permission to manage alertmanagers
Add a missing resource to the operator's cluster role allowing it to correctly
manage alertmanagers.
2018-02-16 12:40:32 -05:00
Frederic Branczyk
414b380cfa *: Bump Prometheus Operator version to v0.17.0 2018-02-14 14:01:23 +01:00
Frederic Branczyk
863156dcc9 *: bump Prometheus and Alertmanager versions 2018-02-14 14:01:22 +01:00
Dan Mace
936622aa3e Add missing resource to operator role
Add a missing 'prometheuses/finalizers' resource to the operator cluster role.
Without this role, the operator will fail to create a default secret for a
Prometheus instance in the absence of any service monitors.

The fix seems to already be present in the `examples` version of the role; this
patch brings the contrib version in line with the example.
2018-02-13 16:05:17 -05:00
Frederic Branczyk
e8a19234dc kube-prometheus/hack: Add generation for grafana dashboard source file 2018-02-12 11:50:04 +01:00
Antoine Legrand
8a0c21ff29 Merge pull request #968 from brancz/grafana-v5
kube-prometheus: Upgrade to grafana v5
2018-02-12 10:41:07 +01:00
Frederic Branczyk
85f88025f3 kube-prometheus: Upgrade to grafana v5 2018-02-09 13:21:37 +01:00
Frederic Branczyk
acb10c1678 Merge pull request #973 from Ludek2/fixing#972
fixing#972
2018-02-09 11:14:01 +01:00
Scott Brenner
51515b1e2b Update wrap-dashboard.sh
Might as well, right?
2018-02-08 12:50:19 -08:00
ludek_navratil
37f7fd52e8 fixing#972 2018-02-08 17:21:44 +00:00