Grafana Alerting
In recipes for this section, we will discuss
- How to setup a new alert (opens in a new tab) for your application
- How to link alerts with specific contact points (opens in a new tab) for better relevant targeting ⇲.
We, somewhat arbitrary, split the alert into 3 different categories
- Alerting on application system resources (CPU, Memory, Disk, etc.) ⇲
- Alerting on application's logs and specific keywords or pattern in those logs ⇲
- Alerting on availability of specific endpoints or data provided by those endpoints ⇲
This categorization is in line with the supporting Grafana applications/data sources, that all have different use-cases and all need a bit different syntax. Currently we support
- Prometheus (opens in a new tab) is a monitoring tool that sits on top of our kubernetes infrastructure and provides various metrics about nodes and application health, resources, etc.
- Loki (opens in a new tab) is a Grafana application that specializes in logs monitoring and alerting.
- Infinity (opens in a new tab) is a very simple application that provides HTTP requests capabilities, it can be used to monitor health endpoint and is capable of parsing JSON responses and alerting on them
And for more information on how all of this is setup, please visit Monitoring and Observability section.