Skip to content

Commit c5a1466

Browse files
authored
[docs] Added documentation for default alert settings #611
Closes #611
1 parent 92f13b0 commit c5a1466

File tree

3 files changed

+188
-0
lines changed

3 files changed

+188
-0
lines changed

docs/index.rst

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -41,6 +41,7 @@ the OpenWISP architecture.
4141
./user/device-health-status.rst
4242
./user/metrics.rst
4343
./user/checks.rst
44+
./user/alerts.rst
4445
./user/device-checks-and-alert-settings
4546
./user/configuring-iperf3-check.rst
4647
./user/dashboard-monitoring-charts.rst

docs/user/alerts.rst

Lines changed: 175 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,175 @@
1+
Alerts
2+
======
3+
4+
.. contents:: **Table of contents**:
5+
:depth: 2
6+
:local:
7+
8+
An alert is triggered when a device metric (e.g., ping, CPU usage) crosses
9+
its configured threshold for a specified duration (tolerance). A recovery
10+
alert is sent when the metric returns to normal.
11+
12+
Alerts are based on **Alert Settings** defined for each metric, each
13+
setting includes:
14+
15+
- **Operator**: defines the condition to evaluate the metric value:
16+
17+
- **Greater than**: triggers when the value exceeds the threshold.
18+
- **Less than**: triggers when the value is below the threshold.
19+
20+
- **Threshold**: the metric value that triggers the alert.
21+
- **Tolerance**: the duration (in minutes) for which the threshold must be
22+
breached before an alert is triggered.
23+
24+
OpenWISP Monitoring provides built-in alerts for the following metrics:
25+
26+
.. note::
27+
28+
You can override the default alert settings globally using the
29+
:ref:`openwisp_monitoring_metrics` setting, or on a per-device basis
30+
as explained in the :doc:`device-checks-and-alert-settings` section.
31+
32+
.. _ping_alert:
33+
34+
Ping
35+
----
36+
37+
Triggers when the device becomes unreachable via ping. This alert is
38+
enabled by default.
39+
40+
**Default Alert Settings:**
41+
42+
========= =================
43+
Operator ``< (less than)``
44+
Threshold ``1``
45+
Tolerance ``0`` minutes
46+
========= =================
47+
48+
.. note::
49+
50+
The :ref:`ping_check` check should be enabled for the device to
51+
receive this alert.
52+
53+
.. _configuration_applied_alert:
54+
55+
Config Applied
56+
--------------
57+
58+
Triggers when the device fails to apply configuration changes within the
59+
specified time. This alert is enabled by default.
60+
61+
**Default Alert Settings:**
62+
63+
========= =================
64+
Operator ``< (less than)``
65+
Threshold ``1``
66+
Tolerance ``5`` minutes
67+
========= =================
68+
69+
.. note::
70+
71+
The :ref:`config_applied_check` check should be enabled for the device
72+
to receive this alert.
73+
74+
.. _monitoring_data_collected_alert:
75+
76+
Data Collected
77+
--------------
78+
79+
Triggers when no metric data has been collected from the device. This
80+
alert is enabled by default.
81+
82+
**Default Alert Settings:**
83+
84+
========= =================
85+
Operator ``< (less than)``
86+
Threshold ``1``
87+
Tolerance ``30`` minutes
88+
========= =================
89+
90+
.. note::
91+
92+
The :ref:`monitoring_data_collected_check` check should be enabled for
93+
the device to receive this alert.
94+
95+
CPU Usage
96+
---------
97+
98+
Triggers when CPU usage exceeds the threshold. This alert is enabled by
99+
default.
100+
101+
**Default Alert Settings:**
102+
103+
========= ====================
104+
Operator ``> (greater than)``
105+
Threshold ``90`` (percent)
106+
Tolerance ``5`` minutes
107+
========= ====================
108+
109+
Memory Usage
110+
------------
111+
112+
Triggers when memory usage exceeds the threshold. This alert is enabled by
113+
default.
114+
115+
**Default Alert Settings:**
116+
117+
========= ====================
118+
Operator ``> (greater than)``
119+
Threshold ``95`` (percent)
120+
Tolerance ``5`` minutes
121+
========= ====================
122+
123+
Disk Usage
124+
----------
125+
126+
Triggers when disk usage exceeds the threshold. This alert is enabled by
127+
default.
128+
129+
**Default Alert Settings:**
130+
131+
========= ====================
132+
Operator ``> (greater than)``
133+
Threshold ``90`` (percent)
134+
Tolerance ``0`` minutes
135+
========= ====================
136+
137+
.. _wifi_clients_alert:
138+
139+
WiFi Clients (Max)
140+
------------------
141+
142+
Triggers when the number of connected WiFi clients exceeds the threshold.
143+
This alert is disabled by default.
144+
145+
**Default Alert Settings:**
146+
147+
========= ====================
148+
Operator ``> (greater than)``
149+
Threshold ``50``
150+
Tolerance ``120`` minutes
151+
========= ====================
152+
153+
.. note::
154+
155+
The :ref:`wifi_clients_check` check should be enabled for the device
156+
to receive this alert.
157+
158+
WiFi Clients (Min)
159+
------------------
160+
161+
Triggers when the number of connected WiFi clients falls below the
162+
threshold. This alert is disabled by default.
163+
164+
**Default Alert Settings:**
165+
166+
========= =================
167+
Operator ``< (less than)``
168+
Threshold ``1``
169+
Tolerance ``0`` minutes
170+
========= =================
171+
172+
.. note::
173+
174+
The :ref:`wifi_clients_check` check should be enabled for the device
175+
to receive this alert.

docs/user/checks.rst

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,9 @@ choose to disable auto creation of this check by setting
1919
You can change the default values used for ping checks using
2020
:ref:`openwisp_monitoring_ping_check_config` setting.
2121

22+
This check also :ref:`sends an alert when the device becomes unreachable
23+
<ping_alert>`.
24+
2225
.. _config_applied_check:
2326

2427
Configuration Applied
@@ -35,6 +38,9 @@ configuration status of a device changes, this ensures the check reacts
3538
quickly to events happening in the network and informs the user promptly
3639
if there's anything that is not working as intended.
3740

41+
This check also :ref:`sends an alert if configuration is not being applied
42+
<configuration_applied_alert>`.
43+
3844
.. _monitoring_data_collected_check:
3945

4046
Monitoring Data Collected
@@ -45,6 +51,9 @@ devices in a timely manner. You may choose to disable auto creation of
4551
this check by using the setting
4652
:ref:`openwisp_monitoring_auto_data_collected_check`.
4753

54+
This check also :ref:`sends an alert when no data is received from a
55+
device <monitoring_data_collected_alert>`.
56+
4857
.. _iperf3_check:
4958

5059
Iperf3
@@ -109,3 +118,6 @@ You can use the
109118
:ref:`openwisp_monitoring_wifi_clients_check_snooze_schedule` setting to
110119
disable this check on specific dates, such as during scheduled
111120
maintenance, to avoid generating unnecessary alerts.
121+
122+
This check also :ref:`sends alerts based on WiFi client thresholds
123+
<wifi_clients_alert>`.

0 commit comments

Comments
 (0)