Skip to content

Commit 584c7ce

Browse files
committed
[docs] Improved docs: metrics, checks and alerts
Give priority to visuals, list general information first, better interlinking of metrics, checks and alerts, fixed minor issues with the wifi clients check section.
1 parent e2fd0c1 commit 584c7ce

File tree

4 files changed

+228
-148
lines changed

4 files changed

+228
-148
lines changed

docs/user/alerts.rst

Lines changed: 72 additions & 44 deletions
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,9 @@ Alerts
55
:depth: 2
66
:local:
77

8+
Introduction
9+
------------
10+
811
An alert is triggered when a device metric (e.g., ping, CPU usage) crosses
912
its configured threshold for a specified duration (tolerance). A recovery
1013
alert is sent when the metric returns to normal.
@@ -22,21 +25,26 @@ setting includes:
2225
continuously breached before triggering an alert. Helps reduce noise
2326
from flapping metrics.
2427

25-
OpenWISP Monitoring provides built-in alerts for the following metrics:
26-
2728
.. note::
2829

2930
You can override the default alert settings globally using the
3031
:ref:`openwisp_monitoring_metrics` setting, or on a per-device basis
3132
as explained in the :doc:`device-checks-and-alert-settings` section.
3233

34+
The built-in alerts are explained below.
35+
3336
.. _ping_alert:
3437

3538
Ping
3639
----
3740

38-
Triggers when the device becomes unreachable via ping. This alert is
39-
enabled by default.
41+
Triggers when the device becomes unreachable via ping.
42+
43+
**Alert enabled by default?** Yes.
44+
45+
**Collected via**: :ref:`Ping Check <ping_check>`.
46+
47+
**Charts**: :ref:`Ping Chart <ping>`.
4048

4149
**Default Alert Settings:**
4250

@@ -46,18 +54,17 @@ Threshold ``1``
4654
Tolerance ``30`` minutes
4755
========= =================
4856

49-
.. note::
50-
51-
The :ref:`ping_check` check should be enabled for the device to
52-
receive this alert.
53-
5457
.. _configuration_applied_alert:
5558

5659
Config Applied
5760
--------------
5861

5962
Triggers when the device fails to apply configuration changes within the
60-
specified time. This alert is enabled by default.
63+
specified time.
64+
65+
**Alert enabled by default?** Yes.
66+
67+
**Collected via**: :ref:`Config Applied Check <config_applied_check>`.
6168

6269
**Default Alert Settings:**
6370

@@ -67,18 +74,17 @@ Threshold ``1``
6774
Tolerance ``10`` minutes
6875
========= =================
6976

70-
.. note::
71-
72-
The :ref:`config_applied_check` check should be enabled for the device
73-
to receive this alert.
74-
7577
.. _monitoring_data_collected_alert:
7678

7779
Data Collected
7880
--------------
7981

80-
Triggers when no metric data has been collected from the device. This
81-
alert is enabled by default.
82+
Triggers when no metric data has been collected from the device.
83+
84+
**Alert enabled by default?** Yes.
85+
86+
**Collected via**: :ref:`Config Applied Check
87+
<monitoring_data_collected_check>`.
8288

8389
**Default Alert Settings:**
8490

@@ -88,44 +94,63 @@ Threshold ``1``
8894
Tolerance ``30`` minutes
8995
========= =================
9096

91-
.. note::
97+
.. _memory_usage_alert:
9298

93-
The :ref:`monitoring_data_collected_check` check should be enabled for
94-
the device to receive this alert.
99+
Memory Usage
100+
------------
101+
102+
Triggers when memory usage exceeds the threshold.
95103

96-
CPU Usage
97-
---------
104+
**Alert enabled by default?** Yes.
98105

99-
Triggers when CPU usage exceeds the threshold. This alert is enabled by
100-
default.
106+
**Collected via**: :doc:`OpenWrt Monitoring Agent
107+
</openwrt-monitoring-agent/index>`.
108+
109+
**Charts**: :ref:`Memory Usage Chart <memory_usage>`.
101110

102111
**Default Alert Settings:**
103112

104113
========= ====================
105114
Operator ``> (greater than)``
106-
Threshold ``90`` (percent)
115+
Threshold ``95`` (percent)
107116
Tolerance ``30`` minutes
108117
========= ====================
109118

110-
Memory Usage
111-
------------
119+
.. _cpu_load_alert:
120+
121+
CPU Load Average
122+
----------------
123+
124+
Triggers when CPU usage exceeds the threshold.
125+
126+
**Alert enabled by default?** Yes.
112127

113-
Triggers when memory usage exceeds the threshold. This alert is enabled by
114-
default.
128+
**Collected via**: :doc:`OpenWrt Monitoring Agent
129+
</openwrt-monitoring-agent/index>`.
130+
131+
**Charts**: :ref:`CPU Load Chart <cpu_load_averages>`.
115132

116133
**Default Alert Settings:**
117134

118135
========= ====================
119136
Operator ``> (greater than)``
120-
Threshold ``95`` (percent)
137+
Threshold ``90`` (percent)
121138
Tolerance ``30`` minutes
122139
========= ====================
123140

141+
.. _disk_usage_alert:
142+
124143
Disk Usage
125144
----------
126145

127-
Triggers when disk usage exceeds the threshold. This alert is enabled by
128-
default.
146+
Triggers when disk usage exceeds the threshold.
147+
148+
**Alert enabled by default?** Yes.
149+
150+
**Collected via**: :doc:`OpenWrt Monitoring Agent
151+
</openwrt-monitoring-agent/index>`.
152+
153+
**Charts**: :ref:`Disk Usage Chart <disk_usage>`.
129154

130155
**Default Alert Settings:**
131156

@@ -141,7 +166,13 @@ WiFi Clients (Max)
141166
------------------
142167

143168
Triggers when the number of connected WiFi clients exceeds the threshold.
144-
This alert is disabled by default.
169+
170+
**Alert enabled by default?** No (see :ref:`WiFi Clients Check
171+
<wifi_clients_check>` for details on how to enable it).
172+
173+
**Collected via**: the WiFi clients information is collected through the
174+
:doc:`OpenWrt Monitoring Agent </openwrt-monitoring-agent/index>`, but the
175+
alert is triggered by the :ref:`WiFi Clients Check <wifi_clients_check>`.
145176

146177
**Default Alert Settings:**
147178

@@ -151,16 +182,18 @@ Threshold ``50``
151182
Tolerance ``120`` minutes
152183
========= ====================
153184

154-
.. note::
155-
156-
The :ref:`wifi_clients_check` check should be enabled for the device
157-
to receive this alert.
158-
159185
WiFi Clients (Min)
160186
------------------
161187

162188
Triggers when the number of connected WiFi clients falls below the
163-
threshold. This alert is disabled by default.
189+
threshold.
190+
191+
**Alert enabled by default?** No (see :ref:`WiFi Clients Check
192+
<wifi_clients_check>` for details on how to enable it).
193+
194+
**Collected via**: the WiFi clients information is collected through the
195+
:doc:`OpenWrt Monitoring Agent </openwrt-monitoring-agent/index>`, but the
196+
alert is triggered by the :ref:`WiFi Clients Check <wifi_clients_check>`.
164197

165198
**Default Alert Settings:**
166199

@@ -169,8 +202,3 @@ Operator ``< (less than)``
169202
Threshold ``1``
170203
Tolerance ``0`` minutes
171204
========= =================
172-
173-
.. note::
174-
175-
The :ref:`wifi_clients_check` check should be enabled for the device
176-
to receive this alert.

docs/user/checks.rst

Lines changed: 42 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -5,11 +5,31 @@ Checks
55
:depth: 2
66
:local:
77

8+
Introduction
9+
------------
10+
11+
Active checks are periodic background tasks executed by `Celery workers
12+
<https://docs.celeryq.dev/en/stable/>`_ on the OpenWISP server.
13+
14+
These workers perform the enabled checks and store their results in the
15+
time series database.
16+
17+
The built-in checks and the related django settings available in OpenWISP
18+
Monitoring are described below.
19+
20+
.. include:: /partials/settings-note.rst
21+
822
.. _ping_check:
923

1024
Ping
1125
----
1226

27+
**Check enabled by default?** Yes.
28+
29+
**Alert**: :ref:`Ping Alert <ping_alert>`.
30+
31+
**Charts**: :ref:`Ping Charts <ping>`.
32+
1333
This check returns information on Ping Success Rate and RTT (Round trip
1434
time). It creates charts like Ping Success Rate, Packet Loss and RTT.
1535
These metrics are collected using the ``fping`` Linux program. You may
@@ -27,6 +47,10 @@ This check also :ref:`sends an alert when the device becomes unreachable
2747
Configuration Applied
2848
---------------------
2949

50+
**Check enabled by default?** Yes.
51+
52+
**Alert**: :ref:`Config Applied Alert <configuration_applied_alert>`.
53+
3054
This check ensures that the :doc:`openwisp-config agent
3155
</openwrt-config-agent/index>` is running and applying configuration
3256
changes in a timely manner. You may choose to disable auto creation of
@@ -46,6 +70,10 @@ This check also :ref:`sends an alert if configuration is not being applied
4670
Monitoring Data Collected
4771
-------------------------
4872

73+
**Check enabled by default?** Yes.
74+
75+
**Alert**: :ref:`Data Collected Alert <monitoring_data_collected_alert>`.
76+
4977
This check ensures that the server is receiving metrics from network
5078
devices in a timely manner. You may choose to disable auto creation of
5179
this check by using the setting
@@ -59,13 +87,16 @@ device <monitoring_data_collected_alert>`.
5987
Iperf3
6088
------
6189

90+
**Check enabled by default?** No.
91+
92+
**Charts**: :ref:`Iperf3 Charts <iperf3>`.
93+
6294
This check provides network performance measurements such as maximum
6395
achievable bandwidth, jitter, datagram loss etc of the device using
6496
`iperf3 utility <https://iperf.fr/>`_.
6597

6698
This check is **disabled by default**. You can enable auto creation of
67-
this check by setting the :ref:`openwisp_monitoring_auto_iperf3` to
68-
``True``.
99+
this check by setting :ref:`openwisp_monitoring_auto_iperf3` to ``True``.
69100

70101
You can also :doc:`add the iperf3 check
71102
<device-checks-and-alert-settings>` directly from the device page.
@@ -87,6 +118,10 @@ parameters used for iperf3 checks (e.g. timing, port, username, password,
87118
WiFi Clients
88119
------------
89120

121+
**Check enabled by default?** No.
122+
123+
**Alerts**: :ref:`WiFi Clients Alerts (Max/Min) <wifi_clients_alert>`.
124+
90125
This check sends alerts based on the total number of WiFi Clients
91126
connected to a device. It sends two types of alerts:
92127

@@ -102,14 +137,7 @@ connected to a device. It sends two types of alerts:
102137
connectivity.
103138

104139
This check is **disabled by default**. To enable auto creation of this
105-
check, set :ref:`openwisp_monitoring_auto_wifi_clients_check` to ``True``
106-
and configure the task scheduling in your Django project:
107-
108-
.. code-block:: python
109-
110-
from datetime import timedelta
111-
112-
OPENWISP_MONITORING_AUTO_WIFI_CLIENTS_CHECK = True
140+
check, set :ref:`openwisp_monitoring_auto_wifi_clients_check` to ``True``.
113141

114142
You can also :doc:`add the WiFi Clients check
115143
<device-checks-and-alert-settings>` directly from the device page.
@@ -119,5 +147,7 @@ You can use the
119147
disable this check on specific dates, such as during scheduled
120148
maintenance, to avoid generating unnecessary alerts.
121149

122-
This check also :ref:`sends alerts based on WiFi client thresholds
123-
<wifi_clients_alert>`.
150+
Other related settings:
151+
152+
- :ref:`openwisp_monitoring_wifi_clients_max_check_interval`
153+
- :ref:`openwisp_monitoring_wifi_clients_min_check_interval`

docs/user/intro.rst

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -20,10 +20,11 @@ OpenWISP provides the following monitoring capabilities:
2020
:ref:`packet loss <ping_check>`, :ref:`round trip time (latency)
2121
<ping_check>`, :ref:`associated wifi clients <wifi_clients>`,
2222
:ref:`interface traffic <traffic>`, :ref:`RAM usage <memory_usage>`,
23-
:ref:`CPU load <cpu_load>`, :ref:`flash/disk usage <disk_usage>`, mobile
24-
signal (LTE/UMTS/GSM :ref:`signal strength <mobile_signal_strength>`,
25-
:ref:`signal quality <mobile_signal_quality>`, :ref:`access technology
26-
in use <mobile_access_technology_in_use>`), :ref:`bandwidth <iperf3>`,
23+
:ref:`CPU load <cpu_load_averages>`, :ref:`flash/disk usage
24+
<disk_usage>`, mobile signal (LTE/UMTS/GSM :ref:`signal strength
25+
<mobile_signal_strength>`, :ref:`signal quality
26+
<mobile_signal_quality>`, :ref:`access technology in use
27+
<mobile_access_technology_in_use>`), :ref:`bandwidth <iperf3>`,
2728
:ref:`transferred data <iperf3>`, :ref:`restransmits <iperf3>`,
2829
:ref:`jitter <iperf3>`, :ref:`datagram <iperf3>`, :ref:`datagram loss
2930
<iperf3>`

0 commit comments

Comments
 (0)