14 changes: 12 additions & 2 deletions inputs/activemq/README.md
@@ -1,3 +1,13 @@
# activemq
# ActiveMQ

ActiveMQ can currently be monitored with the jolokia_agent plugin, which collects metrics by reading JMX data. For a sample configuration file, see: [activemq.toml](../../conf/input.jolokia_agent_misc/activemq.toml)
ActiveMQ can be monitored using the `jolokia_agent` plugin, which retrieves metrics by reading JMX data.

For configuration details, please refer to: [activemq.toml](../../conf/input.jolokia_agent_misc/activemq.toml).

## Metrics

Once configured via the Jolokia Agent plugin, Categraf will export the following types of metrics:
- **Broker Metrics**: e.g., `activemq_broker_TotalMessageCount`, `activemq_broker_TotalConsumerCount`
- **Queue Metrics**: e.g., `activemq_queue_QueueSize`, `activemq_queue_ConsumerCount`
- **Topic Metrics**: e.g., `activemq_topic_EnqueueCount`, `activemq_topic_DequeueCount`
- **JVM Metrics**: Generic Java Runtime metrics such as Garbage Collection, Memory Heap, etc.
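
The metric names listed above look like a fixed prefix joined to a JMX attribute name. The following is a minimal sketch of that naming idea only, not Categraf's actual implementation. The helper `flatten_attributes` and the sample payload are hypothetical, and the payload is assumed to resemble the `value` object of a Jolokia read response.

```python
from typing import Dict


def flatten_attributes(prefix: str, attributes: Dict[str, object]) -> Dict[str, float]:
    """Turn {attribute: value} into {prefix + attribute: float(value)}.

    Non-numeric values (strings, booleans, nested objects) are skipped.
    """
    metrics: Dict[str, float] = {}
    for name, value in attributes.items():
        # bool is a subclass of int in Python, so exclude it explicitly.
        if isinstance(value, (int, float)) and not isinstance(value, bool):
            metrics[prefix + name] = float(value)
    return metrics


# Hypothetical attributes, for illustration only.
sample_broker_attributes = {
    "TotalMessageCount": 42,
    "TotalConsumerCount": 3,
    "BrokerName": "localhost",  # non-numeric, so it is skipped
}

broker_metrics = flatten_attributes("activemq_broker_", sample_broker_attributes)
print(broker_metrics)
```

Skipping non-numeric attributes keeps identifying values such as the broker name out of the metric set; in a real collector they would more naturally become labels.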
3 changes: 3 additions & 0 deletions inputs/activemq/README_CN.md
@@ -0,0 +1,3 @@
# activemq

ActiveMQ can currently be monitored with the jolokia_agent plugin, which collects metrics by reading JMX data. For a sample configuration file, see: [activemq.toml](../../conf/input.jolokia_agent_misc/activemq.toml)
142 changes: 142 additions & 0 deletions inputs/activemq/dashboard.json
@@ -0,0 +1,142 @@
{
"title": "ActiveMQ",
"uid": "16eab841",
"tags": [
"activemq"
],
"timezone": "browser",
"schemaVersion": 30,
"panels": [
{
"title": "Total Message Count",
"type": "timeseries",
"gridPos": {
"x": 0,
"y": 0,
"w": 8,
"h": 8
},
"id": 1,
"targets": [
{
"expr": "activemq_broker_TotalMessageCount",
"legendFormat": "{{brokerName}}",
"refId": "A"
}
]
},
{
"title": "Total Consumer Count",
"type": "timeseries",
"gridPos": {
"x": 8,
"y": 0,
"w": 8,
"h": 8
},
"id": 2,
"targets": [
{
"expr": "activemq_broker_TotalConsumerCount",
"legendFormat": "{{brokerName}}",
"refId": "A"
}
]
},
{
"title": "Memory Percent Usage",
"type": "timeseries",
"gridPos": {
"x": 16,
"y": 0,
"w": 8,
"h": 8
},
"id": 3,
"targets": [
{
"expr": "activemq_broker_MemoryPercentUsage",
"legendFormat": "{{brokerName}}",
"refId": "A"
}
]
},
{
"title": "Queue Size",
"type": "timeseries",
"gridPos": {
"x": 12,
"y": 16,
"w": 12,
"h": 8
},
"id": 4,
"targets": [
{
"expr": "activemq_queue_QueueSize",
"legendFormat": "{{destinationName}}",
"refId": "A"
}
]
},
{
"title": "Queue Consumer Count",
"type": "timeseries",
"gridPos": {
"x": 0,
"y": 8,
"w": 12,
"h": 8
},
"id": 5,
"targets": [
{
"expr": "activemq_queue_ConsumerCount",
"legendFormat": "{{destinationName}}",
"refId": "A"
}
]
},
{
"title": "Topic Enqueue Count",
"type": "timeseries",
"gridPos": {
"x": 12,
"y": 8,
"w": 12,
"h": 8
},
"id": 6,
"targets": [
{
"expr": "activemq_topic_EnqueueCount",
"legendFormat": "{{destinationName}}",
"refId": "A"
}
]
},
{
"title": "Topic Dequeue Count",
"type": "timeseries",
"gridPos": {
"x": 0,
"y": 16,
"w": 12,
"h": 8
},
"id": 7,
"targets": [
{
"expr": "activemq_topic_DequeueCount",
"legendFormat": "{{destinationName}}",
"refId": "A"
}
]
}
],
"refresh": "1m",
"time": {
"from": "now-1h",
"to": "now"
}
}
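
Because every panel in a dashboard file like this carries its own `gridPos` rectangle, a quick script can confirm that no two panels occupy the same area before the file is imported. The sketch below is one way to do that, assuming the `panels[].gridPos` layout shown above; the function names and the three-panel sample are hypothetical.

```python
import json
from itertools import combinations


def overlaps(a, b):
    """Return True if two gridPos rectangles share any area."""
    return (
        a["x"] < b["x"] + b["w"]
        and b["x"] < a["x"] + a["w"]
        and a["y"] < b["y"] + b["h"]
        and b["y"] < a["y"] + a["h"]
    )


def find_overlaps(dashboard):
    """Return (id, id) pairs of panels whose gridPos rectangles overlap."""
    panels = dashboard.get("panels", [])
    return [
        (p["id"], q["id"])
        for p, q in combinations(panels, 2)
        if overlaps(p["gridPos"], q["gridPos"])
    ]


# Hypothetical dashboard: panels 2 and 3 intentionally collide.
sample = json.loads("""
{
  "panels": [
    {"id": 1, "gridPos": {"x": 0,  "y": 0, "w": 12, "h": 8}},
    {"id": 2, "gridPos": {"x": 12, "y": 0, "w": 12, "h": 8}},
    {"id": 3, "gridPos": {"x": 16, "y": 0, "w": 8,  "h": 8}}
  ]
}
""")

collisions = find_overlaps(sample)
print(collisions)
```

To check a real file, load it with `json.load(open(path))` and pass the result to `find_overlaps`; an empty list means no panels collide.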
8 changes: 8 additions & 0 deletions inputs/aliyun/README.md
@@ -24,6 +24,14 @@ RAM user authorization. Before a RAM user can call the CloudMonitor API, the owning Alibaba Cloud

4. Configuration

Permissions required to attach labels, by namespace:
- rds:
- `rds:DescribeDBInstances` `acs:rds:{#regionId}:{#accountId}:dbinstance/{#dbinstanceId}` [link](https://next.api.aliyun.com/document/Rds/2014-08-15/DescribeDBInstances)
- `rds:ListTagResources` `acs:rds:{#regionId}:{#accountId}:dbinstance/{#dbinstanceId}` [link](https://next.api.aliyun.com/document/Rds/2014-08-15/ListTagResources)
- polardb: `acs:DescribeDBClusters` `acs:polardb:*:*:dbcluster` [link](https://help.aliyun.com/document_detail/118034.html?spm=a2c4g.98094.0.0)
- kvstore: `DescribeInstances` `acs:kvstore:$regionid:$accountid:instance/$instanceid` [link](https://help.aliyun.com/apsara/enterprise/v_3_18_0/kvstore/enterprise-developer-guide/api-authentication-rules.html)
- ecs: `DescribeInstances` `acs:ecs:$regionid:$accountid:instance/*` [link](https://help.aliyun.com/document_detail/25497.html?spm=a2c4g.25506.0.0)
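
As an illustration of how the RDS entries above might be combined into a single RAM policy document, here is a minimal sketch. The helper `build_policy` is hypothetical, and the wildcard resource `acs:rds:*:*:dbinstance/*` is an assumption that stands in for the region, account, and instance placeholders; narrow it to match your own resources.

```python
import json


def build_policy(actions, resource):
    """Build a one-statement Allow policy in the RAM policy document layout."""
    return {
        "Version": "1",
        "Statement": [
            {
                "Effect": "Allow",
                "Action": list(actions),
                "Resource": resource,
            }
        ],
    }


# Actions taken from the rds entries above; the resource pattern is an assumption.
rds_policy = build_policy(
    ["rds:DescribeDBInstances", "rds:ListTagResources"],
    "acs:rds:*:*:dbinstance/*",
)
print(json.dumps(rds_policy, indent=2))
```

Only the RDS actions are used here because they are the ones written with an explicit service prefix in the list.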

```toml
# # Categraf collection interval. Alibaba Cloud metrics usually have 60-second granularity, so a value below 60 seconds is not recommended
# interval = 60
53 changes: 53 additions & 0 deletions inputs/amd_rocm_smi/README_CN.md
@@ -0,0 +1,53 @@
# AMD ROCm System Management Interface (SMI) Input Plugin

This plugin is forked from [telegraf/amd_rocm_smi](https://github.com/influxdata/telegraf/blob/master/plugins/inputs/amd_rocm_smi).

It runs the [`rocm-smi`][1] command to collect status metrics for AMD GPUs, including memory usage, GPU utilization, and temperature.

[1]: https://github.com/RadeonOpenCompute/rocm_smi_lib/tree/master/python_smi_tools

## Configuration

```toml
# Query AMD GPU statistics using the rocm-smi command
# bin_path = "/opt/rocm/bin/rocm-smi"
# If bin_path is not set, no metrics are collected

## Optional: timeout for GPU polling
# timeout = "5s"
```
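
The two settings above amount to "skip collection when no binary is configured" and "give up if the command takes too long". The sketch below illustrates those semantics in Python; it is not the plugin's code, and `run_with_timeout` is a hypothetical helper. The Python interpreter stands in for `rocm-smi` so the example runs on machines without ROCm.

```python
import subprocess
import sys
from typing import Optional, Sequence


def run_with_timeout(bin_path: str, args: Sequence[str], timeout_s: float) -> Optional[str]:
    """Run bin_path with args and return its stdout, or None if nothing was collected."""
    if not bin_path:
        # Mirrors the documented behaviour: no bin_path, no collection.
        return None
    try:
        result = subprocess.run(
            [bin_path, *args],
            capture_output=True,
            text=True,
            timeout=timeout_s,
            check=True,
        )
    except (subprocess.TimeoutExpired, subprocess.CalledProcessError, OSError):
        # Timed out, exited non-zero, or the binary could not be started.
        return None
    return result.stdout


# Stand-in command; a real call would pass the rocm-smi path and its flags.
output = run_with_timeout(sys.executable, ["-c", "print('ok')"], 5.0)
skipped = run_with_timeout("", ["--json"], 5.0)
print(output, skipped)
```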

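## Metrics

- Measurement: `amd_rocm_smi`
  - Tags
    - `name` (card name assigned by the rocm-smi executable)
    - `gpu_id` (GPU ID as identified by rocm-smi)
    - `gpu_unique_id` (unique ID of the GPU)

  - Fields
    - `driver_version` (integer)
    - `fan_speed` (integer, fan speed percentage)
    - `memory_total` (integer, bytes, total GPU memory)
    - `memory_used` (integer, bytes, used GPU memory)
    - `memory_free` (integer, bytes, free GPU memory)
    - `temperature_sensor_edge` (float, degrees Celsius)
    - `temperature_sensor_junction` (float, junction temperature in degrees Celsius)
    - `temperature_sensor_memory` (float, memory temperature in degrees Celsius)
    - `utilization_gpu` (integer, GPU utilization percentage)
    - `utilization_memory` (integer, memory utilization percentage)
    - `clocks_current_sm` (integer, MHz)
    - `clocks_current_memory` (integer, MHz)
    - `power_draw` (float, watts)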

## Troubleshooting

If you run into problems, try running the full `rocm-smi` command manually and inspect its output.

On Linux:

```sh
rocm-smi -o -l -m -M -g -c -t -u -i -f -p -P -s -S -v --showreplaycount --showpids --showdriverversion --showmemvendor --showfwinfo --showproductname --showserial --showuniqueid --showbus --showpendingpages --showpagesinfo --showretiredpages --showunreservablepages --showmemuse --showvoltage --showtopo --showtopoweight --showtopohops --showtopotype --showtoponuma --showmeminfo all --json
```

When filing an issue on GitHub, please include the output of this command along with the ROCm version you are using.
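
When checking the command output by hand, it can help to reduce the JSON to its numeric values per card. The sketch below assumes the output is a JSON object keyed by card name whose values are objects of string fields; the sample keys and values are illustrative assumptions rather than captured output, and `parse_cards` is a hypothetical helper.

```python
import json


def parse_cards(raw: str) -> dict:
    """Map each card to {key: float} for the values that parse as numbers."""
    parsed = {}
    for card, fields in json.loads(raw).items():
        if not isinstance(fields, dict):
            continue  # ignore any top-level entry that is not a per-card object
        numeric = {}
        for key, value in fields.items():
            try:
                numeric[key] = float(value)
            except (TypeError, ValueError):
                continue  # non-numeric text such as product names
        parsed[card] = numeric
    return parsed


# Illustrative payload only; real key names depend on the rocm-smi version.
sample_output = json.dumps({
    "card0": {
        "GPU use (%)": "7",
        "Temperature (Sensor edge) (C)": "41.0",
        "Card series": "Example GPU",
    }
})

cards = parse_cards(sample_output)
print(cards)
```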
160 changes: 160 additions & 0 deletions inputs/amd_rocm_smi/dashboard.json
@@ -0,0 +1,160 @@
{
"title": "AMD ROCm SMI",
"uid": "adadd6dc",
"tags": [
"amd rocm smi"
],
"timezone": "browser",
"schemaVersion": 30,
"panels": [
{
"title": "GPU Utilization",
"type": "timeseries",
"gridPos": {
"x": 0,
"y": 0,
"w": 12,
"h": 8
},
"id": 1,
"targets": [
{
"expr": "amd_rocm_smi_utilization_gpu",
"legendFormat": "{{name}} ({{gpu_id}})",
"refId": "A"
}
]
},
{
"title": "Memory Utilization",
"type": "timeseries",
"gridPos": {
"x": 12,
"y": 0,
"w": 12,
"h": 8
},
"id": 2,
"targets": [
{
"expr": "amd_rocm_smi_utilization_memory",
"legendFormat": "{{name}} ({{gpu_id}})",
"refId": "A"
}
]
},
{
"title": "Memory Used (Bytes)",
"type": "timeseries",
"gridPos": {
"x": 16,
"y": 16,
"w": 8,
"h": 8
},
"id": 3,
"targets": [
{
"expr": "amd_rocm_smi_memory_used",
"legendFormat": "{{name}} ({{gpu_id}})",
"refId": "A"
}
]
},
{
"title": "Memory Free (Bytes)",
"type": "timeseries",
"gridPos": {
"x": 0,
"y": 8,
"w": 8,
"h": 8
},
"id": 4,
"targets": [
{
"expr": "amd_rocm_smi_memory_free",
"legendFormat": "{{name}} ({{gpu_id}})",
"refId": "A"
}
]
},
{
"title": "Power Draw (W)",
"type": "timeseries",
"gridPos": {
"x": 8,
"y": 8,
"w": 8,
"h": 8
},
"id": 5,
"targets": [
{
"expr": "amd_rocm_smi_power_draw",
"legendFormat": "{{name}} ({{gpu_id}})",
"refId": "A"
}
]
},
{
"title": "Temperature (Edge)",
"type": "timeseries",
"gridPos": {
"x": 16,
"y": 8,
"w": 8,
"h": 8
},
"id": 6,
"targets": [
{
"expr": "amd_rocm_smi_temperature_sensor_edge",
"legendFormat": "{{name}} ({{gpu_id}})",
"refId": "A"
}
]
},
{
"title": "Temperature (Junction)",
"type": "timeseries",
"gridPos": {
"x": 0,
"y": 16,
"w": 8,
"h": 8
},
"id": 7,
"targets": [
{
"expr": "amd_rocm_smi_temperature_sensor_junction",
"legendFormat": "{{name}} ({{gpu_id}})",
"refId": "A"
}
]
},
{
"title": "Fan Speed (%)",
"type": "timeseries",
"gridPos": {
"x": 8,
"y": 16,
"w": 8,
"h": 8
},
"id": 8,
"targets": [
{
"expr": "amd_rocm_smi_fan_speed",
"legendFormat": "{{name}} ({{gpu_id}})",
"refId": "A"
}
]
}
],
"refresh": "1m",
"time": {
"from": "now-1h",
"to": "now"
}
}