Collectd를 실행하는 서버가 4개 있습니다.
PVE
(Proxmox VE) 및 AP1
(OpenWRT가 포함된 Zyxel AP) 서버는 collectd
쓰기 작업을 실행 ROUTER
하고 라우터는 자체 메트릭을 가져와 프록시 역할을 하며 collectd
이를 모두SERVER
PVE와 AP1 구성은 동일합니다.
Hostname "pve or ap1"
FQDNLookup false
Interval 5
LoadPlugin network
LoadPlugin cpu
LoadPlugin memory
LoadPlugin uptime
<Plugin "network">
<Server "ip-of-router" "25826">
ResolveInterval 30
</Server>
ReportStats false
</Plugin>
<Plugin "cpu">
ReportByState true
ReportByCpu false
ValuesPercentage false
ReportNumCpu true
</Plugin>
라우터는 다음과 같이 구성됩니다.
Hostname "router"
FQDNLookup false
Interval 5
LoadPlugin network
LoadPlugin cpu
LoadPlugin memory
LoadPlugin interface
LoadPlugin uptime
<Plugin "network">
<Listen "0.0.0.0" "25826">
</Listen>
<Server "ip-of-collectd-server" "25826">
Interface wgcli_hub
ResolveInterval 30
</Server>
Forward true
ReportStats true
</Plugin>
<Plugin "cpu">
ReportByState true
ReportByCpu false
ValuesPercentage false
ReportNumCpu true
</Plugin>
<Plugin "interface">
Interface eth1
Interface eth2
IgnoreSelected false
ReportInactive true
</Plugin>
데이터는 이런 식으로 잘 흐르지만 router
로그 에는 다음과 같은 Value too old
오류가 표시됩니다.pve
ap1
Mon Mar 13 19:44:04 2023 daemon.err collectd[2616]: uc_update: Value too old: name = pve/cpufreq-2/cpufreq; value time = 1678725834.442; last cache update = 1678725839.442;
Mon Mar 13 19:44:59 2023 daemon.err collectd[2616]: uc_update: Value too old: name = ap1/memory/memory-buffered; value time = 1678725889.720; last cache update = 1678725894.720;
Mon Mar 13 19:44:59 2023 daemon.err collectd[2616]: uc_update: Value too old: name = ap1/cpu/percent-interrupt; value time = 1678725889.721; last cache update = 1678725894.721;
Mon Mar 13 19:45:49 2023 daemon.err collectd[2616]: uc_update: Value too old: name = pve/cpu/percent-wait; value time = 1678725944.382; last cache update = 1678725949.382;
Mon Mar 13 19:45:49 2023 daemon.err collectd[2616]: uc_update: Value too old: name = pve/cpu/percent-nice; value time = 1678725944.382; last cache update = 1678725949.382;
Mon Mar 13 19:46:59 2023 daemon.err collectd[2616]: uc_update: Value too old: name = ap1/cpu/percent-wait; value time = 1678726009.721; last cache update = 1678726014.721;
Mon Mar 13 19:46:59 2023 daemon.err collectd[2616]: uc_update: Value too old: name = ap1/memory/memory-slab_recl; value time = 1678726009.720; last cache update = 1678726014.720;
다시 확인했습니다. ap1\pve에 두 번째 Collectd 프로세스가 없고, 데이터를 보내는 동일한 이름의 Collectd가 없으며, 네트워크 플러그인은 한 번만 로드됩니다.
나는 그것이 시간 value time
보다 항상 5초 전이고 last cache update
5초가 Collectd의 간격이라는 것을 알았습니다.
주기적으로 발생하지만 측정항목은 매번 다릅니다(호스트에도 동일하지만 포함되지 않음).
18:30:40 <..> pve/memory/memory-free; value time = 1678721430.332; last cache update = 1678721435.332;
18:31:00 <..> pve/cpu/percent-softirq; value time = 1678721455.332; last cache update = 1678721460.332;
18:34:10 <..> pve/cpu/percent-nice; value time = 1678721645.332; last cache update = 1678721650.332;
18:34:30 <..> pve/cpu/percent-idle; value time = 1678721665.332; last cache update = 1678721670.332;
18:34:30 <..> pve/cpu/percent-wait; value time = 1678721665.332; last cache update = 1678721670.332;
18:36:15 <..> pve/memory/memory-free; value time = 1678721765.332; last cache update = 1678721770.332;
18:36:15 <..> pve/sensors-coretemp-isa-0000/temperature-temp1; value time = 1678721765.333; last cache update = 1678721770.333;
18:36:35 <..> pve/cpu/count; value time = 1678721790.332; last cache update = 1678721795.332;
18:40:05 <..> pve/memory/memory-used; value time = 1678722000.332; last cache update = 1678722005.332;
18:40:05 <..> pve/cpu/percent-idle; value time = 1678722000.332; last cache update = 1678722005.332;
18:42:30 <..> pve/memory/memory-slab_unrecl; value time = 1678722145.332; last cache update = 1678722150.332;
18:42:30 <..> pve/memory/memory-free; value time = 1678722145.332; last cache update = 1678722150.332;
18:43:00 <..> pve/memory/memory-used; value time = 1678722175.332; last cache update = 1678722180.332;
18:44:35 <..> pve/cpu/percent-steal; value time = 1678722270.332; last cache update = 1678722275.332;
18:44:50 <..> pve/memory/memory-used; value time = 1678722285.332; last cache update = 1678722290.332;
18:44:50 <..> pve/memory/memory-free; value time = 1678722285.332; last cache update = 1678722290.332;
18:46:25 <..> pve/memory/memory-slab_unrecl; value time = 1678722380.332; last cache update = 1678722385.332;
18:47:10 <..> pve/cpu/count; value time = 1678722425.332; last cache update = 1678722430.332;
18:47:15 <..> pve/cpufreq-1/cpufreq; value time = 1678722430.374; last cache update = 1678722435.374;
18:49:05 <..> pve/memory/memory-used; value time = 1678722540.332; last cache update = 1678722545.332;
18:50:40 <..> pve/memory/memory-buffered; value time = 1678722635.332; last cache update = 1678722640.332;
18:54:45 <..> pve/memory/memory-slab_recl; value time = 1678722875.332; last cache update = 1678722880.332;
19:01:05 <..> pve/cpufreq-1/cpufreq; value time = 1678723255.374; last cache update = 1678723260.374;
19:07:10 <..> pve/cpu/percent-softirq; value time = 1678723625.332; last cache update = 1678723630.332;
19:08:00 <..> pve/cpu/percent-user; value time = 1678723675.332; last cache update = 1678723680.332;
19:08:20 <..> pve/memory/memory-slab_recl; value time = 1678723695.332; last cache update = 1678723700.332;
19:08:20 <..> pve/memory/memory-cached; value time = 1678723695.332; last cache update = 1678723700.332;
19:14:00 <..> pve/uptime/uptime; value time = 1678724030.335; last cache update = 1678724035.335;
19:14:00 <..> pve/cpufreq-0/cpufreq; value time = 1678724030.354; last cache update = 1678724035.354;
19:15:50 <..> pve/uptime/uptime; value time = 1678724140.335; last cache update = 1678724145.335;
19:15:50 <..> pve/sensors-coretemp-isa-0000/temperature-temp1; value time = 1678724140.333; last cache update = 1678724145.333;
19:16:55 <..> pve/cpufreq-2/cpufreq; value time = 1678724205.394; last cache update = 1678724210.394;
19:20:05 <..> pve/cpu/percent-wait; value time = 1678724400.332; last cache update = 1678724405.332;
19:25:20 <..> pve/uptime/uptime; value time = 1678724710.335; last cache update = 1678724715.335;
19:25:20 <..> pve/cpufreq-1/cpufreq; value time = 1678724710.374; last cache update = 1678724715.374;
19:28:30 <..> pve/uptime/uptime; value time = 1678724900.335; last cache update = 1678724905.335;
19:28:30 <..> pve/cpufreq-0/cpufreq; value time = 1678724900.354; last cache update = 1678724905.354;
19:30:05 <..> pve/uptime/uptime; value time = 1678724995.335; last cache update = 1678725000.335;
19:30:55 <..> pve/cpufreq-2/cpufreq; value time = 1678725045.394; last cache update = 1678725050.394;
19:31:30 <..> pve/cpu/percent-wait; value time = 1678725085.332; last cache update = 1678725090.332;