Linux의 i7-3612QE 프로세서에서 RAM ECC 오류를 모니터링하는 방법은 무엇입니까?

Linux의 i7-3612QE 프로세서에서 RAM ECC 오류를 모니터링하는 방법은 무엇입니까?

다음 메모리 컨트롤러를 사용하여 산업용 단일 보드 컴퓨터에서 DRAM의 상태를 모니터링하고 싶습니다.

$ lspci -nn | grep DRAM
00:00.0 Host bridge [0600]: Intel Corporation 3rd Gen Core processor DRAM Controller [8086:0154] (rev 09)

나는 Debian 10을 실행해 보았습니다.

$ modprobe ie31200_edac
$ ras-mc-ctl --status
ras-mc-ctl: drivers are loaded.
$ ras-mc-ctl --layout
ras-mc-ctl: Error: No memories found at via edac.
$ ls /sys/devices/system/edac/mc/
power  subsystem  uevent

나는 메모리 레이아웃과 mc0하위 디렉토리의 유용한 출력을 기대하고 있습니다(이것은 동일한 공급업체의 4세대 단일 보드 컴퓨터에서 잘 작동합니다). 올바른 커널 모듈을 로드했습니까?


메모리 세부정보:

$ sudo dmidecode --type=memory
# dmidecode 3.2
Getting SMBIOS data from sysfs.
SMBIOS 2.7 present.

Handle 0x0005, DMI type 5, 20 bytes
Memory Controller Information
        Error Detecting Method: 64-bit ECC
        Error Correcting Capabilities:
                Single-bit Error Correcting
        Supported Interleave: One-way Interleave
        Current Interleave: One-way Interleave
        Maximum Memory Module Size: 8192 MB
        Maximum Total Memory Size: 16384 MB
        Supported Speeds:
                Other
        Supported Memory Types:
                Other
        Memory Module Voltage: Unknown
        Associated Memory Slots: 2
                0x0006
                0x0007
        Enabled Error Correcting Capabilities:
                Single-bit Error Correcting

Handle 0x0006, DMI type 6, 12 bytes
Memory Module Information
        Socket Designation: DIMM0
        Bank Connections: None
        Current Speed: Unknown
        Type: DIMM
        Installed Size: 4096 MB (Single-bank Connection)
        Enabled Size: 4096 MB (Single-bank Connection)
        Error Status: OK

Handle 0x0007, DMI type 6, 12 bytes
Memory Module Information
        Socket Designation: DIMM1
        Bank Connections: None
        Current Speed: Unknown
        Type: DIMM
        Installed Size: 4096 MB (Single-bank Connection)
        Enabled Size: 4096 MB (Single-bank Connection)
        Error Status: OK

Handle 0x002A, DMI type 16, 23 bytes
Physical Memory Array
        Location: System Board Or Motherboard
        Use: System Memory
        Error Correction Type: None
        Maximum Capacity: 8 GB
        Error Information Handle: 0x002F
        Number Of Devices: 2

Handle 0x002B, DMI type 17, 34 bytes
Memory Device
        Array Handle: 0x002A
        Error Information Handle: 0x002D
        Total Width: Unknown
        Data Width: Unknown
        Size: 4096 MB
        Form Factor: DIMM
        Set: None
        Locator: DIMM0
        Bank Locator: BANK 0
        Type: <OUT OF SPEC>
        Type Detail: None
        Speed: Unknown
        Manufacturer: Not Specified
        Serial Number: Not Specified
        Asset Tag: Unknown
        Part Number: Not Specified
        Rank: Unknown
        Configured Memory Speed: 1600 MT/s

Handle 0x002C, DMI type 17, 34 bytes
Memory Device
        Array Handle: 0x002A
        Error Information Handle: 0x002E
        Total Width: Unknown
        Data Width: Unknown
        Size: 4096 MB
        Form Factor: DIMM
        Set: None
        Locator: DIMM1
        Bank Locator: BANK 2
        Type: <OUT OF SPEC>
        Type Detail: None
        Speed: Unknown
        Manufacturer: Not Specified
        Serial Number: Not Specified
        Asset Tag: Unknown
        Part Number: Not Specified
        Rank: Unknown
        Configured Memory Speed: 1600 MT/s

관련 정보