Hardware error from APEI Generic Hardware Error Source: 5

This problem occurs on a Dell XR12. I have an Intel ACC100 accelerator card installed, but I don't understand this error. What is happening here? Any help would be appreciated!

[Thu Sep  7 08:43:27 2023] loop10: detected capacity change from 0 to 8
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 5
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]: event severity: recoverable
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:  Error 0, type: fatal
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   section_type: PCIe error
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   port_type: 4, root port
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   version: 3.0
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   command: 0x0547, status: 0x4010
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   device_id: 0000:50:02.0
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   slot: 2
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   secondary_bus: 0x51
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   vendor_id: 0x8086, device_id: 0x347a
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   class_code: 060400
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   bridge: secondary_status: 0x2000, control: 0x0003
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   aer_uncor_status: 0x00000020, aer_uncor_mask: 0x01310000
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   aer_uncor_severity: 0x044ef030
[Thu Sep  7 08:44:23 2023] {1}[Hardware Error]:   TLP Header: ffffffff ffffffff ffffffff ffffffff
[Thu Sep  7 08:44:23 2023] pcieport 0000:50:02.0: AER: aer_status: 0x00000020, aer_mask: 0x01310000
[Thu Sep  7 08:44:23 2023] pcieport 0000:50:02.0:    [ 5] SDES                   (First)
[Thu Sep  7 08:44:23 2023] pcieport 0000:50:02.0: AER: aer_layer=Transaction Layer, aer_agent=Receiver ID
[Thu Sep  7 08:44:23 2023] pcieport 0000:50:02.0: AER: aer_uncor_severity: 0x044ef030
[Thu Sep  7 08:44:24 2023] pcieport 0000:50:02.0: AER: Root Port link has been reset (0)
[Thu Sep  7 08:44:24 2023] pcieport 0000:50:02.0: AER: device recovery successful
[Thu Sep  7 08:44:24 2023] vfio-pci 0000:51:00.0: vfio_ecap_init: hiding ecap 0x19@0x248
[Thu Sep  7 08:44:37 2023] pci 0000:52:00.0: [8086:0d5d] type 00 class 0x120001
[Thu Sep  7 08:44:37 2023] pci 0000:52:00.0: Adding to iommu group 96
[Thu Sep  7 08:44:37 2023] vfio-pci 0000:51:00.0: Captured SR-IOV VF 0000:52:00.0 driver_override
[Thu Sep  7 08:44:37 2023] pci 0000:52:00.1: [8086:0d5d] type 00 class 0x120001
[Thu Sep  7 08:44:37 2023] pci 0000:52:00.1: Adding to iommu group 97
[Thu Sep  7 08:44:37 2023] vfio-pci 0000:51:00.0: Captured SR-IOV VF 0000:52:00.1 driver_override
[Thu Sep  7 08:44:50 2023] vfio-pci 0000:52:00.0: enabling device (0000 -> 0002)
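
To read the AER fields in the log, it can help to decode the bitmasks by hand. The following is a minimal sketch, not a kernel tool: the helper name `decode_aer_uncor` is hypothetical, and the bit table is a partial copy of the abbreviations the Linux AER driver prints for the PCIe Uncorrectable Error Status register. With the values from the log, the only bit set in `aer_uncor_status` is bit 5, which the kernel labels `SDES` (Surprise Down Error), and that bit is also set in `aer_uncor_severity`, which is consistent with the `type: fatal` line.

```python
# Partial map of PCIe AER Uncorrectable Error Status bits to the short names
# the Linux AER driver prints (assumed from the kernel's string table).
AER_UNCOR_BITS = {
    4: "DLP",               # Data Link Protocol Error
    5: "SDES",              # Surprise Down Error
    12: "TLP",              # Poisoned TLP
    13: "FCP",              # Flow Control Protocol Error
    14: "CmpltTO",          # Completion Timeout
    15: "CmpltAbrt",        # Completer Abort
    16: "UnxCmplt",         # Unexpected Completion
    17: "RxOF",             # Receiver Overflow
    18: "MalfTLP",          # Malformed TLP
    19: "ECRC",             # ECRC Error
    20: "UnsupReq",         # Unsupported Request
    21: "ACSViol",          # ACS Violation
    22: "UncorrIntErr",     # Uncorrectable Internal Error
    24: "AtomicOpBlocked",  # AtomicOp Egress Blocked
}


def decode_aer_uncor(value):
    """Return the names of all bits set in a 32-bit AER register value."""
    names = []
    for bit in range(32):
        if value & (1 << bit):
            # Fall back to a generic label for bits not in the partial table.
            names.append(AER_UNCOR_BITS.get(bit, f"bit{bit}"))
    return names


# Values copied from the log above.
status = 0x00000020
mask = 0x01310000
severity = 0x044EF030

reported = decode_aer_uncor(status)
masked = decode_aer_uncor(mask)
# An error is treated as fatal when its bit is also set in the severity register.
fatal = decode_aer_uncor(status & severity)

print("reported:", reported)
print("masked:  ", masked)
print("fatal:   ", fatal)
```

A Surprise Down Error means the root port at 0000:50:02.0 saw its downstream link drop unexpectedly. In this log the next lines show the link being reset and vfio-pci initializing the device at 0000:51:00.0 and its SR-IOV VFs, so the timing may be related to the device being reset or rebound, though the log alone does not confirm the cause.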
