Is my disk failing?


I'm troubleshooting a Debian Buster system that occasionally becomes unresponsive. While looking through dmesg, I noticed some worrying messages:

[Wed Apr 19 19:39:47 2023] ata1.00: exception Emask 0x10 SAct 0x0 SErr 0x4050000 action 0xe frozen                                   
[Wed Apr 19 19:39:47 2023] ata1.00: irq_stat 0x00000040, connection status changed                                                   
[Wed Apr 19 19:39:47 2023] ata1: SError: { PHYRdyChg CommWake DevExch }                                                              
[Wed Apr 19 19:39:47 2023] ata1.00: failed command: WRITE DMA EXT
[Wed Apr 19 19:39:47 2023] ata1.00: cmd 35/00:18:68:02:96/00:00:1d:00:00/e0 tag 19 dma 12288 out                                     
                                    res 50/00:00:00:00:00/00:00:00:00:00/a0 Emask 0x10 (ATA bus error)                               
[Wed Apr 19 19:39:47 2023] ata1.00: status: { DRDY }
[Wed Apr 19 19:39:47 2023] ata1: hard resetting link
[Wed Apr 19 19:39:48 2023] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 310)                                                    
[Wed Apr 19 19:39:48 2023] ata1.00: supports DRM functions and may not be fully accessible                                           
[Wed Apr 19 19:39:48 2023] ata1.00: supports DRM functions and may not be fully accessible                                           
[Wed Apr 19 19:39:48 2023] ata1.00: configured for UDMA/33
[Wed Apr 19 19:39:48 2023] ata1: EH complete
[Wed Apr 19 19:39:48 2023] ata1.00: Enabling discard_zeroes_data

These messages appear repeatedly and indicate that the SATA link has to be reset every few minutes.
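To put a number on "every few minutes", here is a rough Python sketch that tallies the resets per port and the gaps between them. It assumes the log was captured with human-readable timestamps in the format shown above (as `dmesg -T` prints them) and an English locale; the function names `reset_times` and `reset_intervals` are my own, not part of any tool.

```python
import re
from datetime import datetime

# Matches lines like "[Wed Apr 19 19:39:47 2023] ata1: hard resetting link".
# Assumption: timestamps are in the human-readable form shown in the log above.
RESET_RE = re.compile(
    r"^\[(?P<ts>[^\]]+)\]\s+(?P<port>ata\d+): hard resetting link"
)

def reset_times(lines):
    """Return {port: [datetime, ...]} for every hard link reset found."""
    resets = {}
    for line in lines:
        m = RESET_RE.match(line.strip())
        if m:
            ts = datetime.strptime(m.group("ts"), "%a %b %d %H:%M:%S %Y")
            resets.setdefault(m.group("port"), []).append(ts)
    return resets

def reset_intervals(times):
    """Seconds between consecutive resets on a single port."""
    return [(b - a).total_seconds() for a, b in zip(times, times[1:])]
```

Feeding it the lines of a saved log and calling `reset_intervals` on each port's list should show whether the resets are regular or clustered around particular workloads.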

I ran an extended SMART self-test on /dev/sda, and it did not find any errors (full log):

smartctl 6.6 2017-11-05 r4594 [x86_64-linux-5.9.0-0.bpo.5-amd64] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     Samsung SSD 860 PRO 512GB
Serial Number:    S5HTNE0N107136V
LU WWN Device Id: 5 002538 e2014235a
Firmware Version: RVM02B6Q
User Capacity:    512,110,190,592 bytes [512 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ACS-4 T13/BSR INCITS 529 revision 5
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is:    Thu Apr 20 08:10:54 2023 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Unavailable
Rd look-ahead is: Enabled
Write cache is:   Enabled
DSN feature is:   Unavailable
ATA Security is:  Disabled, frozen [SEC2]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
[...]

SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  5 Reallocated_Sector_Ct   PO--CK   100   100   010    -    0
  9 Power_On_Hours          -O--CK   096   096   000    -    16420
 12 Power_Cycle_Count       -O--CK   099   099   000    -    372
177 Wear_Leveling_Count     PO--C-   099   099   000    -    17
179 Used_Rsvd_Blk_Cnt_Tot   PO--C-   100   100   010    -    0
181 Program_Fail_Cnt_Total  -O--CK   100   100   010    -    0
182 Erase_Fail_Count_Total  -O--CK   100   100   010    -    0
183 Runtime_Bad_Block       PO--C-   100   100   010    -    0
187 Reported_Uncorrect      -O--CK   100   100   000    -    0
190 Airflow_Temperature_Cel -O--CK   079   045   000    -    21
195 Hardware_ECC_Recovered  -O-RC-   200   200   000    -    0
199 UDMA_CRC_Error_Count    -OSRCK   100   100   000    -    0
235 Unknown_Attribute       -O--C-   099   099   000    -    230
241 Total_LBAs_Written      -O--CK   099   099   000    -    9965553603

[...]

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     16411         -
# 2  Short offline       Completed without error       00%     16406         -
# 3  Short offline       Completed without error       00%     16405         -
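In case it helps anyone compare readings over time, this is a small Python sketch for pulling the raw values out of the attribute table above. It assumes the brief table layout shown here (a six-character FLAGS column); `parse_attributes` is a hypothetical helper name of mine. As far as I understand, attribute 199 (UDMA_CRC_Error_Count) is the one usually associated with link or cable trouble, but I'm not certain this drive reports it reliably.

```python
import re

# One row of the attribute table, e.g.
# "199 UDMA_CRC_Error_Count    -OSRCK   100   100   000    -    0"
# Assumption: the brief layout with a six-character FLAGS column, as pasted above.
ATTR_RE = re.compile(
    r"^\s*(?P<id>\d+)\s+(?P<name>\S+)\s+(?P<flags>[A-Z-]{6})\s+"
    r"(?P<value>\d+)\s+(?P<worst>\d+)\s+(?P<thresh>\d+)\s+"
    r"(?P<fail>\S+)\s+(?P<raw>\d+)"
)

def parse_attributes(text):
    """Return {attribute_name: raw_value} for every row that matches."""
    attrs = {}
    for line in text.splitlines():
        m = ATTR_RE.match(line)
        if m:
            attrs[m.group("name")] = int(m.group("raw"))
    return attrs

# Attributes I would watch for changes between runs (my own selection).
WATCHED = ("Reallocated_Sector_Ct", "Reported_Uncorrect", "UDMA_CRC_Error_Count")

def nonzero_watched(attrs):
    """Return the watched attributes whose raw value is not zero."""
    return {k: v for k, v in attrs.items() if k in WATCHED and v != 0}
```

On the table above, all three watched attributes have a raw value of 0, so `nonzero_watched` would return an empty dict.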

I don't think this is a filesystem bug, but I tried the kernel command-line option fsck.mode=force anyway. It doesn't actually seem to check anything on the disk other than the EFI partition.

Does this point to a particular failure mode, such as a failing disk, a bad connection, or filesystem corruption?
