Так, нарушен 8-дневный цикл зависаний. Опять зависло
Jan 8 18:15:38 Main systemd[1]: snapd.refresh.timer: Adding 5h 7.570688s random time.
Jan 8 18:17:01 Main CRON[549]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jan 8 18:38:21 Main smartd[929]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 65 to 64
Jan 8 18:38:21 Main smartd[929]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 35 to 36
Jan 8 18:38:22 Main smartd[929]: Device: /dev/sdb [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 90 to 92
Jan 8 19:08:21 Main smartd[929]: Device: /dev/sdb [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 92 to 70
Jan 8 19:08:21 Main smartd[929]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 36 to 37
Jan 8 19:17:02 Main CRON[937]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jan 8 19:38:21 Main smartd[929]: Device: /dev/sdb [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 70 to 90
Jan 8 19:38:21 Main smartd[929]: Device: /dev/sdb [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 37 to 36
Jan 8 20:08:21 Main smartd[929]: Device: /dev/sda [SAT], SMART Usage Attribute: 190 Airflow_Temperature_Cel changed from 64 to 65
Jan 8 20:08:21 Main smartd[929]: Device: /dev/sda [SAT], SMART Usage Attribute: 194 Temperature_Celsius changed from 36 to 35
Jan 8 20:08:21 Main smartd[929]: Device: /dev/sdb [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 90 to 94
Jan 8 20:17:01 Main CRON[1221]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jan 8 20:24:38 Main systemd[1]: Starting Cleanup of Temporary Directories...
Jan 8 20:24:38 Main systemd-tmpfiles[1246]: [/usr/lib/tmpfiles.d/var.conf:14] Duplicate line for path "/var/log", ignoring.
Jan 8 20:24:38 Main systemd[1]: Started Cleanup of Temporary Directories.
Jan 8 20:28:45 Main kernel: [260429.703963] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03bab714
Jan 8 20:28:45 Main kernel: [260429.703966] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00129277
Jan 8 20:28:45 Main kernel: [260429.703967] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090B7014
Jan 8 20:28:45 Main kernel: [260429.703968] VM fault (0x14, vmid 4) at page 1217143, write from 'SDM0' (0x53444d30) (183)
Jan 8 20:28:45 Main kernel: [260429.703970] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03bab714
Jan 8 20:28:45 Main kernel: [260429.703971] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00129278
Jan 8 20:28:45 Main kernel: [260429.703972] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090B7014
Jan 8 20:28:45 Main kernel: [260429.703972] VM fault (0x14, vmid 4) at page 1217144, write from 'SDM0' (0x53444d30) (183)
Jan 8 20:28:45 Main kernel: [260429.703974] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03bab714
Jan 8 20:28:45 Main kernel: [260429.703975] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0012927B
Jan 8 20:28:45 Main kernel: [260429.703976] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090B7014
Jan 8 20:28:45 Main kernel: [260429.703976] VM fault (0x14, vmid 4) at page 1217147, write from 'SDM0' (0x53444d30) (183)
Jan 8 20:28:45 Main kernel: [260429.703978] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03bab714
Jan 8 20:28:45 Main kernel: [260429.703979] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0012927C
Jan 8 20:28:45 Main kernel: [260429.703980] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090B7014
Jan 8 20:28:45 Main kernel: [260429.703980] VM fault (0x14, vmid 4) at page 1217148, write from 'SDM0' (0x53444d30) (183)
Jan 8 20:28:45 Main kernel: [260429.703982] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03c2b714
Jan 8 20:28:45 Main kernel: [260429.703983] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0012927D
Jan 8 20:28:45 Main kernel: [260429.703983] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090B7014
Jan 8 20:28:45 Main kernel: [260429.703984] VM fault (0x14, vmid 4) at page 1217149, write from 'SDM0' (0x53444d30) (183)
Jan 8 20:28:45 Main kernel: [260429.703986] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03c2b714
Jan 8 20:28:45 Main kernel: [260429.703987] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0012927F
Jan 8 20:28:45 Main kernel: [260429.703987] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090B7014
Jan 8 20:28:45 Main kernel: [260429.703988] VM fault (0x14, vmid 4) at page 1217151, write from 'SDM0' (0x53444d30) (183)
Jan 8 20:28:45 Main kernel: [260429.703990] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03c2b714
Jan 8 20:28:45 Main kernel: [260429.703991] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00129281
Jan 8 20:28:45 Main kernel: [260429.703991] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090B7014
Jan 8 20:28:45 Main kernel: [260429.703992] VM fault (0x14, vmid 4) at page 1217153, write from 'SDM0' (0x53444d30) (183)
Jan 8 20:28:45 Main kernel: [260429.703994] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03c2b714
Jan 8 20:28:45 Main kernel: [260429.703995] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00129281
Jan 8 20:28:45 Main kernel: [260429.703995] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090B7014
Jan 8 20:28:45 Main kernel: [260429.703996] VM fault (0x14, vmid 4) at page 1217153, write from 'SDM0' (0x53444d30) (183)
Jan 8 20:28:45 Main kernel: [260429.704001] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03cab714
Jan 8 20:28:45 Main kernel: [260429.704002] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00000000
Jan 8 20:28:45 Main kernel: [260429.704002] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090B7014
Jan 8 20:28:45 Main kernel: [260429.704003] VM fault (0x14, vmid 4) at page 0, write from 'SDM0' (0x53444d30) (183)
Jan 8 20:28:45 Main kernel: [260429.704005] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03cab714
Jan 8 20:28:45 Main kernel: [260429.704006] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00129282
Jan 8 20:28:45 Main kernel: [260429.704006] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090B7014
Jan 8 20:28:45 Main kernel: [260429.704007] VM fault (0x14, vmid 4) at page 1217154, write from 'SDM0' (0x53444d30) (183)
Jan 8 20:28:45 Main kernel: [260429.704009] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03cab714
Jan 8 20:28:45 Main kernel: [260429.704010] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00129278
Jan 8 20:28:45 Main kernel: [260429.704010] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090B7014
Jan 8 20:28:45 Main kernel: [260429.704011] VM fault (0x14, vmid 4) at page 1217144, write from 'SDM0' (0x53444d30) (183)
Jan 8 20:28:45 Main kernel: [260429.704013] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03d2b714
Jan 8 20:28:45 Main kernel: [260429.704014] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x00129280
Jan 8 20:28:45 Main kernel: [260429.704014] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x090B7014
Jan 8 20:28:45 Main kernel: [260429.704015] VM fault (0x14, vmid 4) at page 1217152, write from 'SDM0' (0x53444d30) (183)
Jan 8 20:28:45 Main kernel: [260429.704317] amdgpu 0000:01:00.0: GPU fault detected: 146 0x03f0c40c
Jan 8 20:28:45 Main kernel: [260429.704319] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_ADDR 0x0012927E
Jan 8 20:28:45 Main kernel: [260429.704319] amdgpu 0000:01:00.0: VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E0C400C
Jan 8 20:28:45 Main kernel: [260429.704320] VM fault (0x0c, vmid 7) at page 1217150, read from 'TC4' (0x54433400) (196)
Jan 8 20:37:28 Main systemd[1]: Starting Daily apt activities...
Jan 8 20:37:28 Main systemd[1]: Started Daily apt activities.
Jan 8 20:37:28 Main systemd[1]: apt-daily.timer: Adding 5h 22min 21.622996s random time.
Jan 8 20:37:28 Main systemd[1]: apt-daily.timer: Adding 9h 54min 27.804107s random time.
Jan 8 20:38:21 Main smartd[929]: Device: /dev/sdb [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 94 to 82
Jan 8 21:08:22 Main smartd[929]: Device: /dev/sdb [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 82 to 90
Jan 8 21:08:28 Main evolution-sourc[2489]: secret_service_search_sync: must specify at least one attribute to match
Jan 8 21:17:01 Main CRON[1542]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jan 8 21:38:21 Main smartd[929]: Device: /dev/sdb [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 90 to 94
Jan 8 22:08:21 Main smartd[929]: Device: /dev/sdb [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 94 to 84
Jan 8 22:17:01 Main CRON[1891]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jan 8 22:38:21 Main smartd[929]: Device: /dev/sdb [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 84 to 90
Jan 8 23:08:21 Main smartd[929]: Device: /dev/sdb [SAT], SMART Prefailure Attribute: 1 Raw_Read_Error_Rate changed from 90 to 94
Jan 8 23:17:01 Main CRON[2200]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)
Jan 8 23:23:01 Main kernel: [270886.259789] [drm:amdgpu_atombios_dp_link_train [amdgpu]] *ERROR* clock recovery reached max voltage
Jan 8 23:23:01 Main kernel: [270886.259798] [drm:amdgpu_atombios_dp_link_train [amdgpu]] *ERROR* clock recovery failed
Jan 8 23:23:01 Main kernel: [270886.267590] [drm:amdgpu_atombios_dp_link_train [amdgpu]] *ERROR* clock recovery reached max voltage
Jan 8 23:23:01 Main kernel: [270886.267598] [drm:amdgpu_atombios_dp_link_train [amdgpu]] *ERROR* clock recovery failed
Блин да что же это за х такая
p.s. Нет это не TRIM. Надо последовательно разобраться с amdgpu, xorg, dri и тп.
Для начала, как можно протестировать видеокарту? Надо получить максимум нужной инфы по ней чтобы посмотреть с чем имеем дело.