I'm having a problem that causes my display to go to sleep and all keyboard input to stop working. Audio keeps playing, but only for 30 seconds to a minute. It happens seemingly at random while I'm running a game. I've noticed it most with 'Timberborn' (not a very graphically intense game at first glance), but that is probably only because it is what I play most often. I'm fully willing to admit this is a hardware problem, likely with my power supply or simply overall dust buildup, but some second opinions would be nice before I start tearing my machine apart.
I would update your system and BIOS. A recent mesa update fixed problems that the previous version caused on some AMD GPUs, so I would run garuda-update.
As for the BIOS, it looks like there are some critical versions released. I would update to at least version F62 as that has fixes for "major vulnerabilities" and the motherboard site states "customers are strongly encouraged to update to this release...". I've seen outdated BIOS cause all kinds of odd issues when running games, so there is a good chance this could be your issue.
Random freezes could mean that there is a heat problem.
Start the games from a terminal to see any error messages, then open a second terminal and watch btop to see what else could be the problem.
Check also
I assumed it was a heat problem but it's steadily getting worse even after turning my fan speeds up. It happens while just browsing the web sometimes now. My power supply is kind of ancient so that could be the issue.
Jun 03 00:46:32 Kevin kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=28231, emitted seq=28232
Jun 03 00:46:32 Kevin kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process pid 0 thread pid 0
Jun 03 00:46:32 Kevin kernel: amdgpu 0000:06:00.0: amdgpu: GPU reset begin!
Jun 03 00:46:32 Kevin kernel: amdgpu 0000:06:00.0: amdgpu:
Jun 03 00:46:49 Kevin kernel: GpuWatchdog[42283]: segfault at 0 ip 00007f65e4f92336 sp 00007f65d9ffd4f0 error 6 in libcef.so[7f65e0aef000+776f000] likely on CPU 5 (core 1, socket 0)
Jun 03 00:46:52 Kevin kernel: [drm:atom_op_jump [amdgpu]] *ERROR* atombios stuck in loop for more than 20secs aborting
Jun 03 00:46:52 Kevin kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing DA4C (len 824, WS 0, PS 0) @ 0xDBCC
Jun 03 00:46:52 Kevin kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing D906 (len 326, WS 0, PS 0) @ 0xD9F6
Jun 03 00:46:52 Kevin kernel: [drm:dce110_link_encoder_disable_output [amdgpu]] *ERROR* dce110_link_encoder_disable_output: Failed to execute VBIOS command table!
Jun 03 00:47:12 Kevin kernel: [drm:atom_op_jump [amdgpu]] *ERROR* atombios stuck in loop for more than 20secs aborting
Jun 03 00:47:12 Kevin kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing C4AC (len 62, WS 0, PS 0) @ 0xC4C8
Jun 03 00:50:43 Kevin kernel: RAS: Correctable Errors collector initialized.
Maybe you could try changing the kernel.
I'd start with linux-lts.
I also noticed that your system seems to be in a partial-upgrade state. This is not necessarily the cause of your issue, but it should be addressed.
If you are not holding any package back on purpose, a
I have tried searching for your errors online and found
This seems like a hung process, with the GPU reset beginning after it, though the unusual thing is that it reports pid 0 as being at fault.
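To compare the timeouts across both journal excerpts, the ring name and sequence numbers can be pulled out of each line. This is a minimal sketch: the regular expression is written against the line format shown in this thread only, the function name `parse_ring_timeout` is made up for illustration, and reading the gap between emitted and signaled as "jobs submitted but not yet completed" is my assumption rather than something the log states.

```python
import re

# Pattern for amdgpu ring-timeout lines as they appear in the journal excerpts in this thread.
RING_TIMEOUT = re.compile(
    r"ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)


def parse_ring_timeout(line):
    """Return (ring, signaled, emitted, pending), or None if the line is not a ring timeout."""
    match = RING_TIMEOUT.search(line)
    if match is None:
        return None
    signaled = int(match.group("signaled"))
    emitted = int(match.group("emitted"))
    # Assumption: emitted - signaled approximates the number of jobs still outstanding.
    return match.group("ring"), signaled, emitted, emitted - signaled


sample = (
    "Jun 03 00:46:32 Kevin kernel: [drm:amdgpu_job_timedout [amdgpu]] "
    "*ERROR* ring sdma1 timeout, signaled seq=28231, emitted seq=28232"
)
print(parse_ring_timeout(sample))  # ('sdma1', 28231, 28232, 1)
```

Feeding it every line of a saved journal would show whether the same ring keeps timing out or whether it moves between sdma0 and sdma1.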
This segfault may point to a kernel or driver bug: error 6 means the fault was a user-mode write to an address for which no page was found (here address 0, inside libcef.so).
That is according to this tool: Raphael's blog: Segmentation fault error decoder
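For anyone who wants to decode the number by hand, the "error N" value in a kernel segfault line on x86 is a bit field. The sketch below decodes the five low bits as I understand them from the x86 page-fault error code layout. The function name `decode_segfault_error` is made up for illustration and is not taken from the linked tool.

```python
def decode_segfault_error(code):
    """Decode the low bits of an x86 page-fault error code from a kernel segfault line."""
    return {
        # Bit 0: 0 = no page found at the address, 1 = protection violation.
        "cause": "protection violation" if code & 0x1 else "page not present",
        # Bit 1: 0 = read access, 1 = write access.
        "access": "write" if code & 0x2 else "read",
        # Bit 2: 0 = fault raised in kernel mode, 1 = in user mode.
        "mode": "user" if code & 0x4 else "kernel",
        # Bit 3: a reserved bit was set in a page-table entry.
        "reserved_bit_set": bool(code & 0x8),
        # Bit 4: the fault happened during an instruction fetch.
        "instruction_fetch": bool(code & 0x10),
    }


print(decode_segfault_error(6))
# {'cause': 'page not present', 'access': 'write', 'mode': 'user',
#  'reserved_bit_set': False, 'instruction_fetch': False}
```

For error 6 this gives a user-mode write to a page that is not present, which matches the reading above.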
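This last part seems to be caused by runtime PM support, which can be turned off by adding radeon.runpm=0 to the kernel parameters in the Garuda boot options. Since your logs show the amdgpu driver in use, the equivalent amdgpu.runpm=0 is probably the one that will actually take effect.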
Overall, what I mean to say is: try switching kernels as filo says, which should resolve most if not all of the issues you see in the journal log. If just changing the kernel doesn't help and the atombios error is still in your logs afterwards, try adding the kernel parameters mentioned above.
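After editing the boot options and rebooting, it is worth confirming that the parameter actually reached the kernel. On Linux the running kernel's command line is exposed in /proc/cmdline. The sketch below parses such a string; the helper names and the example command line are hypothetical, and parameters with quoted values that contain spaces are not handled.

```python
def parse_cmdline(cmdline):
    """Split a kernel command line into a dict; flags without '=' map to None."""
    params = {}
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        params[key] = value if sep else None
    return params


def read_cmdline(path="/proc/cmdline"):
    """Read the command line of the running kernel (Linux only)."""
    with open(path) as handle:
        return handle.read().strip()


# Hypothetical example string; on a real system use read_cmdline() instead.
example = "BOOT_IMAGE=/vmlinuz-linux-lts root=UUID=hypothetical rw quiet radeon.runpm=0"
params = parse_cmdline(example)
print(params.get("radeon.runpm"))  # 0
print("amdgpu.runpm" in params)    # False
```

If the parameter is missing from the output on the real machine, the boot entry was not regenerated or a different entry was booted.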
Using linux-lts and adding the kernel parameters didn't work. I got either the same or nearly the same error:
Jun 07 22:50:02 Kevin kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=2630, emitted seq=2632
Jun 07 22:50:02 Kevin kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process pid 0 thread pid 0
Jun 07 22:50:22 Kevin kernel: [drm:atom_op_jump [amdgpu]] *ERROR* atombios stuck in loop for more than 20secs aborting
Jun 07 22:50:22 Kevin kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing DA4C (len 824, WS 0, PS 0) @ 0xDBCC
Jun 07 22:50:22 Kevin kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing D906 (len 326, WS 0, PS 0) @ 0xD9F6
Jun 07 22:50:22 Kevin kernel: [drm:dce110_link_encoder_disable_output [amdgpu]] *ERROR* dce110_link_encoder_disable_output: Failed to execute VBIOS command table!
Jun 07 22:50:42 Kevin kernel: [drm:atom_op_jump [amdgpu]] *ERROR* atombios stuck in loop for more than 20secs aborting
Jun 07 22:50:42 Kevin kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing C4AC (len 62, WS 0, PS 0) @ 0xC4C8
Jun 07 22:51:02 Kevin kernel: [drm:atom_op_jump [amdgpu]] *ERROR* atombios stuck in loop for more than 20secs aborting
Jun 07 22:51:02 Kevin kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing B3B2 (len 1227, WS 8, PS 8) @ 0xB63A
Jun 07 22:51:02 Kevin kernel: [drm:amdgpu_device_ip_suspend_phase2 [amdgpu]] *ERROR* suspend of IP block <vce_v3_0> failed -110
Jun 07 22:51:03 Kevin kernel: [drm:amdgpu_device_ip_suspend_phase2 [amdgpu]] *ERROR* suspend of IP block <powerplay> failed -22
Jun 07 22:51:03 Kevin kernel: amdgpu 0000:06:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
Jun 07 22:51:03 Kevin kernel: [drm:gfx_v8_0_hw_fini [amdgpu]] *ERROR* KCQ disable failed
Jun 07 22:52:15 Kevin kernel: [drm:atom_op_jump [amdgpu]] *ERROR* atombios stuck in loop for more than 20secs aborting
Jun 07 22:52:15 Kevin kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing C4AC (len 62, WS 0, PS 0) @ 0xC4C8
Jun 07 22:52:15 Kevin kernel: [drm:amdgpu_atom_execute_table_locked [amdgpu]] *ERROR* atombios stuck executing AD42 (len 126, WS 0, PS 8) @ 0xAD5D
Jun 07 22:52:15 Kevin kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -22
Jun 07 22:52:25 Kevin kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=2962, emitted seq=2964
Jun 07 22:52:25 Kevin kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process pid 0 thread pid 0