Periodic crashes/kernel panics when waking system

Hi,

I've been using Garuda Dragonized for close to a year on my laptop, and about a moth or two ago it started crashing on me. The crashes only happen after I wake the system from sleep after e.g. closing the lid or just waiting. They don't happen every time but about 33% percent of the time. I encounter three different behaviors after trying to wake the system:

  • The screen turns on, but the system is completely frozen
  • The screen remains black, but the CAPS-Key flashes (Kernel-Panic?)
  • The screen remains black, but the power button flashes

My keyboard backlight is still controllable in all cases, and it happens with and without power supply. The Laptops also seems to get noticeably warm/hot when one of the black-screen crashes happen (could be unrelated).
My first thought was that it could be related to the Garuda power saving tweaks, but changing those changed nothing.
After some research, I found out about journald and tested that. The last errors before a Kernel panic crash were:

Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:19 laptop dbus-daemon[591]: [system] Rejected send message, 0 matched rules; type="error", sender=":1.53" (uid=1000 pid=1729 comm="/usr/bin/wireplumber") interface="(unset)>
Jun 29 10:09:23 laptop nordvpnd[1012]: 2023/06/29 10:09:23 [Error] refreshing vpn unsetting killswitch: getting api_whitelist: rule with specified name does not exist
Jun 29 10:09:28 laptop kwin_x11[1562]: kwin_core: XCB error: 152 (BadDamage), sequence: 26134, resource id: 16980215, major code: 143 (DAMAGE), minor code: 3 (Subtract)
Jun 29 10:30:40 laptop kwin_x11[1562]: kwin_core: XCB error: 152 (BadDamage), sequence: 41392, resource id: 17002714, major code: 143 (DAMAGE), minor code: 3 (Subtract)
Jun 29 10:35:54 laptop kwin_x11[1562]: kwin_core: XCB error: 152 (BadDamage), sequence: 47482, resource id: 17008245, major code: 143 (DAMAGE), minor code: 3 (Subtract)
Jun 29 10:37:20 laptop kwin_x11[1562]: kwin_core: XCB error: 152 (BadDamage), sequence: 6963, resource id: 17009998, major code: 143 (DAMAGE), minor code: 3 (Subtract)
Jun 29 10:39:10 laptop kwin_x11[1562]: kwin_core: XCB error: 152 (BadDamage), sequence: 35388, resource id: 17012410, major code: 143 (DAMAGE), minor code: 3 (Subtract)
Jun 29 11:21:36 laptop kwin_x11[1562]: kwin_core: XCB error: 152 (BadDamage), sequence: 17626, resource id: 17037393, major code: 143 (DAMAGE), minor code: 3 (Subtract)
Jun 29 11:23:35 laptop kwin_x11[1562]: kwin_core: XCB error: 152 (BadDamage), sequence: 39804, resource id: 17039246, major code: 143 (DAMAGE), minor code: 3 (Subtract)
Jun 29 11:24:29 laptop kwin_x11[1562]: kwin_core: XCB error: 152 (BadDamage), sequence: 60193, resource id: 17041031, major code: 143 (DAMAGE), minor code: 3 (Subtract)
Jun 29 11:45:32 laptop kwin_x11[1562]: kwin_core: XCB error: 152 (BadDamage), sequence: 871, resource id: 17058101, major code: 143 (DAMAGE), minor code: 3 (Subtract)
Jun 29 12:07:31 laptop kscreenlocker_greet[141297]: Qt: Session management error: networkIdsList argument is NULL
-- Boot 73fcc06fe6f94eb395500930aa0bf0ac --

That led me to believe that it could be an X11 problem, so I switched to Wayland for a week, but that also changed nothing (and broke the Dragonized theme). Now, I can only think about KDE or Qt Errors, but I couldn't find any matching cases here or on arch wiki. A friend also suggested that it could be because I encrypted my SSD in the installation process and that somehow screws with the session-management. I also thought about rolling back my system to about two months ago to see if the problem disappears, but I hoped to find an actual solution. I am using a default Garuda Dragonized (updated at least once a week) on a Lenovo IdeaPad 5 15ARE05. Full specs:

System:
  Kernel: 6.3.9-zen1-1-zen arch: x86_64 bits: 64 compiler: gcc v: 13.1.1
    parameters: BOOT_IMAGE=/@/boot/vmlinuz-linux-zen root=UUID=078bc3f2-f436-4d24-89b4-58f030c3ad11
    rw rootflags=subvol=@ quiet
    cryptdevice=UUID=e3504fcb-cee9-4616-9bb2-850f4dbb88b7:luks-e3504fcb-cee9-4616-9bb2-850f4dbb88b7
    root=/dev/mapper/luks-e3504fcb-cee9-4616-9bb2-850f4dbb88b7 quiet splash rd.udev.log_priority=3
    vt.global_cursor_default=0 resume=/dev/mapper/luks-f7b84058-1844-4a26-be37-a2f1a18ad87a
    loglevel=3 ibt=off
  Desktop: KDE Plasma v: 5.27.6 tk: Qt v: 5.15.10 wm: kwin_x11 vt: 2 dm: SDDM
    Distro: Garuda Linux base: Arch Linux
Machine:
  Type: Laptop System: LENOVO product: 81YQ v: IdeaPad 5 15ARE05 serial: <superuser required>
    Chassis: type: 10 v: IdeaPad 5 15ARE05 serial: <superuser required>
  Mobo: LENOVO model: LNVNB161216 v: SDK0J40709WIN serial: <superuser required> UEFI: LENOVO
    v: E7CN45WW date: 04/28/2022
Battery:
  ID-1: BAT0 charge: 48.1 Wh (66.2%) condition: 72.7/70.0 Wh (103.8%) volts: 15.5 min: 15.1
    model: LGC L19L4PF1 type: Li-poly serial: <filter> status: discharging cycles: 135
CPU:
  Info: model: AMD Ryzen 7 4700U with Radeon Graphics bits: 64 type: MCP arch: Zen 2 gen: 3
    level: v3 note: check built: 2020-22 process: TSMC n7 (7nm) family: 0x17 (23) model-id: 0x60 (96)
    stepping: 1 microcode: 0x8600106
  Topology: cpus: 1x cores: 8 smt: <unsupported> cache: L1: 512 KiB desc: d-8x32 KiB; i-8x32 KiB
    L2: 4 MiB desc: 8x512 KiB L3: 8 MiB desc: 2x4 MiB
  Speed (MHz): avg: 1397 high: 1400 min/max: 1400/2000 boost: enabled scaling:
    driver: acpi-cpufreq governor: powersave cores: 1: 1397 2: 1400 3: 1397 4: 1397 5: 1398 6: 1397
    7: 1397 8: 1396 bogomips: 31940
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
  Vulnerabilities: <filter>
Graphics:
  Device-1: AMD Renoir vendor: Lenovo driver: amdgpu v: kernel arch: GCN-5 code: Vega
    process: GF 14nm built: 2017-20 pcie: gen: 4 speed: 16 GT/s lanes: 16 ports: active: eDP-1
    empty: DP-1,HDMI-A-1 bus-ID: 03:00.0 chip-ID: 1002:1636 class-ID: 0300 temp: 48.0 C
  Device-2: Syntek Integrated Camera driver: uvcvideo type: USB rev: 2.0 speed: 480 Mb/s lanes: 1
    mode: 2.0 bus-ID: 1-3:3 chip-ID: 174f:244c class-ID: 0e02 serial: <filter>
  Display: x11 server: X.Org v: 21.1.8 with: Xwayland v: 23.1.2 compositor: kwin_x11 driver: X:
    loaded: amdgpu unloaded: modesetting alternate: fbdev,vesa dri: radeonsi gpu: amdgpu
    display-ID: :0 screens: 1
  Screen-1: 0 s-res: 1920x1080 s-dpi: 96 s-size: 508x285mm (20.00x11.22") s-diag: 582mm (22.93")
  Monitor-1: eDP-1 mapped: eDP model: ChiMei InnoLux 0x151e built: 2019 res: 1920x1080 hz: 60
    dpi: 142 gamma: 1.2 size: 344x193mm (13.54x7.6") diag: 394mm (15.5") ratio: 16:9 modes: 1920x1080
  API: OpenGL v: 4.6 Mesa 23.1.3 renderer: AMD Radeon Graphics (renoir LLVM 15.0.7 DRM 3.52
    6.3.9-zen1-1-zen) direct-render: Yes
Audio:
  Device-1: AMD Renoir Radeon High Definition Audio vendor: Lenovo driver: snd_hda_intel v: kernel
    pcie: gen: 4 speed: 16 GT/s lanes: 16 bus-ID: 03:00.1 chip-ID: 1002:1637 class-ID: 0403
  Device-2: AMD ACP/ACP3X/ACP6x Audio Coprocessor vendor: Lenovo driver: N/A
    alternate: snd_pci_acp3x, snd_rn_pci_acp3x, snd_pci_acp5x, snd_pci_acp6x, snd_acp_pci,
    snd_rpl_pci_acp6x, snd_pci_ps, snd_sof_amd_renoir, snd_sof_amd_rembrandt pcie: gen: 4
    speed: 16 GT/s lanes: 16 bus-ID: 03:00.5 chip-ID: 1022:15e2 class-ID: 0480
  Device-3: AMD Family 17h/19h HD Audio vendor: Lenovo driver: snd_hda_intel v: kernel pcie:
    gen: 4 speed: 16 GT/s lanes: 16 bus-ID: 03:00.6 chip-ID: 1022:15e3 class-ID: 0403
  API: ALSA v: k6.3.9-zen1-1-zen status: kernel-api with: aoss type: oss-emulator tools: N/A
  Server-1: PipeWire v: 0.3.72 status: active with: 1: pipewire-pulse status: active
    2: wireplumber status: active 3: pipewire-alsa type: plugin 4: pw-jack type: plugin
    tools: pactl,pw-cat,pw-cli,wpctl
Network:
  Device-1: Intel Wi-Fi 6 AX200 driver: iwlwifi v: kernel pcie: gen: 2 speed: 5 GT/s lanes: 1
    bus-ID: 01:00.0 chip-ID: 8086:2723
 class-ID: 0280
  IF: wlp1s0 state: up mac: <filter>
  IF-ID-1: docker0 state: down mac: <filter>
Bluetooth:
  Device-1: Intel AX200 Bluetooth driver: btusb v: 0.8 type: USB rev: 2.0 speed: 12 Mb/s lanes: 1
    mode: 1.1 bus-ID: 3-3:2 chip-ID: 8087:0029 class-ID: e001
  Report: bt-adapter ID: hci0 rfk-id: 5 state: down bt-service: enabled,running rfk-block:
    hardware: no software: yes address: <filter>
Drives:
  Local Storage: total: 476.94 GiB used: 113.76 GiB (23.9%)
  SMART Message: Unable to run smartctl. Root privileges required.
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: SK Hynix model: HFM512GDHTNI-87A0B size: 476.94 GiB
    block-size: physical: 512 B logical: 512 B speed: 31.6 Gb/s lanes: 4 tech: SSD serial: <filter>
    fw-rev: 11010C00 temp: 38.9 C scheme: GPT
Partition:
  ID-1: / raw-size: 460.15 GiB size: 460.15 GiB (100.00%) used: 113.76 GiB (24.7%) fs: btrfs
    dev: /dev/dm-0 maj-min: 254:0 mapped: luks-e3504fcb-cee9-4616-9bb2-850f4dbb88b7
  ID-2: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%) used: 720 KiB (0.2%) fs: vfat
    dev: /dev/nvme0n1p1 maj-min: 259:1
  ID-3: /home raw-size: 460.15 GiB size: 460.15 GiB (100.00%) used: 113.76 GiB (24.7%) fs: btrfs
    dev: /dev/dm-0 maj-min: 254:0 mapped: luks-e3504fcb-cee9-4616-9bb2-850f4dbb88b7
  ID-4: /var/log raw-size: 460.15 GiB size: 460.15 GiB (100.00%) used: 113.76 GiB (24.7%)
    fs: btrfs dev: /dev/dm-0 maj-min: 254:0 mapped: luks-e3504fcb-cee9-4616-9bb2-850f4dbb88b7
  ID-5: /var/tmp raw-size: 460.15 GiB size: 460.15 GiB (100.00%) used: 113.76 GiB (24.7%)
    fs: btrfs dev: /dev/dm-0 maj-min: 254:0 mapped: luks-e3504fcb-cee9-4616-9bb2-850f4dbb88b7
Swap:
  Kernel: swappiness: 133 (default 60) cache-pressure: 100 (default)
  ID-1: swap-1 type: zram size: 14.99 GiB used: 4.4 MiB (0.0%) priority: 100 dev: /dev/zram0
  ID-2: swap-2 type: partition size: 16.49 GiB used: 0 KiB (0.0%) priority: -2 dev: /dev/dm-1
    maj-min: 254:1 mapped: luks-f7b84058-1844-4a26-be37-a2f1a18ad87a
Sensors:
  System Temperatures: cpu: 49.6 C mobo: N/A gpu: amdgpu temp: 48.0 C
  Fan Speeds (RPM): N/A
Info:
  Processes: 296 Uptime: 10h 9m wakeups: 3 Memory: available: 14.99 GiB used: 3.58 GiB (23.9%)
  Init: systemd v: 253 default: graphical tool: systemctl Compilers: gcc: 13.1.1 clang: 15.0.7
  Packages: pm: pacman pkgs: 1714 libs: 462 tools: octopi,paru Client: shell wrapper
  v: 5.1.16-release inxi: 3.3.27
Garuda (2.6.16-1):
  System install date:     2022-11-07
  Last full system update: 2023-07-01 ↻
  Is partially upgraded:   No
  Relevant software:       snapper NetworkManager mkinitcpio
  Windows dual boot:       Probably (Run as root to verify)
  Failed units:            pamac-cleancache.service 

Sorry for the long post, any help would be greatly appreciated.

It could be as simple as a kernel bug affecting your hardware. Test by running the LTS kernel for a while and see if the problem keeps happening.

sudo pacman -S linux-lts linux-lts-headers

Choose the LTS kernel from the advanced boot options in the Grub menu.

6 Likes

Will do, thanks!

As @BluishHumility already mentioned, kernel issues often are the cause of this type of behavior. Be sure to also test the mainline kernel as well if the LTS doesn’t help.

An outdated bios can also be responsible for this happening, be sure your bios is up to date.

The above errors could be linked to your wifi not coming out of suspend correctly. Be sure to test disabling your VPN before suspending. If that doesn’t help disable your network connection/wifi completely prior to suspending.

Good luck resolving your issue.

2 Likes

This seems to have solved my problem. My system was stable ever sice changing the kernel. I will also look into possible bios updates. Thank you so much guys!

This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.