Freez on every linux I try, what's the problem?

Hello Garuda users.

╭─cris@cris in ~ took 14ms
 ╰─λ garuda-inxi
System:
  Kernel: 6.2.12-x64v1-xanmod1-1 arch: x86_64 bits: 64 compiler: gcc v: 12.2.1
    parameters: BOOT_IMAGE=/@/boot/vmlinuz-linux-xanmod
    root=UUID=66de7033-9843-4959-9cb3-e6f49cb0d3c0 rw rootflags=subvol=@
    quiet loglevel=3 ibt=off
  Desktop: KDE Plasma v: 5.27.4 tk: Qt v: 5.15.9 wm: kwin_x11 vt: 1 dm: SDDM
    Distro: Garuda Linux base: Arch Linux
Machine:
  Type: Laptop System: HUAWEI product: KPL-W0X v: M1D
    serial: <superuser required>
  Mobo: HUAWEI model: KPL-W0X-PCB v: M1D serial: <superuser required>
    UEFI: HUAWEI v: 1.24 date: 04/11/2022
Battery:
  ID-1: BAT1 charge: 54.4 Wh (99.8%) condition: 54.5/56.3 Wh (96.8%)
    volts: 7.9 min: 7.6 model: DYNAPACK HB4593R1ECW type: Li-ion
    serial: <filter> status: not charging cycles: 113
CPU:
  Info: model: AMD Ryzen 5 2500U with Radeon Vega Mobile Gfx bits: 64
    type: MT MCP arch: Zen level: v3 note: check built: 2017-19 process: GF 14nm
    family: 0x17 (23) model-id: 0x11 (17) stepping: 0 microcode: 0x8101007
  Topology: cpus: 1x cores: 4 tpc: 2 threads: 8 smt: enabled cache:
    L1: 384 KiB desc: d-4x32 KiB; i-4x64 KiB L2: 2 MiB desc: 4x512 KiB L3: 4 MiB
    desc: 1x4 MiB
  Speed (MHz): avg: 1542 high: 1600 min/max: 1600/2000 boost: enabled
    scaling: driver: acpi-cpufreq governor: powersave cores: 1: 1600 2: 1600
    3: 1370 4: 1600 5: 1600 6: 1600 7: 1600 8: 1369 bogomips: 31939
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm
  Vulnerabilities: <filter>
Graphics:
  Device-1: AMD Raven Ridge [Radeon Vega Series / Radeon Mobile Series]
    vendor: Huawei driver: amdgpu v: kernel arch: GCN-5 code: Vega
    process: GF 14nm built: 2017-20 pcie: gen: 3 speed: 8 GT/s lanes: 16
    ports: active: eDP-1 empty: DP-1,DP-2,HDMI-A-1 bus-ID: 02:00.0
    chip-ID: 1002:15dd class-ID: 0300 temp: 40.0 C
  Device-2: Quanta hm1091_techfront type: USB driver: uvcvideo bus-ID: 3-1:2
    chip-ID: 0408:1020 class-ID: 0e02
  Display: x11 server: X.org v: 1.21.1.8 compositor: kwin_x11 driver: X:
    loaded: amdgpu unloaded: modesetting alternate: fbdev,vesa dri: radeonsi
    gpu: amdgpu display-ID: :0 note: <missing: xdpyinfo/xrandr>
  Monitor-1: eDP-1 model: ChiMei InnoLux 0x14d4 built: 2016 res: 1920x1080
    dpi: 158 gamma: 1.2 size: 309x173mm (12.17x6.81") diag: 354mm (13.9")
    ratio: 16:9 modes: max: 1920x1080 min: 640x480
  API: OpenGL Message: Unable to show GL data. Required tool glxinfo
    missing.
Audio:
  Device-1: AMD Raven/Raven2/Fenghuang HDMI/DP Audio vendor: Huawei
    driver: snd_hda_intel v: kernel pcie: gen: 3 speed: 8 GT/s lanes: 16
    bus-ID: 02:00.1 chip-ID: 1002:15de class-ID: 0403
  Device-2: AMD ACP/ACP3X/ACP6x Audio Coprocessor vendor: Huawei
    driver: snd_pci_acp3x v: kernel alternate: snd_rn_pci_acp3x, snd_pci_acp5x,
    snd_pci_acp6x, snd_acp_pci, snd_rpl_pci_acp6x, snd_pci_ps,
    snd_sof_amd_renoir, snd_sof_amd_rembrandt pcie: gen: 3 speed: 8 GT/s
    lanes: 16 bus-ID: 02:00.5 chip-ID: 1022:15e2 class-ID: 0480
  Device-3: AMD Family 17h/19h HD Audio vendor: Huawei driver: snd_hda_intel
    v: kernel pcie: gen: 3 speed: 8 GT/s lanes: 16 bus-ID: 02:00.6
    chip-ID: 1022:15e3 class-ID: 0403
  API: ALSA v: k6.2.12-x64v1-xanmod1-1 status: kernel-api tools: N/A
  Server-1: PipeWire v: 0.3.69 status: active with: 1: pipewire-pulse
    status: active 2: wireplumber status: active 3: pipewire-alsa type: plugin
    4: pw-jack type: plugin tools: pactl,pw-cat,pw-cli,wpctl
Network:
  Device-1: Intel Wireless 8265 / 8275 driver: iwlwifi v: kernel pcie: gen: 1
    speed: 2.5 GT/s lanes: 1 bus-ID: 01:00.0 chip-ID: 8086:24fd class-ID: 0280
  IF: wlp1s0 state: up mac: <filter>
Bluetooth:
  Device-1: Intel Bluetooth wireless interface type: USB driver: btusb v: 0.8
    bus-ID: 1-2:2 chip-ID: 8087:0a2b class-ID: e001
  Report: bt-adapter ID: hci0 rfk-id: 0 state: down
    bt-service: enabled,running rfk-block: hardware: no software: yes
    address: <filter>
Drives:
  Local Storage: total: 238.47 GiB used: 21.24 GiB (8.9%)
  SMART Message: Required tool smartctl not installed. Check --recommends
  ID-1: /dev/sda maj-min: 8:0 vendor: LITE-ON model: CV8-8E256
    size: 238.47 GiB block-size: physical: 512 B logical: 512 B speed: 6.0 Gb/s
    type: SSD serial: <filter> rev: 402 scheme: GPT
Partition:
  ID-1: / raw-size: 238.17 GiB size: 238.17 GiB (100.00%)
    used: 21.24 GiB (8.9%) fs: btrfs dev: /dev/sda2 maj-min: 8:2
  ID-2: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%)
    used: 576 KiB (0.2%) fs: vfat dev: /dev/sda1 maj-min: 8:1
  ID-3: /home raw-size: 238.17 GiB size: 238.17 GiB (100.00%)
    used: 21.24 GiB (8.9%) fs: btrfs dev: /dev/sda2 maj-min: 8:2
  ID-4: /var/log raw-size: 238.17 GiB size: 238.17 GiB (100.00%)
    used: 21.24 GiB (8.9%) fs: btrfs dev: /dev/sda2 maj-min: 8:2
  ID-5: /var/tmp raw-size: 238.17 GiB size: 238.17 GiB (100.00%)
    used: 21.24 GiB (8.9%) fs: btrfs dev: /dev/sda2 maj-min: 8:2
Swap:
  Kernel: swappiness: 133 (default 60) cache-pressure: 50 (default 100)
  ID-1: swap-1 type: zram size: 6.67 GiB used: 0 KiB (0.0%) priority: 100
    dev: /dev/zram0
Sensors:
  System Temperatures: cpu: 40.5 C mobo: N/A gpu: amdgpu temp: 40.0 C
  Fan Speeds (RPM): N/A
Info:
  Processes: 243 Uptime: 8m wakeups: 1 Memory: 6.67 GiB used: 1.69 GiB (25.3%)
  Init: systemd v: 253 default: graphical tool: systemctl Compilers: N/A
  Packages: pm: pacman pkgs: 1058 libs: 301 tools: paru Shell: Bash v: 5.1.16
  running-in: konsole inxi: 3.3.26
Garuda (2.6.16-1):
  System install date:     2023-04-23
  Last full system update: 2023-04-23
  Is partially upgraded:   No
  Relevant software:       snapper NetworkManager dracut
  Windows dual boot:       No/Undetected
  Failed units:            systemd-oomd.socket 

I have 3 or more Freez every day with zen kernel or normal arch kernel, I have tried different linux (arch, debian, fedora, opensuse, mint), different DE (kde, cinnamon,gnome), different kernels (old LTS, current LTS, 6.2, 6.3RC, zen, arch base kernel) but is still teh same, 3 or more freez a day. Now with Xanmod after 1 week that was good enought there was a first freez, so with Xanmod issue seems mitigated but not solved.
Freez happens 90% when watching streaming on the web, but 10% happens when made some basic stuff as open a file or syncing files, and when it happens no touchpad, no keyboard workin, I can only restart using power button.
I think Linux= kernel + litlle other things so I think is kernel related, my huawei matebook 14 AMD with a Ryzen Zen+ Cpu is not 100% linux ready.
What's is the problem? why my linux not been stable enought?
Thanks
This is the log after restart

╭─cris@cris in ~ took 14ms
 ╰─λ journalctl -b -p3 --no-hostname --no-pager
apr 23 12:52:43 kernel: AMD-Vi: [Firmware Bug]: : IOAPIC[4] not in IVRS table
apr 23 12:52:43 kernel: AMD-Vi: [Firmware Bug]: : IOAPIC[5] not in IVRS table
apr 23 12:52:43 kernel: AMD-Vi: [Firmware Bug]: : No southbridge IOAPIC found
apr 23 12:52:43 kernel: AMD-Vi: Disabling interrupt remapping
apr 23 12:52:43 kernel: tpm_crb MSFT0101:00: can't request region for resource [mem 0x8f7a2000-0x8f7a5fff]
apr 23 12:52:46 kernel: kfd kfd: amdgpu: Failed to resume IOMMU for device 1002:15dd
apr 23 12:52:46 kernel: kfd kfd: amdgpu: device 1002:15dd NOT added due to errors
apr 23 12:52:47 kernel: Bluetooth: hci0: Malformed MSFT vendor event: 0x02
apr 23 12:52:48 bluetoothd[612]: src/plugin.c:plugin_init() Failed to init vcp plugin
apr 23 12:52:48 bluetoothd[612]: src/plugin.c:plugin_init() Failed to init mcp plugin
apr 23 12:52:48 bluetoothd[612]: src/plugin.c:plugin_init() Failed to init bap plugin
apr 23 12:52:48 bluetoothd[612]: Failed to set mode: Failed (0x03)
apr 23 12:52:53 nmbd[1206]: [2023/04/23 12:52:53.028600,  0] ../../source3/nmbd/nmbd.c:901(main)
apr 23 12:52:53 nmbd[1206]:   nmbd version 4.18.2 started.
apr 23 12:52:53 nmbd[1206]:   Copyright Andrew Tridgell and the Samba Team 1992-2023
apr 23 12:52:53 nmbd[1206]: [2023/04/23 12:52:53.030905,  0] ../../lib/util/become_daemon.c:150(daemon_status)
apr 23 12:52:53 nmbd[1206]:   daemon_status: daemon 'nmbd' : No local IPv4 non-loopback interfaces available, waiting for interface ...
apr 23 12:52:53 nmbd[1206]: [2023/04/23 12:52:53.030971,  0] ../../source3/nmbd/nmbd_subnetdb.c:252(create_subnets)
apr 23 12:52:53 nmbd[1206]:   NOTE: NetBIOS name resolution is not supported for Internet Protocol Version 6 (IPv6).
apr 23 12:52:55 systemd[1210]: Failed to start Profile-sync-daemon.
apr 23 12:52:58 smbd[1478]: [2023/04/23 12:52:58.702524,  0] ../../source3/smbd/server.c:1746(main)
apr 23 12:52:58 smbd[1478]:   smbd version 4.18.2 started.
apr 23 12:52:58 smbd[1478]:   Copyright Andrew Tridgell and the Samba Team 1992-2023
apr 23 12:52:59 bluetoothd[612]: Failed to set mode: Failed (0x03)
apr 23 12:53:31 nmbd[1206]: [2023/04/23 12:53:31.576275,  0] ../../source3/nmbd/nmbd_become_lmb.c:398(become_local_master_stage2)
apr 23 12:53:31 nmbd[1206]:   *****
apr 23 12:53:31 nmbd[1206]: 
apr 23 12:53:31 nmbd[1206]:   Samba name server CRIS-HUAWEI is now a local master browser for workgroup WORKGROUP on subnet 192.168.1.2
apr 23 12:53:31 nmbd[1206]: 
apr 23 12:53:31 nmbd[1206]:   *****
apr 23 12:56:09 konsole[2348]: kf.xmlgui: Shortcut for action  "" "Mostra comandi rapidi" set with QAction::setShortcut()! Use KActionCollection::setDefaultShortcut(s) instead.
apr 23 12:56:09 konsole[2348]: kf.xmlgui: Shortcut for action  "" "Mostra il gestore SSH" set with QAction::setShortcut()! Use KActionCollection::setDefaultShortcut(s) instead.
apr 23 12:56:09 sudo[2423]:     cris : a password is required ; TTY=pts/0 ; PWD=/home/cris ; USER=root ; COMMAND=/usr/bin/true


It sounds like a hardware issue, but there is not enough information to say for sure.

Make sure your BIOS is up to date, and test your RAM with Memtest86+ or similar. If you must go deeper:

3 Likes

thanks, the problem is that my BIOS is updated but last version by Huawei was in 2020 so is abandoned hardware.
Logs doen't helps or I was not able to find issue into the log.
I will try some Grub parameter to see if can help with my zen+ SoC that not support AMD p-state (ready for Zen2 or new SoC)
If I have only 1 freez a week using Xanmod kernel will be good and stable enought, or I will try
GRUB_CMDLINE_LINUX="vga=current ivrs_ioapic[4]=00:14.0 ivrs_ioapic[5]=00:00.2 iommu=pt idle=nomwait acpi_backlight=vendor acpi_enforce_resources=lax scsi_mod.use_blk_mq=1" as suggested for my Matebook on the web
thanks

What Bluish said, especially running long memtest.

Also, Did you do any tuning via powertop? I've seen that hang systems sometimes if some elements are tuned and others not. One test to try is load into a kernel you know hangs badly, run sudo powertop --auto-tune and see if it make a difference. If the system crashes like normal you can rule out this line. Another would be to disable all power-saving tweaks (auto-cpu for example) if you have them enabled.

2 Likes

No powertop on my system and no powersaving or performance tweak enabled on Garuda.
When I was in dualboot there was no freez using windows so I think there aren't memory issue but I will check for hw issue.
Probably old bios not updated by vendor in addiction to zen+ SoC not well supported in kernel (new zen2-3-4 SoC are well supported instead) is the guilty
Thanks