MSI GE76 Raider: nvidia card "no go"

Hi guys,
I'm a happy user of Garuda in KDE and Gnome flavors. My colleagues at work even switched to It since they saw my laptop.

I have this MSI GE76 Raider 11UH, i9-11980HK and 65GB RAM and I installed Garuda in the KDE Dragonized flavor some months ago. It was great but I stopped using this laptop so much. I had tried the Qtile flavor and I have to say I loved It so a couple of days ago I took a change to wipe the KDE installation and install Garuda Qtile from scratch but I'm having very big issues.

First thing first, this is the output of my garuda-inxi:

System:
  Kernel: 5.18.11-zen1-1-zen arch: x86_64 bits: 64 compiler: gcc v: 12.1.0
    parameters: BOOT_IMAGE=/@/boot/vmlinuz-linux-zen
    root=UUID=cb4d784c-4a56-4654-a583-54169cf12d39 rw [email protected]
    quiet quiet splash rd.udev.log_priority=3 vt.global_cursor_default=0
    loglevel=3
  Desktop: Qtile v: 0.21.0 wm: LG3D vt: 1 dm: SDDM Distro: Garuda Linux
    base: Arch Linux
Machine:
  Type: Laptop System: Micro-Star product: GE76 Raider 11UH v: REV:1.0
    serial: <superuser required> Chassis: type: 10 serial: <superuser required>
  Mobo: Micro-Star model: MS-17K3 v: REV:1.0 serial: <superuser required>
    UEFI: American Megatrends LLC. v: E17K3IMS.11B date: 09/15/2021
Battery:
  ID-1: BAT1 charge: 55.3 Wh (92.3%) condition: 59.9/95.0 Wh (63.0%)
    volts: 16.2 min: 15.2 model: MSI BIF0_9 type: Li-ion serial: N/A
    status: charging
CPU:
  Info: model: 11th Gen Intel Core i9-11980HK bits: 64 type: MT MCP
    arch: Tiger Lake gen: core 11 built: 2020 process: Intel 10nm family: 6
    model-id: 0x8D (141) stepping: 1 microcode: 0x3E
  Topology: cpus: 1x cores: 8 tpc: 2 threads: 16 smt: enabled cache:
    L1: 640 KiB desc: d-8x48 KiB; i-8x32 KiB L2: 10 MiB desc: 8x1.2 MiB
    L3: 24 MiB desc: 1x24 MiB
  Speed (MHz): avg: 1149 high: 4400 min/max: 800/5000:4900 scaling:
    driver: intel_pstate governor: powersave cores: 1: 801 2: 800 3: 800 4: 801
    5: 801 6: 2776 7: 801 8: 801 9: 800 10: 801 11: 801 12: 801 13: 801
    14: 801 15: 801 16: 4400 bogomips: 105676
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
  Vulnerabilities:
  Type: itlb_multihit status: Not affected
  Type: l1tf status: Not affected
  Type: mds status: Not affected
  Type: meltdown status: Not affected
  Type: mmio_stale_data status: Not affected
  Type: spec_store_bypass
    mitigation: Speculative Store Bypass disabled via prctl
  Type: spectre_v1
    mitigation: usercopy/swapgs barriers and __user pointer sanitization
  Type: spectre_v2
    mitigation: Enhanced IBRS, IBPB: conditional, RSB filling
  Type: srbds status: Not affected
  Type: tsx_async_abort status: Not affected
Graphics:
  Device-1: Intel TigerLake-H GT1 [UHD Graphics] vendor: Micro-Star MSI
    driver: i915 v: kernel arch: Gen12.1 process: Intel 10nm built: 2020-21
    ports: active: eDP-1 empty: DP-1,DP-2 bus-ID: 00:02.0 chip-ID: 8086:9a60
    class-ID: 0300
  Device-2: NVIDIA GA104M [GeForce RTX 3080 Mobile / Max-Q 8GB/16GB]
    vendor: Micro-Star MSI driver: nouveau v: kernel non-free: 515.xx+
    status: current (as of 2022-06) arch: Ampere process: TSMC n7 (7nm)
    built: 2020-22 pcie: gen: 4 speed: 16 GT/s lanes: 16 ports: active: none
    empty: DP-3, DP-4, HDMI-A-1, eDP-2 bus-ID: 01:00.0 chip-ID: 10de:249c
    class-ID: 0300
  Display: x11 server: X.Org v: 21.1.4 compositor: Picom v: git-c4107
    driver: X: loaded: modesetting,nouveau alternate: fbdev,intel,nv,vesa
    gpu: i915 display-ID: :0 screens: 1
  Screen-1: 0 s-res: 1920x1080 s-dpi: 96 s-size: 507x285mm (19.96x11.22")
    s-diag: 582mm (22.9")
  Monitor-1: eDP-1 model: AU Optronics 0xa988 built: 2019 res: 1920x1080
    dpi: 128 gamma: 1.2 size: 381x214mm (15x8.43") diag: 437mm (17.2")
    ratio: 16:9 modes: 3840x2160
  Message: Unable to show GL data. Required tool glxinfo missing.
Audio:
  Device-1: Intel Tiger Lake-H HD Audio vendor: Micro-Star MSI
    driver: sof-audio-pci-intel-tgl
    alternate: snd_hda_intel,snd_sof_pci_intel_tgl bus-ID: 00:1f.3
    chip-ID: 8086:43c8 class-ID: 0401
  Device-2: NVIDIA GA104 High Definition Audio vendor: Micro-Star MSI
    driver: snd_hda_intel v: kernel pcie: gen: 4 speed: 16 GT/s lanes: 16
    bus-ID: 01:00.1 chip-ID: 10de:228b class-ID: 0403
  Sound Server-1: ALSA v: k5.18.11-zen1-1-zen running: yes
  Sound Server-2: PulseAudio v: 16.1 running: no
  Sound Server-3: PipeWire v: 0.3.55 running: yes
Network:
  Device-1: Realtek Killer E3000 2.5GbE vendor: Micro-Star MSI driver: r8169
    v: kernel pcie: gen: 2 speed: 5 GT/s lanes: 1 port: 4000 bus-ID: 2e:00.0
    chip-ID: 10ec:3000 class-ID: 0200
  IF: enp46s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
  Device-2: Intel Wi-Fi 6 AX210/AX211/AX411 160MHz vendor: Rivet Networks
    driver: iwlwifi v: kernel pcie: gen: 2 speed: 5 GT/s lanes: 1
    bus-ID: 30:00.0 chip-ID: 8086:2725 class-ID: 0280
  IF: wlp48s0 state: up mac: <filter>
Bluetooth:
  Device-1: Intel AX210 Bluetooth type: USB driver: btusb v: 0.8
    bus-ID: 3-14:6 chip-ID: 8087:0032 class-ID: e001
  Report: bt-adapter ID: hci0 rfk-id: 2 state: down
    bt-service: enabled,running rfk-block: hardware: no software: yes
    address: <filter>
Drives:
  Local Storage: total: 2.1 TiB used: 10.59 GiB (0.5%)
  SMART Message: Required tool smartctl not installed. Check --recommends
  ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Samsung
    model: MZVL22T0HBLB-00B00 size: 1.86 TiB block-size: physical: 512 B
    logical: 512 B speed: 63.2 Gb/s lanes: 4 type: SSD serial: <filter>
    rev: GXB7301Q temp: 46.9 C scheme: GPT
  ID-2: /dev/sda maj-min: 8:0 type: USB vendor: JMicron Tech model: N/A
    size: 238.47 GiB block-size: physical: 4096 B logical: 512 B type: N/A
    serial: <filter> rev: 0209 scheme: GPT
Partition:
  ID-1: / raw-size: 238.17 GiB size: 238.17 GiB (100.00%)
    used: 10.59 GiB (4.4%) fs: btrfs dev: /dev/sda2 maj-min: 8:2
  ID-2: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%)
    used: 608 KiB (0.2%) fs: vfat dev: /dev/sda1 maj-min: 8:1
  ID-3: /home raw-size: 238.17 GiB size: 238.17 GiB (100.00%)
    used: 10.59 GiB (4.4%) fs: btrfs dev: /dev/sda2 maj-min: 8:2
  ID-4: /var/log raw-size: 238.17 GiB size: 238.17 GiB (100.00%)
    used: 10.59 GiB (4.4%) fs: btrfs dev: /dev/sda2 maj-min: 8:2
  ID-5: /var/tmp raw-size: 238.17 GiB size: 238.17 GiB (100.00%)
    used: 10.59 GiB (4.4%) fs: btrfs dev: /dev/sda2 maj-min: 8:2
Swap:
  Kernel: swappiness: 133 (default 60) cache-pressure: 100 (default)
  ID-1: swap-1 type: zram size: 62.51 GiB used: 0 KiB (0.0%) priority: 100
    dev: /dev/zram0
Sensors:
  System Temperatures: cpu: 56.0 C mobo: N/A
  Fan Speeds (RPM): N/A
Info:
  Processes: 379 Uptime: 19m wakeups: 6 Memory: 62.51 GiB
  used: 2.03 GiB (3.2%) Init: systemd v: 251 default: graphical
  tool: systemctl Compilers: gcc: 12.1.0 Packages: pacman: 1172 lib: 302
  Shell: fish v: 3.4.1 default: Bash v: 5.1.16 running-in: alacritty
  inxi: 3.3.19
e[1;34mGaruda (2.6.5-1):e[0m
e[1;34m  System install date:e[0m     2022-07-15
e[1;34m  Last full system update:e[0m 2022-07-15
e[1;34m  Is partially upgraded:  e[0m No
e[1;34m  Relevant software:      e[0m NetworkManager
e[1;34m  Windows dual boot:      e[0m Probably (Run as root to verify)
e[1;34m  Snapshots:              e[0m Snapper
e[1;34m  Failed units:           e[0m 

I'm installing the system on an external 250 GB NVMe drive connected to the thunderbolt port with a proper cable.

First thing I did was booting with proprietary nVidia drivers and I got a bad error message similar to the one you'll see with the niveau driver. That error was blocking, the loading got stuck to "installing driver" and I had to give up.
Then I tried to boot using open source drivers and I actually managed to boot even though I got an error saying may times:

nouveau 0000:01:00.0: timer: stalled at ffffffffffffffff

It boots fine, I get to the login screen and when I login as 'garuda' it takes 2/3 minutes to enable me to use the keyboard (I can move the mouse on the desktop).
It runs kinda fine but sometimes when I open some application (like the terminal :D) it hans for about a minute or more and then resumes. I believe this is pretty visible in the logs.

Now, if you really need it I can enable the nvidia driver again and take a pic of the bad error but I'd have to most probably reinstall the entire system since no kernel flags I'v tried revert to booting with niveau. It's not a problem, I don't have almost anything and I can reinstall everything again using ansible, it's just a bit of a pain in the neck.

Here's the output of dmesg (Antonio would be me btw):

7e691deac0371145b012a4b682aab60868bda96d]
[  224.187721]  nouveau_mem_map+0xb3/0x100 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.187766]  nouveau_bo_move+0x735/0x9c0 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.187810]  ttm_bo_handle_move_mem+0x89/0x190 [ttm c53c962fceff8c5feb19d77a789f75ba50209331]
[  224.187813]  ttm_mem_evict_first+0x2d1/0x5b0 [ttm c53c962fceff8c5feb19d77a789f75ba50209331]
[  224.187816]  ? nv50_display_fini+0xa9/0x110 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.187858]  ttm_resource_manager_evict_all+0xa7/0x1d0 [ttm c53c962fceff8c5feb19d77a789f75ba50209331]
[  224.187861]  nouveau_do_suspend+0x94/0x190 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.187904]  nouveau_pmops_runtime_suspend+0x3e/0xb0 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.187947]  pci_pm_runtime_suspend+0x5d/0x180
[  224.187949]  ? pci_dev_put+0x20/0x20
[  224.187950]  ? pci_dev_put+0x20/0x20
[  224.187951]  __rpm_callback+0x41/0x160
[  224.187952]  ? pci_dev_put+0x20/0x20
[  224.187953]  rpm_suspend+0x43a/0x970
[  224.187954]  ? __schedule+0xb10/0x1300
[  224.187956]  pm_runtime_work+0x98/0xb0
[  224.187957]  process_one_work+0x252/0x410
[  224.187958]  worker_thread+0x55/0x4d0
[  224.187959]  ? process_one_work+0x410/0x410
[  224.187960]  kthread+0x13c/0x160
[  224.187961]  ? kthread_complete_and_exit+0x20/0x20
[  224.187962]  ret_from_fork+0x1f/0x30
[  224.187964]  </TASK>
[  224.187964] ---[ end trace 0000000000000000 ]---
[  224.190433] nouveau 0000:01:00.0: timer: stalled at ffffffffffffffff
[  224.190434] ------------[ cut here ]------------
[  224.190435] nouveau 0000:01:00.0: timeout
[  224.190444] WARNING: CPU: 2 PID: 275 at drivers/gpu/drm/nouveau/nvkm/subdev/bar/g84.c:35 g84_bar_flush+0xfa/0x110 [nouveau]
[  224.190478] Modules linked in: snd_seq_dummy snd_hrtimer snd_seq snd_seq_device rfcomm ccm cmac algif_hash algif_skcipher af_alg qrtr bnep snd_soc_skl_hda_dsp snd_soc_intel_hda_dsp_common snd_sof_probes snd_soc_hdac_hdmi vfat fat snd_hda_codec_realtek snd_hda_codec_generic snd_soc_dmic snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_soc_hdac_hda snd_hda_ext_core btusb snd_soc_acpi_intel_match snd_soc_acpi btrtl soundwire_bus btbcm btintel ledtrig_audio snd_soc_core btmtk snd_compress iwlmvm intel_tcc_cooling bluetooth ac97_bus x86_pkg_temp_thermal snd_hda_codec_hdmi snd_pcm_dmaengine iTCO_wdt intel_powerclamp snd_hda_intel coretemp hid_multitouch intel_pmc_bxt ecdh_generic mac80211 spi_nor libarc4 kvm_intel iTCO_vendor_support intel_rapl_msr crc16 snd_intel_dspcfg ee1004 mtd pmt_telemetry pmt_class snd_intel_sdw_acpi iwlwifi kvm
[  224.190517] CPU: 2 PID: 275 Comm: kworker/2:2 Tainted: G        W         5.18.11-zen1-1-zen #1 1acf6f24b6567b681b0a060b1dcfe38ff2419835
[  224.190518] Hardware name: Micro-Star International Co., Ltd. GE76 Raider 11UH/MS-17K3, BIOS E17K3IMS.11B 09/15/2021
[  224.190518] Workqueue: pm pm_runtime_work
[  224.190519] RIP: 0010:g84_bar_flush+0xfa/0x110 [nouveau]
[  224.190553] Code: 8b 40 10 48 8b 78 10 48 8b 5f 50 48 85 db 75 03 48 8b 1f e8 c8 d9 25 fc 48 89 da 48 c7 c7 97 4b 3e c0 48 89 c6 e8 df 34 70 fc <0f> 0b eb aa e8 4d bf 76 fc 66 66 2e 0f 1f 84 00 00 00 00 00 66 90
[  224.190554] RSP: 0018:ffffb30b8116b708 EFLAGS: 00010086
[  224.190555] RAX: 0000000000000000 RBX: ffff935402cf6090 RCX: 0000000000000027
[  224.190555] RDX: ffff93634b8a16a8 RSI: 0000000000000001 RDI: ffff93634b8a16a0
[  224.190556] RBP: ffff935401a4b318 R08: 0000000000000001 R09: 00000000ffffffea
[  224.190556] R10: ffffffffbda5aa20 R11: 0000000000000003 R12: 0000000000000246
[  224.190557] R13: ffffffffc0375ce0 R14: 0000000000020000 R15: ffff93540b6aae00
[  224.190557] FS:  0000000000000000(0000) GS:ffff93634b880000(0000) knlGS:0000000000000000
[  224.190558] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  224.190558] CR2: 00007eff84002018 CR3: 00000003ab410004 CR4: 0000000000f70ee0
[  224.190559] PKRU: 55555554
[  224.190559] Call Trace:
[  224.190560]  <TASK>
[  224.190560]  nv50_instobj_release+0x34/0xc0 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.191087]  nouveau_pmops_runtime_suspend+0x3e/0xb0 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.191130]  pci_pm_runtime_suspend+0x5d/0x180
[  224.191132]  ? pci_dev_put+0x20/0x20
[  224.191133]  ? pci_dev_put+0x20/0x20
[  224.191134]  __rpm_callback+0x41/0x160
[  224.191135]  ? pci_dev_put+0x20/0x20
[  224.191136]  rpm_suspend+0x43a/0x970
[  224.191137]  ? __schedule+0xb10/0x1300
[  224.191139]  pm_runtime_work+0x98/0xb0
[  224.191140]  process_one_work+0x252/0x410
[  224.191141]  worker_thread+0x55/0x4d0
[  224.191142]  ? process_one_work+0x410/0x410
[  224.191143]  kthread+0x13c/0x160
[  224.191144]  ? kthread_complete_and_exit+0x20/0x20
[  224.191145]  ret_from_fork+0x1f/0x30
[  224.191147]  </TASK>
[  224.191147] ---[ end trace 0000000000000000 ]---
[  224.193619] nouveau 0000:01:00.0: timer: stalled at ffffffffffffffff
[  224.193620] ------------[ cut here ]------------
[  224.193620] nouveau 0000:01:00.0: timeout
[  224.193629] WARNING: CPU: 2 PID: 275 at drivers/gpu/drm/nouveau/nvkm/subdev/mmu/vmmtu102.c:43 tu102_vmm_flush+0x165/0x170 [nouveau]
[  224.193670] Modules linked in: snd_seq_dummy snd_hrtimer snd_seq snd_seq_device rfcomm ccm cmac algif_hash algif_skcipher af_alg qrtr bnep snd_soc_skl_hda_dsp snd_soc_intel_hda_dsp_common snd_sof_probes snd_soc_hdac_hdmi vfat fat snd_hda_codec_realtek snd_hda_codec_generic snd_soc_dmic snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_soc_hdac_hda snd_hda_ext_core btusb snd_soc_acpi_intel_match snd_soc_acpi btrtl soundwire_bus btbcm btintel ledtrig_audio snd_soc_core btmtk snd_compress iwlmvm intel_tcc_cooling bluetooth ac97_bus x86_pkg_temp_thermal snd_hda_codec_hdmi snd_pcm_dmaengine iTCO_wdt intel_powerclamp snd_hda_intel coretemp hid_multitouch intel_pmc_bxt ecdh_generic mac80211 spi_nor libarc4 kvm_intel iTCO_vendor_support intel_rapl_msr crc16 snd_intel_dspcfg ee1004 mtd pmt_telemetry pmt_class snd_intel_sdw_acpi iwlwifi kvm
[  224.193684]  processor_thermal_device_pci_legacy r8169 snd_hda_codec processor_thermal_device iwlmei snd_hda_core processor_thermal_rfim irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd intel_cstate snd_hwdep cfg80211 intel_uncore psmouse gpio_keys wmi_bmof snd_pcm realtek processor_thermal_mbox intel_lpss_pci processor_thermal_rapl snd_timer i2c_i801 mdio_devres joydev spi_intel_pci msi_wmi rfkill intel_lpss mousedev snd spi_intel intel_rapl_common i2c_smbus sparse_keymap mei libphy thunderbolt idma64 intel_vsec soundcore intel_soc_dts_iosf tpm_crb i2c_hid_acpi tpm_tis i2c_hid tpm_tis_core int3403_thermal int340x_thermal_zone tpm acpi_pad int3400_thermal rng_core acpi_tad mac_hid acpi_thermal_rel soc_button_array uinput fuse crypto_user zram bpf_preload ip_tables x_tables hid_logitech_hidpp btrfs blake2b_generic libcrc32c crc32c_generic xor raid6_pq hid_logitech_dj uas usb_storage usbhid rtsx_pci_sdmmc mmc_core nvme xhci_pci nvme_core rtsx_pci
[  224.193700]  xhci_pci_renesas serio_raw atkbd libps2 vivaldi_fmap i8042 serio radeon amdgpu gpu_sched intel_agp crc32c_intel i915 nouveau mxm_wmi wmi drm_buddy video drm_ttm_helper ttm drm_dp_helper intel_gtt
[  224.193704] CPU: 2 PID: 275 Comm: kworker/2:2 Tainted: G        W         5.18.11-zen1-1-zen #1 1acf6f24b6567b681b0a060b1dcfe38ff2419835
[  224.193705] Hardware name: Micro-Star International Co., Ltd. GE76 Raider 11UH/MS-17K3, BIOS E17K3IMS.11B 09/15/2021
[  224.193705] Workqueue: pm pm_runtime_work
[  224.193706] RIP: 0010:tu102_vmm_flush+0x165/0x170 [nouveau]
[  224.193745] Code: 8b 40 10 48 8b 78 10 48 8b 5f 50 48 85 db 75 03 48 8b 1f e8 1d 9f 1f fc 48 89 da 48 c7 c7 57 69 3e c0 48 89 c6 e8 34 fa 69 fc <0f> 0b eb a5 e8 a2 84 70 fc 66 90 f3 0f 1e fa 0f 1f 44 00 00 ff 74
[  224.193746] RSP: 0018:ffffb30b8116b740 EFLAGS: 00010282
[  224.193746] RAX: 0000000000000000 RBX: ffff935402cf6090 RCX: 0000000000000027
[  224.193747] RDX: ffff93634b8a16a8 RSI: 0000000000000001 RDI: ffff93634b8a16a0
[  224.193747] RBP: ffff93540b6aae00 R08: 0000000000000001 R09: 00000000ffffffea
[  224.193748] R10: ffffffffbda5aa20 R11: 0000000000000003 R12: 0000000002000001
[  224.193748] R13: ffffffffc0375ce0 R14: ffffb30b8116b880 R15: ffff93540b6aae00
[  224.193749] FS:  0000000000000000(0000) GS:ffff93634b880000(0000) knlGS:0000000000000000
[  224.193749] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  224.193750] CR2: 00007eff84002018 CR3: 00000003ab410004 CR4: 0000000000f70ee0
[  224.193750] PKRU: 55555554
[  224.193751] Call Trace:
[  224.193751]  <TASK>
[  224.193752]  ? gp100_vmm_pgt_sgl+0x170/0x170 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.193789]  nvkm_vmm_map+0x6e4/0xc60 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.193827]  ? gp100_vmm_pgt_sgl+0x170/0x170 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.194202]  nouveau_pmops_runtime_suspend+0x3e/0xb0 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.194243]  pci_pm_runtime_suspend+0x5d/0x180
[  224.194245]  ? pci_dev_put+0x20/0x20
[  224.194246]  ? pci_dev_put+0x20/0x20
[  224.194247]  __rpm_callback+0x41/0x160
[  224.194248]  ? pci_dev_put+0x20/0x20
[  224.194249]  rpm_suspend+0x43a/0x970
[  224.194250]  ? __schedule+0xb10/0x1300
[  224.194252]  pm_runtime_work+0x98/0xb0
[  224.194253]  process_one_work+0x252/0x410
[  224.194254]  worker_thread+0x55/0x4d0
[  224.194254]  ? process_one_work+0x410/0x410
[  224.194255]  kthread+0x13c/0x160
[  224.194256]  ? kthread_complete_and_exit+0x20/0x20
[  224.194258]  ret_from_fork+0x1f/0x30
[  224.194259]  </TASK>
[  224.194259] ---[ end trace 0000000000000000 ]---

...

7e691deac0371145b012a4b682aab60868bda96d]
[  224.197019]  ? gp100_vmm_pd0_mem+0x1c0/0x1c0 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.197060]  nvkm_mem_map_dma+0x5a/0x80 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.197100]  nvkm_uvmm_mthd+0x579/0x6b0 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.197140]  nvkm_ioctl+0xd9/0x180 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.197172]  nvif_object_mthd+0xcc/0x200 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.197204]  nvif_vmm_map+0x126/0x140 [nouveau 7e691deac0371145b012a4b682aab60868bda96d]
[  224.197236]  nouveau_mem_map+0xb3/0x100 [nouveau 

...


[  224.197465]  pci_pm_runtime_suspend+0x5d/0x180
[  224.197466]  ? pci_dev_put+0x20/0x20
[  224.197467]  ? pci_dev_put+0x20/0x20
[  224.197468]  __rpm_callback+0x41/0x160
[  224.197469]  ? pci_dev_put+0x20/0x20
[  224.197471]  rpm_suspend+0x43a/0x970
[  224.197472]  ? __schedule+0xb10/0x1300
[  224.197473]  pm_runtime_work+0x98/0xb0
[  224.197474]  process_one_work+0x252/0x410
[  224.197475]  worker_thread+0x55/0x4d0
[  224.197476]  ? process_one_work+0x410/0x410
[  224.197477]  kthread+0x13c/0x160
[  224.197478]  ? kthread_complete_and_exit+0x20/0x20
[  224.197480]  ret_from_fork+0x1f/0x30
[  224.197481]  </TASK>
[  224.197481] ---[ end trace 0000000000000000 ]---
[  224.199951] nouveau 0000:01:00.0: timer: stalled at ffffffffffffffff

Here's a partial log from my last boot (journalctl):

ug 15 20:18:19 garanto kernel: HugeTLB registered 2.00 MiB page size, pre-allocated 0 pages
lug 15 20:18:19 garanto kernel: ACPI: Added _OSI(Module Device)
lug 15 20:18:19 garanto kernel: ACPI: Added _OSI(Processor Device)
lug 15 20:18:19 garanto kernel: ACPI: Added _OSI(3.0 _SCP Extensions)
lug 15 20:18:19 garanto kernel: ACPI: Added _OSI(Processor Aggregator Device)
lug 15 20:18:19 garanto kernel: ACPI: Added _OSI(Linux-Dell-Video)
lug 15 20:18:19 garanto kernel: ACPI: Added _OSI(Linux-Lenovo-NV-HDMI-Audio)
lug 15 20:18:19 garanto kernel: ACPI: Added _OSI(Linux-HPI-Hybrid-Graphics)
lug 15 20:18:19 garanto kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PC00.PEG2.WKEN], AE_ALREADY_EXISTS (20211217/dswload2-326)
lug 15 20:18:19 garanto kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20211217/psobject-220)
lug 15 20:18:19 garanto kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PC00.PEG2._DSW], AE_ALREADY_EXISTS (20211217/dswload2-326)
lug 15 20:18:19 garanto kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20211217/psobject-220)
lug 15 20:18:19 garanto kernel: ACPI: Skipping parse of AML opcode: Method (0x0014)
lug 15 20:18:19 garanto kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PC00.PEG2._PR0], AE_ALREADY_EXISTS (20211217/dswload2-326)
lug 15 20:18:19 garanto kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20211217/psobject-220)
lug 15 20:18:19 garanto kernel: ACPI: Skipping parse of AML opcode: Method (0x0014)
lug 15 20:18:19 garanto kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PC00.PEG2._PR3], AE_ALREADY_EXISTS (20211217/dswload2-326)
lug 15 20:18:19 garanto kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20211217/psobject-220)
lug 15 20:18:19 garanto kernel: ACPI: Skipping parse of AML opcode: Method (0x0014)
lug 15 20:18:19 garanto kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PC00.PEG3.WKEN], AE_ALREADY_EXISTS (20211217/dswload2-326)
lug 15 20:18:19 garanto kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20211217/psobject-220)
lug 15 20:18:19 garanto kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PC00.PEG3._DSW], AE_ALREADY_EXISTS (20211217/dswload2-326)
lug 15 20:18:19 garanto kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20211217/psobject-220)
lug 15 20:18:19 garanto kernel: ACPI: Skipping parse of AML opcode: Method (0x0014)
lug 15 20:18:19 garanto kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PC00.PEG3._PR0], AE_ALREADY_EXISTS (20211217/dswload2-326)
lug 15 20:18:19 garanto kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20211217/psobject-220)
lug 15 20:18:19 garanto kernel: ACPI: Skipping parse of AML opcode: Method (0x0014)
lug 15 20:18:19 garanto kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PC00.PEG3._PR3], AE_ALREADY_EXISTS (20211217/dswload2-326)
lug 15 20:18:19 garanto kernel: ACPI Error: AE_ALREADY_EXISTS, During name lookup/catalog (20211217/psobject-220)
lug 15 20:18:19 garanto kernel: ACPI: Skipping parse of AML opcode: Method (0x0014)

...

# About 2940 lines of messages like:
ug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto systemd-journald[370]: Missed 14 kernel messages
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto systemd-journald[370]: Missed 29 kernel messages
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto systemd-journald[370]: Missed 25 kernel messages
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto systemd-journald[370]: Missed 22 kernel messages
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto systemd-journald[370]: Missed 24 kernel messages
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto systemd-journald[370]: Missed 20 kernel messages
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto systemd-journald[370]: Missed 20 kernel messages
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto systemd-journald[370]: Missed 24 kernel messages
lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: mc: intr 00000040
lug 15 20:18:31 garanto systemd-journald[370]: Missed 21 kernel messages


lug 15 20:18:31 garanto kernel: nouveau 0000:01:00.0: disp: chid 0 stat 00007082 reason 7 [UNRESOLVABLE_HANDLE] mthd 0208 data f0000000 code 00000000


# A lot of:

lug 15 20:19:33 garanto kernel: nouveau 0000:01:00.0: i2c: aux 0004: begin idle timeout ffffffff
lug 15 20:19:33 garanto kernel: nouveau 0000:01:00.0: i2c: aux 0004: begin idle timeout ffffffff
lug 15 20:19:33 garanto kernel: nouveau 0000:01:00.0: i2c: aux 0004: begin idle timeout ffffffff
lug 15 20:19:33 garanto kernel: nouveau 0000:01:00.0: i2c: aux 0004: begin idle timeout ffffffff
lug 15 20:19:33 garanto kernel: nouveau 0000:01:00.0: i2c: aux 0004: begin idle timeout ffffffff
lug 15 20:19:33 garanto kernel: nouveau 0000:01:00.0: i2c: aux 0004: begin idle timeout ffffffff


ug 15 20:21:16 garanto kernel: nouveau 0000:01:00.0: DRM: failed to idle channel 0 [DRM]
lug 15 20:21:16 garanto kernel: ------------[ cut here ]------------
lug 15 20:21:16 garanto kernel: nouveau 0000:01:00.0: disabling already-disabled device
lug 15 20:21:16 garanto kernel: WARNING: CPU: 15 PID: 109 at drivers/pci/pci.c:2196 pci_disable_device+0xd1/0x130
lug 15 20:21:16 garanto kernel: Modules linked in: ufs hfsplus hfs cdrom minix msdos jfs xfs ext4 mbcache jbd2 dm_mod rfcomm snd_seq_dummy snd_hrtimer snd_seq snd_seq_device ccm qrtr cmac algif_hash algif_skcipher af_alg bnep snd_soc_skl_hda_dsp snd_soc_intel_hda_dsp_common snd_soc_hdac_hdmi snd_sof_probes vfat fat snd_hda_codec_realtek snd_hda_codec_generic snd_soc_dmic snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_soc_hdac_hda iwlmvm snd_hda_ext_core btusb snd_soc_acpi_intel_match btrtl btbcm snd_soc_acpi hid_logitech_hidpp mac80211 soundwire_bus intel_tcc_cooling btintel mousedev ledtrig_audio btmtk x86_pkg_temp_thermal intel_powerclamp libarc4 coretemp iTCO_wdt bluetooth snd_soc_core spi_nor intel_pmc_bxt snd_compress kvm_intel mtd ecdh_generic hid_multitouch crc16 ee1004 ac97_bus iTCO_vendor_support iwlwifi snd_hda_codec_hdmi snd_pcm_dmaengine intel_rapl_msr
lug 15 20:21:16 garanto kernel:  pmt_telemetry pmt_class gpio_keys snd_hda_intel kvm iwlmei snd_intel_dspcfg irqbypass snd_intel_sdw_acpi crct10dif_pclmul snd_hda_codec processor_thermal_device_pci_legacy crc32_pclmul cfg80211 hid_logitech_dj snd_hda_core processor_thermal_device msi_wmi spi_intel_pci ghash_clmulni_intel aesni_intel crypto_simd joydev cryptd snd_hwdep intel_cstate intel_uncore psmouse wmi_bmof spi_intel processor_thermal_rfim snd_pcm rfkill sparse_keymap i2c_i801 processor_thermal_mbox r8169 mei tpm_crb snd_timer intel_lpss_pci processor_thermal_rapl i2c_smbus tpm_tis realtek snd intel_rapl_common intel_lpss mdio_devres tpm_tis_core i2c_hid_acpi libphy thunderbolt idma64 intel_vsec soundcore int3403_thermal intel_soc_dts_iosf tpm i2c_hid int340x_thermal_zone rng_core int3400_thermal mac_hid acpi_thermal_rel acpi_tad acpi_pad soc_button_array uinput ipmi_devintf ipmi_msghandler crypto_user fuse zram bpf_preload ip_tables x_tables btrfs blake2b_generic libcrc32c crc32c_generic xor raid6_pq
lug 15 20:21:16 garanto kernel:  uas usb_storage usbhid rtsx_pci_sdmmc mmc_core nvme rtsx_pci xhci_pci nvme_core xhci_pci_renesas serio_raw atkbd libps2 vivaldi_fmap i8042 serio radeon amdgpu gpu_sched intel_agp crc32c_intel i915 nouveau mxm_wmi wmi drm_buddy video drm_ttm_helper ttm drm_dp_helper intel_gtt
lug 15 20:21:16 garanto kernel: CPU: 15 PID: 109 Comm: kworker/15:0 Not tainted 5.18.3-zen1-1-zen #1 fefbaba02a89af0c18292541d3bc9730738e4aa5
lug 15 20:21:16 garanto kernel: Hardware name: Micro-Star International Co., Ltd. GE76 Raider 11UH/MS-17K3, BIOS E17K3IMS.11B 09/15/2021
lug 15 20:21:16 garanto kernel: Workqueue: pm pm_runtime_work
lug 15 20:21:16 garanto kernel: RIP: 0010:pci_disable_device+0xd1/0x130
lug 15 20:21:16 garanto kernel: Code: 48 85 ed 75 07 48 8b ab d0 00 00 00 48 8d bb d0 00 00 00 e8 51 61 18 00 48 89 ea 48 c7 c7 f8 ca f1 a3 48 89 c6 e8 20 b9 62 00 <0f> 0b e9 5d ff ff ff 31 c0 48 8d 54 24 06 be 04 00 00 00 48 89 df
lug 15 20:21:16 garanto kernel: RSP: 0018:ffffa929c04e3d30 EFLAGS: 00010282
lug 15 20:21:16 garanto kernel: RAX: 0000000000000000 RBX: ffff9b4cc2cdd000 RCX: 0000000000000027
lug 15 20:21:16 garanto kernel: RDX: ffff9b5c0bbe16a8 RSI: 0000000000000001 RDI: ffff9b5c0bbe16a0
lug 15 20:21:16 garanto kernel: RBP: ffff9b4cc2ca45d0 R08: 0000000000000001 R09: 00000000ffffffea
lug 15 20:21:16 garanto kernel: R10: ffffffffa465aa20 R11: 0000000000000001 R12: ffff9b4ccd698000
lug 15 20:21:16 garanto kernel: R13: 0000000000000003 R14: ffff9b4cc2cdd0d0 R15: 0000000000000000
lug 15 20:21:16 garanto kernel: FS:  0000000000000000(0000) GS:ffff9b5c0bbc0000(0000) knlGS:0000000000000000
lug 15 20:21:16 garanto kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
lug 15 20:21:16 garanto kernel: CR2: 00007f5eb6cc2000 CR3: 0000000c3ce10001 CR4: 0000000000f70ee0
lug 15 20:21:16 garanto kernel: PKRU: 55555554
lug 15 20:21:16 garanto kernel: Call Trace:
lug 15 20:21:16 garanto kernel:  <TASK>
lug 15 20:21:16 garanto kernel:  nouveau_pmops_runtime_suspend+0x50/0xb0 [nouveau 93e7fa97edfcf0fa60f32aff54e5e640eba8bb75]
lug 15 20:21:16 garanto kernel:  pci_pm_runtime_suspend+0x5d/0x180
lug 15 20:21:16 garanto kernel:  ? pci_dev_put+0x20/0x20
lug 15 20:21:16 garanto kernel:  ? pci_dev_put+0x20/0x20
lug 15 20:21:16 garanto kernel:  __rpm_callback+0x45/0x1c0
lug 15 20:21:16 garanto kernel:  ? pci_dev_put+0x20/0x20
lug 15 20:21:16 garanto kernel:  rpm_suspend+0x43a/0x970
lug 15 20:21:16 garanto kernel:  ? __schedule+0xb0d/0x12f0
lug 15 20:21:16 garanto kernel:  pm_runtime_work+0x98/0xb0
lug 15 20:21:16 garanto kernel:  process_one_work+0x252/0x410
lug 15 20:21:16 garanto kernel:  worker_thread+0x55/0x4d0
lug 15 20:21:16 garanto kernel:  ? process_one_work+0x410/0x410
lug 15 20:21:16 garanto kernel:  kthread+0x13c/0x160
lug 15 20:21:16 garanto kernel:  ? kthread_complete_and_exit+0x20/0x20
lug 15 20:21:16 garanto kernel:  ret_from_fork+0x1f/0x30
lug 15 20:21:16 garanto kernel:  </TASK>
lug 15 20:21:16 garanto kernel: ---[ end trace 0000000000000000 ]---

I'd be really thankful if anyone could give a hint on this. I'd really love to have this flavor on this laptop.

Thank you

Antonio

Are you booting with ibt=off ?

1 Like

I was not.
After enabling it nothing changed regarding niveau (same freeze for a couple of minutes right after logon) but after installing the nvidia driver again everything works like a charm, or at least It seems so.
I found this and if It's the real problem we've been lucky with 5.18: both nvidia drivers and samba client are broken apparently.

I had searched for nvidia issues with Qtile but It's pretty clear that I started from a bias, sorry.

Thank you so much for your help!

1 Like