Nv_drm_atomic_commit [nvidia_drm] *ERROR* after update of Garuda Linux

After upgrading Garuda Linux today, I ended up with an error during the next boot. I did the upgrade via the “add/remove software” app. I do this on a weekly basis.

Since I cannot post an image here, the screenshot of the error that I took can be seen in this post on Mastodon:
My post on Mastodon

Next thing I did was uninstalling the proprietary NVidia drivers, but then my system couldn’t boot anymore. It just froze during boot. After that I booted into recovery mode and restored my system using timeshift --restore.

I tried to update my system again, this time via the “Garuda Assistant” => “Garuda update”. This resulted in exactly the same error of the screenshot.

My PC now works fine again, but I cannot update.

I’m running the Gnome (45) edition and I’m on X11.
The kernel parameters configured are these:

quiet quiet rd.udev.log_priority=3 vt.global_cursor_default=0 loglevel=3 ibt=off initcall_blacklist=simpledrm_platform_driver_init # Modified by garuda-migrations: splash

The garuda-inxi output (after restore with Timeshift) is this:

System:
  Kernel: 6.6.1-zen1-1-zen arch: x86_64 bits: 64 compiler: gcc v: 13.2.1
    clocksource: tsc available: hpet,acpi_pm
    parameters: BOOT_IMAGE=/@/boot/vmlinuz-linux-zen
    root=UUID=a0b76a9f-8c4f-4fe9-8993-9c7eada6e661 rw rootflags=subvol=@
    quiet quiet rd.udev.log_priority=3 vt.global_cursor_default=0 loglevel=3
    ibt=off initcall_blacklist=simpledrm_platform_driver_init
  Desktop: GNOME v: 45.1 tk: GTK v: 3.24.38 wm: gnome-shell dm: GDM
    v: 45.0.1 Distro: Garuda Linux base: Arch Linux
Machine:
  Type: Desktop Mobo: ASUSTeK model: ROG STRIX Z390-E GAMING v: Rev 1.xx
    serial: <superuser required> UEFI: American Megatrends v: 1704
    date: 09/21/2020
CPU:
  Info: model: Intel Core i7-9700K bits: 64 type: MCP arch: Coffee Lake
    gen: core 9 level: v3 note: check built: 2018 process: Intel 14nm family: 6
    model-id: 0x9E (158) stepping: 0xC (12) microcode: 0xF4
  Topology: cpus: 1x cores: 8 smt: <unsupported> cache: L1: 512 KiB
    desc: d-8x32 KiB; i-8x32 KiB L2: 2 MiB desc: 8x256 KiB L3: 12 MiB
    desc: 1x12 MiB
  Speed (MHz): avg: 800 min/max: 800/4900 scaling: driver: intel_pstate
    governor: powersave cores: 1: 800 2: 800 3: 800 4: 800 5: 800 6: 800 7: 800
    8: 800 bogomips: 57600
  Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
  Vulnerabilities: <filter>
Graphics:
  Device-1: Intel CoffeeLake-S GT2 [UHD Graphics 630] vendor: ASUSTeK
    driver: i915 v: kernel arch: Gen-9.5 process: Intel 14nm built: 2016-20
    ports: active: none empty: DP-1,HDMI-A-1,HDMI-A-2 bus-ID: 00:02.0
    chip-ID: 8086:3e98 class-ID: 0380
  Device-2: NVIDIA TU104 [GeForce RTX 2080 Rev. A] vendor: Gigabyte
    driver: nvidia v: 545.29.02 alternate: nouveau,nvidia_drm non-free: 545.xx+
    status: current (as of 2023-10; EOL~2026-12-xx) arch: Turing code: TUxxx
    process: TSMC 12nm FF built: 2018-2022 pcie: gen: 3 speed: 8 GT/s
    lanes: 16 bus-ID: 01:00.0 chip-ID: 10de:1e87 class-ID: 0300
  Device-3: Logitech C920 HD Pro Webcam driver: snd-usb-audio,uvcvideo
    type: USB rev: 2.0 speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 1-2.2:7
    chip-ID: 046d:0892 class-ID: 0102 serial: <filter>
  Display: x11 server: X.Org v: 21.1.9 with: Xwayland v: 23.2.2
    compositor: gnome-shell driver: X: loaded: modesetting,nvidia
    alternate: fbdev,intel,nouveau,nv,vesa dri: iris gpu: i915,nvidia
    display-ID: :1 screens: 1
  Screen-1: 0 s-res: 5360x1440 s-dpi: 96 s-size: 1418x381mm (55.83x15.00")
    s-diag: 1468mm (57.81")
  Monitor-1: DP-4 pos: primary,top-left res: 3440x1440 dpi: 109
    size: 800x330mm (31.5x12.99") diag: 865mm (34.07") modes: N/A
  Monitor-2: HDMI-0 pos: bottom-r res: 1920x1080 dpi: 93
    size: 527x296mm (20.75x11.65") diag: 604mm (23.8") modes: N/A
  API: EGL v: 1.5 hw: drv: intel iris drv: nvidia platforms: device: 0
    drv: nvidia device: 2 drv: iris device: 3 drv: swrast gbm: drv: nvidia
    surfaceless: drv: nvidia x11: drv: nvidia inactive: wayland,device-1
  API: OpenGL v: 4.6.0 compat-v: 4.5 vendor: nvidia mesa v: 545.29.02
    glx-v: 1.4 direct-render: yes renderer: NVIDIA GeForce RTX 2080/PCIe/SSE2
    memory: 7.81 GiB
  API: Vulkan v: 1.3.269 layers: 5 device: 0 type: discrete-gpu
    name: NVIDIA GeForce RTX 2080 driver: nvidia v: 545.29.02
    device-ID: 10de:1e87 surfaces: xcb,xlib device: 1 type: integrated-gpu
    name: Intel UHD Graphics 630 (CFL GT2) driver: mesa intel
    v: 23.2.1-arch1.2 device-ID: 8086:3e98 surfaces: xcb,xlib device: 2
    type: cpu name: llvmpipe (LLVM 16.0.6 256 bits) driver: mesa llvmpipe
    v: 23.2.1-arch1.2 (LLVM 16.0.6) device-ID: 10005:0000 surfaces: xcb,xlib
Audio:
  Device-1: Intel Cannon Lake PCH cAVS vendor: ASUSTeK driver: snd_hda_intel
    v: kernel alternate: snd_soc_skl,snd_sof_pci_intel_cnl bus-ID: 00:1f.3
    chip-ID: 8086:a348 class-ID: 0403
  Device-2: NVIDIA TU104 HD Audio vendor: Gigabyte driver: snd_hda_intel
    v: kernel pcie: gen: 3 speed: 8 GT/s lanes: 16 bus-ID: 01:00.1
    chip-ID: 10de:10f8 class-ID: 0403
  Device-3: Logitech C920 HD Pro Webcam driver: snd-usb-audio,uvcvideo
    type: USB rev: 2.0 speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 1-2.2:7
    chip-ID: 046d:0892 class-ID: 0102 serial: <filter>
  Device-4: SteelSeries ApS Arctis Nova 7
    driver: hid-generic,snd-usb-audio,usbhid type: USB rev: 2.0 speed: 12 Mb/s
    lanes: 1 mode: 1.1 bus-ID: 1-3:4 chip-ID: 1038:2202 class-ID: 0300
  API: ALSA v: k6.6.1-zen1-1-zen status: kernel-api tools: N/A
  Server-1: sndiod v: N/A status: off tools: aucat,midicat,sndioctl
  Server-2: PipeWire v: 0.3.84 status: active with: 1: pipewire-pulse
    status: active 2: wireplumber status: active 3: pipewire-alsa type: plugin
    4: pw-jack type: plugin tools: pactl,pw-cat,pw-cli,wpctl
Network:
  Device-1: Intel Cannon Lake PCH CNVi WiFi driver: iwlwifi v: kernel
    bus-ID: 00:14.3 chip-ID: 8086:a370 class-ID: 0280
  IF: wlo1 state: down mac: <filter>
  Device-2: Intel Ethernet I219-V vendor: ASUSTeK driver: e1000e v: kernel
    port: N/A bus-ID: 00:1f.6 chip-ID: 8086:15bc class-ID: 0200
  IF: eno2 state: up speed: 1000 Mbps duplex: full mac: <filter>
  IF-ID-1: tailscale0 state: unknown speed: -1 duplex: full mac: N/A
Bluetooth:
  Device-1: Intel Bluetooth 9460/9560 Jefferson Peak (JfP) driver: btusb
    v: 0.8 type: USB rev: 2.0 speed: 12 Mb/s lanes: 1 mode: 1.1 bus-ID: 1-14:13
    chip-ID: 8087:0aaa class-ID: e001
  Report: btmgmt ID: hci0 rfk-id: 1 state: up address: <filter> bt-v: 5.1
    lmp-v: 10 status: discoverable: no pairing: no class-ID: 6c0104
Drives:
  Local Storage: total: 5.68 TiB used: 1.3 TiB (23.0%)
  SMART Message: Required tool smartctl not installed. Check --recommends
  ID-1: /dev/nvme0n1 maj-min: 259:3 vendor: Samsung model: SSD 970 EVO 1TB
    size: 931.51 GiB block-size: physical: 512 B logical: 512 B speed: 31.6 Gb/s
    lanes: 4 tech: SSD serial: <filter> fw-rev: 2B2QEXE7 temp: 34.9 C
    scheme: GPT
  ID-2: /dev/nvme1n1 maj-min: 259:0 vendor: Western Digital
    model: WDS100T2X0C-00L350 size: 931.51 GiB block-size: physical: 512 B
    logical: 512 B speed: 31.6 Gb/s lanes: 4 tech: SSD serial: <filter>
    fw-rev: 101110WD temp: 41.9 C scheme: GPT
  ID-3: /dev/sda maj-min: 8:0 vendor: Kingston model: SVP200S3240G
    size: 223.57 GiB block-size: physical: 512 B logical: 512 B speed: 6.0 Gb/s
    tech: SSD serial: <filter> fw-rev: BBF0 scheme: GPT
  ID-4: /dev/sdb maj-min: 8:16 vendor: Western Digital
    model: WD30EFRX-68AX9N0 size: 2.73 TiB block-size: physical: 4096 B
    logical: 512 B speed: 6.0 Gb/s tech: N/A serial: <filter> fw-rev: 0A80
    scheme: GPT
  ID-5: /dev/sdc maj-min: 8:32 vendor: Western Digital
    model: WD10EADS-00M2B0 size: 931.51 GiB block-size: physical: 512 B
    logical: 512 B speed: 3.0 Gb/s tech: N/A serial: <filter> fw-rev: 0A01
    scheme: MBR
Partition:
  ID-1: / raw-size: 921.51 GiB size: 921.51 GiB (100.00%)
    used: 522.81 GiB (56.7%) fs: btrfs dev: /dev/nvme1n1p2 maj-min: 259:2
  ID-2: /boot/efi raw-size: 1.39 GiB size: 1.39 GiB (99.80%)
    used: 27.2 MiB (1.9%) fs: vfat dev: /dev/sda1 maj-min: 8:1
  ID-3: /home raw-size: 921.51 GiB size: 921.51 GiB (100.00%)
    used: 522.81 GiB (56.7%) fs: btrfs dev: /dev/nvme1n1p2 maj-min: 259:2
  ID-4: /var/log raw-size: 921.51 GiB size: 921.51 GiB (100.00%)
    used: 522.81 GiB (56.7%) fs: btrfs dev: /dev/nvme1n1p2 maj-min: 259:2
  ID-5: /var/tmp raw-size: 921.51 GiB size: 921.51 GiB (100.00%)
    used: 522.81 GiB (56.7%) fs: btrfs dev: /dev/nvme1n1p2 maj-min: 259:2
Swap:
  Kernel: swappiness: 133 (default 60) cache-pressure: 100 (default) zswap: no
  ID-1: swap-1 type: zram size: 31.19 GiB used: 0 KiB (0.0%) priority: 100
    comp: zstd avail: lzo,lzo-rle,lz4,lz4hc,842 max-streams: 8 dev: /dev/zram0
Sensors:
  System Temperatures: cpu: 38.0 C mobo: N/A gpu: nvidia temp: 56 C
  Fan Speeds (rpm): N/A gpu: nvidia fan: 0%
Info:
  Processes: 318 Uptime: 6m wakeups: 0 Memory: total: 32 GiB
  available: 31.19 GiB used: 6.26 GiB (20.1%) Init: systemd v: 254
  default: graphical tool: systemctl Compilers: gcc: 13.2.1 clang: 16.0.6
  Packages: pm: pacman pkgs: 1489 libs: 415 tools: pamac,paru pm: appimage
  pkgs: 0 Shell: fish v: 3.6.1 default: Bash v: 5.2.15
  running-in: gnome-terminal inxi: 3.3.31
Garuda (2.6.19-2):
  System install date:     2023-07-26
  Last full system update: 2023-11-30
  Is partially upgraded:   No
  Relevant software:       timeshift NetworkManager dracut nvidia-dkms
  Windows dual boot:       Probably (Run as root to verify)
  Failed units:            dev-disk-by-diskseq-5-part1.swap 

If anyone has an idea what could be causing this, any help is greatly appreciated.

(And yes, my machine is set up as dual boot with Windows 11)

After the error that had to do with the Nvidia DRM flipping on boot, did the system stop at that message or was it just really slow to boot?
(Issues about this error can be a bit tricky, one solution seems to not fix all types of installations; I assume the difference has to do with what DE you might be trying to use and if you are on X11/Wayland.)

If the system stops on the message about Nvidia’s DRM flipping not working, it looks like the initcall_blacklist=simpledrm_platform_driver_init kernel parameter might be blocking the DRM (Direct Rendering Manager). (I am assuming something weird has be going on with Nvidia’s drivers lately because I’ve noticed it seems to like to spew out blue text at startup now). From what I understand, you typically do not need Nvidia’s DRM on in an X11 session, but it might be different with Gnome with the new driver.

I would try removing initcall_blacklist=simpledrm_platform_driver_init kernel parameter and maybe adding nvidia-drm.modeset=1 in its place if you are facing a system halt after that error message shows up. You can change kernel parameters in the Garuda Boot Options program. Make sure to do this before you attempt the next garuda-update.
Edit: also stick with the Nvidia drivers you have going at the moment (the one after the restore of Timeshift that your garuda-inxi says you have)
Edit2: Wanted to link Nvidia DRM information here for future referance NVIDIA - ArchWiki
Edit3: If what I said above does not work, you can try what is reported here *ERROR* [nvidia-drm] [GPU ID 0x00000100] Flip event timeout on head 0

2 Likes

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.