NVIDIA driver fails to load

Lately the NVIDIA drivers have been loading only intermittently, and after the latest update they do not load at all.

I have three kernels installed and I tried them all: linux-zen, linux-xanmod, and linux-xanmod-edge.

There is a thread here with a similar problem

I just ran garuda-update and rebooted. Nvidia drivers do not load.

> sudo modprobe nvidia
modprobe: FATAL: Module nvidia not found in directory /lib/modules/6.2.5-x64v1-xanmod1-1

> sudo pacman -Q linux-zen
linux-zen 6.2.6.zen1-1.1

> sudo pacman -Q linux-xanmod
linux-xanmod 6.2.5-1

> sudo pacman -Q linux-xanmod-edge
linux-xanmod-edge 6.0.3-1

> sudo pacman -Qs nvidia
local/cuda 12.1.0-1
NVIDIA's GPU programming toolkit
local/garuda-nvidia-config 1:r63.2f652e8-3
Meta configuration package for nvidia systems on Garuda Linux
local/garuda-optimus-manager-config 1:r63.2f652e8-3
Meta configuration package for nvidia prime systems on Garuda Linux:
local/gwe 0.15.5-3.1
A system utility for controlling NVIDIA GPUs
local/lib32-libvdpau 1.5-1
Nvidia VDPAU library
local/lib32-nvidia-utils-tkg 515.76-222
NVIDIA driver utilities and libraries for 'nvidia-tkg' (32-bit)
local/lib32-opencl-nvidia-tkg 515.76-222
NVIDIA's OpenCL implemention for 'lib32-nvidia-utils-tkg'
local/libvdpau 1.5-1.1
Nvidia VDPAU library
local/libxnvctrl 525.89.02-1.1
local/mhwd-nvidia 525.60.11-1
mhwd-nvidia pci id
local/mhwd-nvidia-390xx 390.143-1
mhwd-nvidia-390xx PCI ID database
local/mhwd-nvidia-470xx 470.94-1
mhwd-nvidia-470xx PCI ID database
local/nvidia-dkms-tkg 515.76-222
NVIDIA kernel module sources (DKMS)
local/nvidia-egl-wayland-tkg 515.76-222
NVIDIA EGL Wayland library (libnvidia-egl-wayland.so.1.1.9) for 'nvidia-utils-tkg'
local/nvidia-prime 1.0-4
NVIDIA Prime Render Offload configuration and utilities
local/nvidia-settings-tkg 515.76-222
Tool for configuring the NVIDIA graphics driver
local/nvidia-utils-tkg 515.76-222
NVIDIA driver utilities and libraries for 'nvidia-tkg'
local/nvtop 3.0.1-1.1
GPUs process monitoring for AMD, Intel and NVIDIA
local/opencl-nvidia-tkg 515.76-222
NVIDIA's OpenCL implemention for 'nvidia-utils-tkg'
local/xf86-video-nouveau 1.0.17-2.1 (xorg-drivers)
Open Source 3D acceleration driver for nVidia cards

> garuda-inxi
Kernel: 6.2.5-x64v1-xanmod1-1 arch: x86_64 bits: 64 compiler: gcc v: 12.2.1
parameters: BOOT_IMAGE=/@/boot/vmlinuz-linux-xanmod
root=UUID=58203bda-e2e2-4c32-a006-c91d933cad4e rw rootflags=subvol=@
loglevel=3 quiet intel_iommu=on iommu=pt ibt=off
Desktop: KDE Plasma v: 5.27.3 tk: Qt v: 5.15.8 wm: kwin_x11 vt: 1 dm: SDDM
Distro: Garuda Linux base: Arch Linux
Type: Laptop System: Acer product: Predator PH315-53 v: V1.01
serial: <superuser required>
Mobo: CML model: QX50_CMS v: V1.01 serial: <superuser required>
UEFI: Insyde v: 1.01 date: 04/27/2020
ID-1: BAT1 charge: 42.4 Wh (100.0%) condition: 42.4/58.8 Wh (72.2%)
volts: 16.6 min: 15.4 model: SMP AP18E7M type: Li-ion serial: <filter>
status: full
Info: model: Intel Core i7-10750H bits: 64 type: MT MCP arch: Comet Lake
gen: core 10 level: v3 note: check built: 2020 process: Intel 14nm family: 6
model-id: 0xA5 (165) stepping: 2 microcode: 0xF4
Topology: cpus: 1x cores: 6 tpc: 2 threads: 12 smt: enabled cache:
L1: 384 KiB desc: d-6x32 KiB; i-6x32 KiB L2: 1.5 MiB desc: 6x256 KiB
L3: 12 MiB desc: 1x12 MiB
Speed (MHz): avg: 1891 high: 2600 min/max: 800/5000 scaling:
driver: intel_pstate governor: powersave cores: 1: 900 2: 2600 3: 2600
4: 2600 5: 900 6: 2600 7: 897 8: 2600 9: 2600 10: 2600 11: 900 12: 900
bogomips: 62399
Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
Vulnerabilities: <filter>
Device-1: Intel CometLake-H GT2 [UHD Graphics] vendor: Acer Incorporated ALI
driver: i915 v: kernel arch: Gen-9.5 process: Intel 14nm built: 2016-20
ports: active: eDP-1 empty: HDMI-A-1 bus-ID: 00:02.0 chip-ID: 8086:9bc4
class-ID: 0300
Device-2: NVIDIA TU106M [GeForce RTX 2060 Mobile]
vendor: Acer Incorporated ALI driver: N/A alternate: nvidiafb, nouveau
non-free: 525.xx+ status: current (as of 2023-02) arch: Turing code: TUxxx
process: TSMC 12nm FF built: 2018-22 pcie: gen: 3 speed: 8 GT/s lanes: 16
bus-ID: 01:00.0 chip-ID: 10de:1f15 class-ID: 0300
Device-3: Quanta HD User Facing type: USB driver: uvcvideo bus-ID: 2-5:3
chip-ID: 0408:a061 class-ID: 0e02
Display: x11 server: X.Org v: 21.1.7 with: Xwayland v: 22.1.8
compositor: kwin_x11 driver: X: loaded: intel unloaded: modesetting
alternate: fbdev,vesa dri: i965 gpu: i915 display-ID: :0 screens: 1
Screen-1: 0 s-res: 1920x1080 s-dpi: 96 s-size: 508x285mm (20.00x11.22")
s-diag: 582mm (22.93")
Monitor-1: eDP-1 mapped: eDP1 model: AU Optronics 0x82ed built: 2018
res: 1920x1080 hz: 144 dpi: 143 gamma: 1.2 size: 340x190mm (13.39x7.48")
diag: 394mm (15.5") ratio: 16:9 modes: 1920x1080
API: OpenGL v: 4.6 Mesa 22.3.6 renderer: Mesa Intel UHD Graphics (CML GT2)
direct-render: Yes
Device-1: Intel Comet Lake PCH cAVS vendor: Acer Incorporated ALI
driver: snd_hda_intel bus-ID: 2-1.3:6 v: kernel chip-ID: 08bb:29b0
alternate: snd_sof_pci_intel_cnl bus-ID: 00:1f.3 class-ID: 0300
chip-ID: 8086:06c8 class-ID: 0403
Device-2: NVIDIA TU106 High Definition Audio vendor: Acer Incorporated ALI
driver: snd_hda_intel v: kernel pcie: gen: 3 speed: 8 GT/s lanes: 16
bus-ID: 01:00.1 chip-ID: 10de:10f9 class-ID: 0403
Device-3: Texas Instruments PCM2900B Audio CODEC type: USB
driver: hid-generic,snd-usb-audio,usbhid
Sound API: ALSA v: k6.2.5-x64v1-xanmod1-1 running: yes
Sound Interface: sndio v: N/A running: no
Sound Server-1: PulseAudio v: 16.1 running: no
Sound Server-2: PipeWire v: 0.3.67 running: yes
Device-1: Intel Comet Lake PCH CNVi WiFi vendor: Rivet Networks
driver: iwlwifi v: kernel bus-ID: 00:14.3 chip-ID: 8086:06f0 class-ID: 0280
IF: wlp0s20f3 state: up mac: <filter>
Device-2: Realtek vendor: Acer Incorporated ALI driver: r8169 v: kernel
pcie: gen: 1 speed: 2.5 GT/s lanes: 1 port: 3000 bus-ID: 08:00.0
chip-ID: 10ec:2600 class-ID: 0200
IF: enp8s0 state: down mac: <filter>
IF-ID-1: virbr0 state: down mac: <filter>
IF-ID-2: virbr1 state: down mac: <filter>
Device-1: Intel AX201 Bluetooth type: USB driver: btusb v: 0.8
bus-ID: 2-14:5 chip-ID: 8087:0026 class-ID: e001
Report: bt-adapter ID: hci0 rfk-id: 1 state: down
bt-service: enabled,running rfk-block: hardware: no software: yes
address: <filter>
Local Storage: total: 3.19 TiB used: 1.21 TiB (38.0%)
SMART Message: Unable to run smartctl. Root privileges required.
ID-1: /dev/nvme0n1 maj-min: 259:2 vendor: Western Digital
model: WD Blue SN570 2TB size: 1.82 TiB block-size: physical: 512 B
logical: 512 B speed: 31.6 Gb/s lanes: 4 type: SSD serial: <filter>
rev: 234200WD temp: 60.9 C scheme: GPT
ID-2: /dev/nvme1n1 maj-min: 259:0 vendor: Western Digital model: PC SN730
SDBQNTY-512G-1014 size: 476.94 GiB block-size: physical: 512 B
logical: 512 B speed: 31.6 Gb/s lanes: 4 type: SSD serial: <filter>
rev: 11101100 temp: 24.9 C scheme: GPT
ID-3: /dev/sda maj-min: 8:0 vendor: HGST (Hitachi) model: HTS721010A9E630
size: 931.51 GiB block-size: physical: 4096 B logical: 512 B speed: 6.0 Gb/s
type: HDD rpm: 7200 serial: <filter> rev: A3J0 scheme: GPT
ID-1: / raw-size: 1.81 TiB size: 1.81 TiB (100.00%) used: 486.32 GiB (26.3%)
fs: btrfs dev: /dev/nvme0n1p5 maj-min: 259:5
ID-2: /boot/efi raw-size: 100 MiB size: 96 MiB (96.00%)
used: 51.2 MiB (53.4%) fs: vfat dev: /dev/nvme0n1p1 maj-min: 259:3
ID-3: /home raw-size: 1.81 TiB size: 1.81 TiB (100.00%)
used: 486.32 GiB (26.3%) fs: btrfs dev: /dev/nvme0n1p5 maj-min: 259:5
ID-4: /var/log raw-size: 1.81 TiB size: 1.81 TiB (100.00%)
used: 486.32 GiB (26.3%) fs: btrfs dev: /dev/nvme0n1p5 maj-min: 259:5
ID-5: /var/tmp raw-size: 1.81 TiB size: 1.81 TiB (100.00%)
used: 486.32 GiB (26.3%) fs: btrfs dev: /dev/nvme0n1p5 maj-min: 259:5
Kernel: swappiness: 133 (default 60) cache-pressure: 50 (default 100)
ID-1: swap-1 type: zram size: 15.45 GiB used: 0 KiB (0.0%) priority: 100
dev: /dev/zram0
ID-2: swap-2 type: partition size: 11.72 GiB used: 0 KiB (0.0%)
priority: -2 dev: /dev/nvme0n1p2 maj-min: 259:4
System Temperatures: cpu: 56.0 C pch: 74.0 C mobo: N/A
Fan Speeds (RPM): N/A
Processes: 333 Uptime: 11m wakeups: 1 Memory: 15.45 GiB
used: 3.31 GiB (21.4%) Init: systemd v: 253 default: graphical
tool: systemctl Compilers: gcc: 12.2.1 alt: 11 clang: 15.0.7 Packages:
pm: pacman pkgs: 2092 libs: 552 tools: octopi,pamac,paru Shell: fish
v: 3.6.0 default: Bash v: 5.1.16 running-in: konsole inxi: 3.3.25
Garuda (2.6.15-1):
System install date:     2021-11-24
Last full system update: 2023-03-17
Is partially upgraded:   No
Relevant software:       snapper NetworkManager mkinitcpio
Windows dual boot:       Probably (Run as root to verify)
Failed units:            systemd-oomd.socket

I just rebooted into linux-xanmod-edge and this time the driver loaded. Sometimes I need to reboot several times before the video driver loads.

huh... xanmod-edge is older than xanmod ??
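A quick way to double-check the ordering is to sort the two version strings from the pacman output above. This is only a sketch: `older_version` is a hypothetical helper, and it relies on GNU `sort -V`, which approximates but is not identical to pacman's own comparison (on an Arch-based system, pacman's `vercmp` would be the more faithful tool).

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the older of two version strings.
# Uses GNU sort's version ordering, an approximation of pacman's rules.
older_version() {
    printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1
}

# Versions reported above for linux-xanmod and linux-xanmod-edge.
older_version 6.2.5-1 6.0.3-1   # prints 6.0.3-1
```

So the installed linux-xanmod-edge (6.0.3-1) really is behind linux-xanmod (6.2.5-1).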

I see you have some nvidia stuff related to tkg, but are using a different kernel than linux-tkg. I've never used it so I can't say if that's supported or not.

I'm guessing you used nvidia-all? I can't seem to find nvidia-dkms-tkg anywhere else. If so, I'd try updating the local repo and running it again. There's a section in the readme about rerunning the script after a kernel update.
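Before rerunning anything, it may help to confirm that the module really was not built for the running kernel, which is what the modprobe error suggests. The following is a minimal sketch, not part of nvidia-all: `module_built_for` is a hypothetical helper, and the assumption is that DKMS places the built module somewhere under /lib/modules/<kernel>.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeed if <module>.ko (optionally compressed,
# e.g. .ko.zst) exists anywhere under <root>/<kernel>.
module_built_for() {
    local root=$1 kernel=$2 module=$3
    find "$root/$kernel" -name "${module}.ko*" -print -quit 2>/dev/null | grep -q .
}

kernel=$(uname -r)
if module_built_for /lib/modules "$kernel" nvidia; then
    echo "nvidia module present for $kernel"
else
    echo "nvidia module missing for $kernel"
    # If dkms is installed, list which module/kernel combinations it has built.
    if command -v dkms >/dev/null 2>&1; then
        dkms status || true
    fi
fi
```

If the module is missing only for some kernels, that points at DKMS not having rebuilt it after the kernel update rather than at the driver itself.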

Worse comes to worst, I'd say go back to the default nvidia packages and see if you can get it working again that way.

Yes, I used nvidia-all; otherwise I couldn't get the driver to load in any other kernel. I was looking for that script and couldn't find it again, thanks.

If xanmod-edge is older, I guess I'll get rid of it and use nvidia-all to solve the driver issue like before.

No problem. Did you check out the ArchWiki page for NVIDIA? The driver packages are tied fairly closely to the kernel: linux-lts uses the nvidia-lts package, for example. Most custom kernels would use nvidia-dkms, but it doesn't hurt to search around a bit for what to use with a specific kernel.
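The rule of thumb above can be written down as a small lookup. This is only an illustration of that rule: `nvidia_pkg_for` is a hypothetical helper, the `linux` to `nvidia` pairing is an assumption based on the stock Arch packages at the time, and it says nothing about the tkg packages installed in this thread.

```shell
#!/usr/bin/env bash
# Hypothetical helper: suggest a driver package for a given kernel package.
nvidia_pkg_for() {
    case "$1" in
        linux)     echo nvidia ;;       # assumption: stock kernel, prebuilt module
        linux-lts) echo nvidia-lts ;;   # LTS kernel, prebuilt module
        *)         echo nvidia-dkms ;;  # custom kernels: module built via DKMS
    esac
}

for k in linux-zen linux-xanmod linux-lts; do
    echo "$k -> $(nvidia_pkg_for "$k")"
done
```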

Also, maybe a silly question, but are you also installing the header packages for your kernels? You didn't include these in your output.
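Checking for the headers could look something like this. It is a sketch under the assumption that each kernel follows the usual Arch naming of `<kernel>-headers`; `headers_pkg` and `check_headers` are hypothetical helper names. DKMS needs the headers for a kernel in order to build the module for it.

```shell
#!/usr/bin/env bash
# Hypothetical helper: derive the headers package name from a kernel package
# name, assuming the usual "<kernel>-headers" convention.
headers_pkg() {
    printf '%s-headers\n' "$1"
}

# Report whether the headers package for each given kernel is installed.
# pacman -Q exits non-zero when the package is not installed.
check_headers() {
    local kernel
    for kernel in "$@"; do
        if pacman -Q "$(headers_pkg "$kernel")" >/dev/null 2>&1; then
            echo "$kernel: headers installed"
        else
            echo "$kernel: headers MISSING"
        fi
    done
}

# Only run the check where pacman is available.
if command -v pacman >/dev/null 2>&1; then
    check_headers linux-zen linux-xanmod linux-xanmod-edge
fi
```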


For what it's worth, I have a Zen 3 AMD CPU, and the AMD-Zen2/3 kernel works great with my RTX 2080 Ti using whatever packages Garuda supplies.

Edit: Derp on my part, the inxi output shows an Intel CPU. I'll still leave this in case it's helpful for anyone else.

I booted in xanmod, ran nvidia-all and uninstalled xanmod-edge.

Also brought my car to the dealership for a full service.

Now everything runs smooth as silk.

Except for the audio problems: YouTube videos randomly pause and resume, with volume changes. At least my car wipers don't go off randomly.

Is JamesDSP installed?

JamesDSP is not installed

