NVidia driver fails to load

Lately my NVIDIA drivers have been loading only intermittently, and after the latest update they do not load at all.

I have 3 kernels installed and I tried them all: linux-zen, linux-xanmod, linux-xanmod-edge

There is a thread here with a similar problem
https://bbs.archlinux.org/viewtopic.php?id=263188

I just ran garuda-update and rebooted. Nvidia drivers do not load.

> sudo modprobe nvidia
modprobe: FATAL: Module nvidia not found in directory /lib/modules/6.2.5-x64v1-xanmod1-1

> sudo pacman -Q linux-zen
linux-zen 6.2.6.zen1-1.1

> sudo pacman -Q linux-xanmod
linux-xanmod 6.2.5-1

> sudo pacman -Q linux-xanmod-edge
linux-xanmod-edge 6.0.3-1

> sudo pacman -Qs nvidia
local/cuda 12.1.0-1
NVIDIA's GPU programming toolkit
local/garuda-nvidia-config 1:r63.2f652e8-3
Meta configuration package for nvidia systems on Garuda Linux
local/garuda-optimus-manager-config 1:r63.2f652e8-3
Meta configuration package for nvidia prime systems on Garuda Linux:
local/gwe 0.15.5-3.1
A system utility for controlling NVIDIA GPUs
local/lib32-libvdpau 1.5-1
Nvidia VDPAU library
local/lib32-nvidia-utils-tkg 515.76-222
NVIDIA driver utilities and libraries for 'nvidia-tkg' (32-bit)
local/lib32-opencl-nvidia-tkg 515.76-222
NVIDIA's OpenCL implemention for 'lib32-nvidia-utils-tkg'
local/libvdpau 1.5-1.1
Nvidia VDPAU library
local/libxnvctrl 525.89.02-1.1
NVIDIA NV-CONTROL X extension
local/mhwd-nvidia 525.60.11-1
mhwd-nvidia pci id
local/mhwd-nvidia-390xx 390.143-1
mhwd-nvidia-390xx PCI ID database
local/mhwd-nvidia-470xx 470.94-1
mhwd-nvidia-470xx PCI ID database
local/nvidia-dkms-tkg 515.76-222
NVIDIA kernel module sources (DKMS)
local/nvidia-egl-wayland-tkg 515.76-222
NVIDIA EGL Wayland library (libnvidia-egl-wayland.so.1.1.9) for 'nvidia-utils-tkg'
local/nvidia-prime 1.0-4
NVIDIA Prime Render Offload configuration and utilities
local/nvidia-settings-tkg 515.76-222
Tool for configuring the NVIDIA graphics driver
local/nvidia-utils-tkg 515.76-222
NVIDIA driver utilities and libraries for 'nvidia-tkg'
local/nvtop 3.0.1-1.1
GPUs process monitoring for AMD, Intel and NVIDIA
local/opencl-nvidia-tkg 515.76-222
NVIDIA's OpenCL implemention for 'nvidia-utils-tkg'
local/xf86-video-nouveau 1.0.17-2.1 (xorg-drivers)
Open Source 3D acceleration driver for nVidia cards

> garuda-inxi
System:
Kernel: 6.2.5-x64v1-xanmod1-1 arch: x86_64 bits: 64 compiler: gcc v: 12.2.1
parameters: BOOT_IMAGE=/@/boot/vmlinuz-linux-xanmod
root=UUID=58203bda-e2e2-4c32-a006-c91d933cad4e rw rootflags=subvol=@
loglevel=3 quiet intel_iommu=on iommu=pt ibt=off
Desktop: KDE Plasma v: 5.27.3 tk: Qt v: 5.15.8 wm: kwin_x11 vt: 1 dm: SDDM
Distro: Garuda Linux base: Arch Linux
Machine:
Type: Laptop System: Acer product: Predator PH315-53 v: V1.01
serial: <superuser required>
Mobo: CML model: QX50_CMS v: V1.01 serial: <superuser required>
UEFI: Insyde v: 1.01 date: 04/27/2020
Battery:
ID-1: BAT1 charge: 42.4 Wh (100.0%) condition: 42.4/58.8 Wh (72.2%)
volts: 16.6 min: 15.4 model: SMP AP18E7M type: Li-ion serial: <filter>
status: full
CPU:
Info: model: Intel Core i7-10750H bits: 64 type: MT MCP arch: Comet Lake
gen: core 10 level: v3 note: check built: 2020 process: Intel 14nm family: 6
model-id: 0xA5 (165) stepping: 2 microcode: 0xF4
Topology: cpus: 1x cores: 6 tpc: 2 threads: 12 smt: enabled cache:
L1: 384 KiB desc: d-6x32 KiB; i-6x32 KiB L2: 1.5 MiB desc: 6x256 KiB
L3: 12 MiB desc: 1x12 MiB
Speed (MHz): avg: 1891 high: 2600 min/max: 800/5000 scaling:
driver: intel_pstate governor: powersave cores: 1: 900 2: 2600 3: 2600
4: 2600 5: 900 6: 2600 7: 897 8: 2600 9: 2600 10: 2600 11: 900 12: 900
bogomips: 62399
Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
Vulnerabilities: <filter>
Graphics:
Device-1: Intel CometLake-H GT2 [UHD Graphics] vendor: Acer Incorporated ALI
driver: i915 v: kernel arch: Gen-9.5 process: Intel 14nm built: 2016-20
ports: active: eDP-1 empty: HDMI-A-1 bus-ID: 00:02.0 chip-ID: 8086:9bc4
class-ID: 0300
Device-2: NVIDIA TU106M [GeForce RTX 2060 Mobile]
vendor: Acer Incorporated ALI driver: N/A alternate: nvidiafb, nouveau
non-free: 525.xx+ status: current (as of 2023-02) arch: Turing code: TUxxx
process: TSMC 12nm FF built: 2018-22 pcie: gen: 3 speed: 8 GT/s lanes: 16
bus-ID: 01:00.0 chip-ID: 10de:1f15 class-ID: 0300
Device-3: Quanta HD User Facing type: USB driver: uvcvideo bus-ID: 2-5:3
chip-ID: 0408:a061 class-ID: 0e02
Display: x11 server: X.Org v: 21.1.7 with: Xwayland v: 22.1.8
compositor: kwin_x11 driver: X: loaded: intel unloaded: modesetting
alternate: fbdev,vesa dri: i965 gpu: i915 display-ID: :0 screens: 1
Screen-1: 0 s-res: 1920x1080 s-dpi: 96 s-size: 508x285mm (20.00x11.22")
s-diag: 582mm (22.93")
Monitor-1: eDP-1 mapped: eDP1 model: AU Optronics 0x82ed built: 2018
res: 1920x1080 hz: 144 dpi: 143 gamma: 1.2 size: 340x190mm (13.39x7.48")
diag: 394mm (15.5") ratio: 16:9 modes: 1920x1080
API: OpenGL v: 4.6 Mesa 22.3.6 renderer: Mesa Intel UHD Graphics (CML GT2)
direct-render: Yes
Audio:
Device-1: Intel Comet Lake PCH cAVS vendor: Acer Incorporated ALI
driver: snd_hda_intel bus-ID: 2-1.3:6 v: kernel chip-ID: 08bb:29b0
alternate: snd_sof_pci_intel_cnl bus-ID: 00:1f.3 class-ID: 0300
chip-ID: 8086:06c8 class-ID: 0403
Device-2: NVIDIA TU106 High Definition Audio vendor: Acer Incorporated ALI
driver: snd_hda_intel v: kernel pcie: gen: 3 speed: 8 GT/s lanes: 16
bus-ID: 01:00.1 chip-ID: 10de:10f9 class-ID: 0403
Device-3: Texas Instruments PCM2900B Audio CODEC type: USB
driver: hid-generic,snd-usb-audio,usbhid
Sound API: ALSA v: k6.2.5-x64v1-xanmod1-1 running: yes
Sound Interface: sndio v: N/A running: no
Sound Server-1: PulseAudio v: 16.1 running: no
Sound Server-2: PipeWire v: 0.3.67 running: yes
Network:
Device-1: Intel Comet Lake PCH CNVi WiFi vendor: Rivet Networks
driver: iwlwifi v: kernel bus-ID: 00:14.3 chip-ID: 8086:06f0 class-ID: 0280
IF: wlp0s20f3 state: up mac: <filter>
Device-2: Realtek vendor: Acer Incorporated ALI driver: r8169 v: kernel
pcie: gen: 1 speed: 2.5 GT/s lanes: 1 port: 3000 bus-ID: 08:00.0
chip-ID: 10ec:2600 class-ID: 0200
IF: enp8s0 state: down mac: <filter>
IF-ID-1: virbr0 state: down mac: <filter>
IF-ID-2: virbr1 state: down mac: <filter>
Bluetooth:
Device-1: Intel AX201 Bluetooth type: USB driver: btusb v: 0.8
bus-ID: 2-14:5 chip-ID: 8087:0026 class-ID: e001
Report: bt-adapter ID: hci0 rfk-id: 1 state: down
bt-service: enabled,running rfk-block: hardware: no software: yes
address: <filter>
Drives:
Local Storage: total: 3.19 TiB used: 1.21 TiB (38.0%)
SMART Message: Unable to run smartctl. Root privileges required.
ID-1: /dev/nvme0n1 maj-min: 259:2 vendor: Western Digital
model: WD Blue SN570 2TB size: 1.82 TiB block-size: physical: 512 B
logical: 512 B speed: 31.6 Gb/s lanes: 4 type: SSD serial: <filter>
rev: 234200WD temp: 60.9 C scheme: GPT
ID-2: /dev/nvme1n1 maj-min: 259:0 vendor: Western Digital model: PC SN730
SDBQNTY-512G-1014 size: 476.94 GiB block-size: physical: 512 B
logical: 512 B speed: 31.6 Gb/s lanes: 4 type: SSD serial: <filter>
rev: 11101100 temp: 24.9 C scheme: GPT
ID-3: /dev/sda maj-min: 8:0 vendor: HGST (Hitachi) model: HTS721010A9E630
size: 931.51 GiB block-size: physical: 4096 B logical: 512 B speed: 6.0 Gb/s
type: HDD rpm: 7200 serial: <filter> rev: A3J0 scheme: GPT
Partition:
ID-1: / raw-size: 1.81 TiB size: 1.81 TiB (100.00%) used: 486.32 GiB (26.3%)
fs: btrfs dev: /dev/nvme0n1p5 maj-min: 259:5
ID-2: /boot/efi raw-size: 100 MiB size: 96 MiB (96.00%)
used: 51.2 MiB (53.4%) fs: vfat dev: /dev/nvme0n1p1 maj-min: 259:3
ID-3: /home raw-size: 1.81 TiB size: 1.81 TiB (100.00%)
used: 486.32 GiB (26.3%) fs: btrfs dev: /dev/nvme0n1p5 maj-min: 259:5
ID-4: /var/log raw-size: 1.81 TiB size: 1.81 TiB (100.00%)
used: 486.32 GiB (26.3%) fs: btrfs dev: /dev/nvme0n1p5 maj-min: 259:5
ID-5: /var/tmp raw-size: 1.81 TiB size: 1.81 TiB (100.00%)
used: 486.32 GiB (26.3%) fs: btrfs dev: /dev/nvme0n1p5 maj-min: 259:5
Swap:
Kernel: swappiness: 133 (default 60) cache-pressure: 50 (default 100)
ID-1: swap-1 type: zram size: 15.45 GiB used: 0 KiB (0.0%) priority: 100
dev: /dev/zram0
ID-2: swap-2 type: partition size: 11.72 GiB used: 0 KiB (0.0%)
priority: -2 dev: /dev/nvme0n1p2 maj-min: 259:4
Sensors:
System Temperatures: cpu: 56.0 C pch: 74.0 C mobo: N/A
Fan Speeds (RPM): N/A
Info:
Processes: 333 Uptime: 11m wakeups: 1 Memory: 15.45 GiB
used: 3.31 GiB (21.4%) Init: systemd v: 253 default: graphical
tool: systemctl Compilers: gcc: 12.2.1 alt: 11 clang: 15.0.7 Packages:
pm: pacman pkgs: 2092 libs: 552 tools: octopi,pamac,paru Shell: fish
v: 3.6.0 default: Bash v: 5.1.16 running-in: konsole inxi: 3.3.25
Garuda (2.6.15-1):
System install date:     2021-11-24
Last full system update: 2023-03-17
Is partially upgraded:   No
Relevant software:       snapper NetworkManager mkinitcpio
Windows dual boot:       Probably (Run as root to verify)
Failed units:            systemd-oomd.socket

I just rebooted into linux-xanmod-edge and this time it loaded. Sometimes I need to reboot multiple times before the video driver loads.

huh... xanmod-edge is older than xanmod ??
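
A quick way to confirm which of the two is older, as a rough sketch: `older_of` is a made-up helper name, and it relies on GNU `sort -V`, which is not guaranteed to match pacman's own version ordering in every edge case (pacman ships `vercmp` for that).

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the older of two version strings.
# Assumes GNU sort with -V (version sort) is available.
older_of() {
    printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1
}

# Versions taken from the pacman -Q output above:
# linux-xanmod 6.2.5-1 vs linux-xanmod-edge 6.0.3-1
older_of 6.2.5-1 6.0.3-1
```

With the versions posted above, this prints the linux-xanmod-edge version, so the "edge" package really is the older one here.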

I see you have some nvidia stuff related to tkg, but are using a different kernel than linux-tkg. I've never used it so I can't say if that's supported or not.

I'm guessing you used nvidia-all? I can't seem to find nvidia-dkms-tkg anywhere else. If so, I'd try updating the local repo and running again. There's a section in the readme about rerunning the script after a kernel update.
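
Before rerunning anything, it may help to check whether a module was actually built for the kernel you booted. This is only a sketch: `has_nvidia_module` is a name I made up, and I'm assuming the module file is called `nvidia.ko` (possibly compressed) somewhere under `/lib/modules/<release>`, which is where modprobe was looking in your error.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeed if an nvidia module exists for a kernel release.
# $1: kernel release (defaults to the running kernel)
# $2: modules root (defaults to /lib/modules; overridable for testing)
has_nvidia_module() {
    local release="${1:-$(uname -r)}"
    local root="${2:-/lib/modules}"
    # Built modules may be compressed (.ko.zst, .ko.xz), so match any suffix.
    find "$root/$release" -name 'nvidia.ko*' 2>/dev/null | grep -q .
}

if has_nvidia_module; then
    echo "nvidia module present for $(uname -r)"
else
    echo "no nvidia module for $(uname -r); the DKMS build probably needs rerunning"
fi

# If dkms is installed, list which kernels each module has been built for.
if command -v dkms >/dev/null 2>&1; then
    dkms status
fi
```

If the running kernel is missing from the `dkms status` output, that would match the "Module nvidia not found" error from modprobe.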

Worse comes to worst, I'd say go back to the default nvidia packages and see if you can get it working again that way.

Yes, I used nvidia-all; otherwise I couldn't get the driver to load on any other kernel. I was looking for that script and couldn't find it again, thanks.

If xanmod-edge is older, I guess I'll get rid of it and use nvidia-all to solve the driver issue like before.

No problem. Did you check out the ArchWiki page for NVIDIA? The driver packages are tied to specific kernels: linux-lts uses the nvidia-lts package, for example. Most custom kernels would use nvidia-dkms, but it doesn't hurt to search around a bit for what a specific kernel needs.

Also, maybe a silly question, but are you also installing the header packages for your kernels? You didn't include these in your output.
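
Something like this would show it at a glance. It's a sketch under one assumption: that each kernel's headers package is named by appending `-headers` to the kernel package name (the usual Arch convention; `headers_pkg_for` is just a helper name I made up).

```shell
#!/usr/bin/env bash
# Hypothetical helper: derive the headers package name from a kernel package
# name, assuming the usual "<kernel>-headers" naming convention.
headers_pkg_for() {
    printf '%s-headers\n' "$1"
}

# Kernel packages mentioned in this thread.
kernels="linux-zen linux-xanmod linux-xanmod-edge"

# Only query when pacman is actually available.
if command -v pacman >/dev/null 2>&1; then
    for k in $kernels; do
        h=$(headers_pkg_for "$k")
        if pacman -Q "$h" >/dev/null 2>&1; then
            echo "$h: installed"
        else
            echo "$h: MISSING (DKMS cannot build the module for $k)"
        fi
    done
fi
```

DKMS compiles the module against the headers of each kernel, so any kernel reported as missing its headers would not get an nvidia module.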

For what it's worth, I have a Zen 3 AMD CPU, and the AMD-Zen2/3 kernel works great with my RTX 2080 Ti using whatever packages Garuda supplies.

Edit: Derp on my part, the inxi shows CPU. I'll still leave this in case it's helpful for anyone else.

I booted into xanmod, ran nvidia-all, and uninstalled xanmod-edge.

Also brought my car to the dealership for a full service.

Now everything runs smooth as silk.

Except for the audio problems: YouTube videos randomly pause and play, with volume changes. At least my car wipers don't go off randomly.

Is JamesDSP installed?

JamesDSP is not installed
