Nvidia-settings ERROR: NVIDIA driver is not loaded

Greetings and Salutations Garuda Friends,

I have happily been using Garuda for 9 months now. Thank you! I am having an issue with the nvidia driver being installed, but not it's not loading or functioning. I think perhaps the kernel module did not get built, and then installed by dkms. Here is my garuda-inxi:

System:
Kernel: 5.15.62-1-lts arch: x86_64 bits: 64 compiler: gcc v: 12.2.0
parameters: BOOT_IMAGE=/vmlinuz-linux-lts
root=UUID=272ad521-3b3f-459e-a9a0-0ce5504a8d22 rw [email protected]
quiet quiet splash rd.udev.log_priority=3 vt.global_cursor_default=0
loglevel=3
Desktop: KDE Plasma v: 5.25.4 tk: Qt v: 5.15.5 info: latte-dock
wm: kwin_x11 vt: 1 dm: SDDM Distro: Garuda Linux base: Arch Linux
Machine:
Type: Laptop System: Micro-Star product: GE76 Raider 11UH v: REV:1.0
serial: <superuser required> Chassis: type: 10 serial: <superuser required>
Mobo: Micro-Star model: MS-17K3 v: REV:1.0 serial: <superuser required>
UEFI: American Megatrends LLC. v: E17K3IMS.11D date: 01/10/2022
Battery:
ID-1: BAT1 charge: 66.8 Wh (75.5%) condition: 88.5/95.0 Wh (93.2%)
volts: 15.1 min: 15.2 model: MSI BIF0_9 type: Li-ion serial: N/A
status: discharging
CPU:
Info: model: 11th Gen Intel Core i7-11800H bits: 64 type: MT MCP
arch: Tiger Lake gen: core 11 built: 2020 process: Intel 10nm family: 6
model-id: 0x8D (141) stepping: 1 microcode: 0x40
Topology: cpus: 1x cores: 8 tpc: 2 threads: 16 smt: enabled cache:
L1: 640 KiB desc: d-8x48 KiB; i-8x32 KiB L2: 10 MiB desc: 8x1.2 MiB
L3: 24 MiB desc: 1x24 MiB
Speed (MHz): avg: 979 high: 3526 min/max: 800/4600 scaling:
driver: intel_pstate governor: powersave cores: 1: 800 2: 800 3: 800
4: 3526 5: 802 6: 879 7: 863 8: 800 9: 800 10: 800 11: 801 12: 801
13: 800 14: 800 15: 801 16: 801 bogomips: 73728
Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
Vulnerabilities:
Type: itlb_multihit status: Not affected
Type: l1tf status: Not affected
Type: mds status: Not affected
Type: meltdown status: Not affected
Type: mmio_stale_data status: Not affected
Type: retbleed status: Not affected
Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
prctl and seccomp
Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
sanitization
Type: spectre_v2 mitigation: Enhanced IBRS, IBPB: conditional, RSB
filling, PBRSB-eIBRS: SW sequence
Type: srbds status: Not affected
Type: tsx_async_abort status: Not affected
Graphics:
Device-1: Intel TigerLake-H GT1 [UHD Graphics] vendor: Micro-Star MSI
driver: i915 v: kernel arch: Gen-12.1 process: Intel 10nm built: 2020-21
ports: active: DP-2 off: eDP-1 empty: DP-1 bus-ID: 00:02.0
chip-ID: 8086:9a60 class-ID: 0300
Device-2: NVIDIA GA104M [GeForce RTX 3080 Mobile / Max-Q 8GB/16GB]
vendor: Micro-Star MSI driver: N/A alternate: nouveau non-free: 515.xx+
status: current (as of 2022-07) arch: Ampere code: GAxxx process: TSMC n7
(7nm) built: 2020-22 pcie: gen: 4 speed: 16 GT/s lanes: 16
bus-ID: 01:00.0 chip-ID: 10de:249c class-ID: 0300
Display: x11 server: X.Org v: 21.1.4 with: Xwayland v: 22.1.3
compositor: kwin_x11 driver: X: loaded: modesetting
alternate: fbdev,intel,vesa gpu: i915 display-ID: :0 screens: 1
Screen-1: 0 s-res: 3840x2160 s-dpi: 96 s-size: 1015x571mm (39.96x22.48")
s-diag: 1165mm (45.85")
Monitor-1: DP-2 pos: primary model: LG (GoldStar) HDR 4K serial: <filter>
built: 2020 res: 3840x2160 hz: 60 dpi: 140 gamma: 1.2
size: 697x392mm (27.44x15.43") diag: 806mm (31.7") ratio: 16:9 modes:
max: 3840x2160 min: 640x480
Monitor-2: eDP-1 note: disabled model: AU Optronics 0xe295 built: 2020
res: 3840x2160 dpi: 128 gamma: 1.2 size: 382x215mm (15.04x8.46")
diag: 438mm (17.3") ratio: 16:9 modes: 1920x1080
OpenGL: renderer: Mesa Intel UHD Graphics (TGL GT1) v: 4.6 Mesa 22.1.6
direct render: Yes
Audio:
Device-1: Intel Tiger Lake-H HD Audio vendor: Micro-Star MSI
driver: sof-audio-pci-intel-tgl
alternate: snd_hda_intel,snd_sof_pci_intel_tgl bus-ID: 00:1f.3
chip-ID: 8086:43c8 class-ID: 0401
Device-2: NVIDIA GA104 High Definition Audio vendor: Micro-Star MSI
driver: snd_hda_intel v: kernel pcie: gen: 4 speed: 16 GT/s lanes: 16
bus-ID: 01:00.1 chip-ID: 10de:228b class-ID: 0403
Sound Server-1: ALSA v: k5.15.62-1-lts running: yes
Sound Server-2: PulseAudio v: 16.1 running: no
Sound Server-3: PipeWire v: 0.3.56 running: yes
Network:
Device-1: Realtek Killer E3000 2.5GbE vendor: Micro-Star MSI driver: r8169
v: kernel pcie: gen: 2 speed: 5 GT/s lanes: 1 port: 4000 bus-ID: 2f:00.0
chip-ID: 10ec:3000 class-ID: 0200
IF: enp47s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
Device-2: Intel Wi-Fi 6 AX210/AX211/AX411 160MHz vendor: Rivet Networks
driver: iwlwifi v: kernel pcie: gen: 2 speed: 5 GT/s lanes: 1
bus-ID: 31:00.0 chip-ID: 8086:2725 class-ID: 0280
IF: wlp49s0 state: down mac: <filter>
Bluetooth:
Device-1: Intel AX210 Bluetooth type: USB driver: btusb v: 0.8
bus-ID: 3-14:9 chip-ID: 8087:0032 class-ID: e001
Report: bt-adapter ID: hci0 rfk-id: 1 state: up address: <filter>
Drives:
Local Storage: total: 2.79 TiB used: 232.65 GiB (8.1%)
SMART Message: Unable to run smartctl. Root privileges required.
ID-1: /dev/nvme0n1 maj-min: 259:9 vendor: Micron
model: 3400 MTFDKBA1T0TFH size: 953.87 GiB block-size: physical: 512 B
logical: 512 B speed: 63.2 Gb/s lanes: 4 type: SSD serial: <filter>
rev: P7MA000 temp: 41.9 C scheme: GPT
ID-2: /dev/nvme1n1 maj-min: 259:0 model: PCIe SSD size: 1.86 TiB
block-size: physical: 512 B logical: 512 B speed: 31.6 Gb/s lanes: 4
type: SSD serial: <filter> rev: ECFM22.6 temp: 26.9 C scheme: GPT
Partition:
ID-1: / raw-size: 66.4 GiB size: 66.4 GiB (100.00%) used: 968.6 MiB (1.4%)
fs: btrfs dev: /dev/nvme1n1p1 maj-min: 259:1
ID-2: /boot raw-size: 6.6 GiB size: 6.6 GiB (100.00%) used: 257.3 MiB
(3.8%) fs: btrfs dev: /dev/nvme1n1p2 maj-min: 259:2
ID-3: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%) used: 608 KiB
(0.2%) fs: vfat dev: /dev/nvme1n1p3 maj-min: 259:3
ID-4: /home raw-size: 150 GiB size: 150 GiB (100.00%) used: 116.1 MiB
(0.1%) fs: btrfs dev: /dev/nvme1n1p4 maj-min: 259:4
ID-5: /opt raw-size: 1.04 TiB size: 1.04 TiB (100.00%) used: 224.4 GiB
(21.2%) fs: btrfs dev: /dev/nvme1n1p8 maj-min: 259:8
ID-6: /usr raw-size: 53.2 GiB size: 53.2 GiB (100.00%) used: 6.79 GiB
(12.8%) fs: btrfs dev: /dev/nvme1n1p5 maj-min: 259:5
ID-7: /var raw-size: 8.6 GiB size: 8.6 GiB (100.00%) used: 148 MiB (1.7%)
fs: btrfs dev: /dev/nvme1n1p6 maj-min: 259:6
ID-8: /var/log raw-size: 66.4 GiB size: 66.4 GiB (100.00%) used: 968.6
MiB (1.4%) fs: btrfs dev: /dev/nvme1n1p1 maj-min: 259:1
ID-9: /var/tmp raw-size: 66.4 GiB size: 66.4 GiB (100.00%) used: 968.6
MiB (1.4%) fs: btrfs dev: /dev/nvme1n1p1 maj-min: 259:1
Swap:
Kernel: swappiness: 133 (default 60) cache-pressure: 100 (default)
ID-1: swap-1 type: zram size: 31.06 GiB used: 0 KiB (0.0%) priority: 100
dev: /dev/zram0
Sensors:
System Temperatures: cpu: 45.0 C mobo: N/A
Fan Speeds (RPM): N/A
Info:
Processes: 408 Uptime: 1h 34m wakeups: 3 Memory: 31.06 GiB used: 5.24 GiB
(16.9%) Init: systemd v: 251 default: graphical tool: systemctl
Compilers: gcc: 12.2.0 Packages: pacman: 1228 lib: 309 Shell: fish v: 3.5.1
default: Bash v: 5.1.16 running-in: konsole inxi: 3.3.20
Garuda (2.6.6-1):
System install date:     2022-08-23
Last full system update: 2022-08-23 ↻
Is partially upgraded:   No
Relevant software:       NetworkManager
Windows dual boot:       Probably (Run as root to verify)
Snapshots:               Snapper
Failed units:

Some System Configuration info:

I verified that "Secure Boot" is disabled in the BIOS.
There were a number of reboots of my laptop involved here.
I did a brand new installation of Garuda using the latest .iso, and then fully updated with 'garuda-update'. Reboot.
I then installed the linux-lts kernel and headers, which is the kernel I would like to use at this time. Reboot.
Then, I went to 'Garuda Settings Manager' --> 'Hardware Configuration', right-clicked on 'video-nvidia-prime-render-offload' and chose 'Install'. Reboot.
After having installed the Nvidia Driver and then discovering that it was not functioning, I went back to the 'Garuda Settings Manager' --> 'Hardware Configuration', right-clicked on 'video-nvidia-prime-render-offload' and chose 'Reinstall'. Reboot. But there was no change, the issue still persists.

Here are some commands and results for you to examine:

'nvidia-settings'
ERROR: NVIDIA driver is not loaded

'nvidia-smi'
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

'modprobe nvidia'
modprobe: FATAL: Module nvidia not found in directory /lib/modules/5.15.62-1-lts

'pacman -Qs nvidia'
local/egl-wayland 2:1.1.10-1
EGLStream-based Wayland external platform
local/garuda-nvidia-config 1:r62.889aefa-7
Meta configuration package for nvidia systems on Garuda Linux
local/garuda-nvidia-prime-config 1:r62.889aefa-7
Meta configuration package for nvidia prime systems on Garuda Linux:
local/gwe 0.15.5-3
A system utility for controlling NVIDIA GPUs
local/lib32-libvdpau 1.5-1
Nvidia VDPAU library
local/lib32-nvidia-utils 515.65.01-1
NVIDIA drivers utilities (32-bit)
local/lib32-opencl-nvidia 515.65.01-1
OpenCL implemention for NVIDIA (32-bit)
local/libvdpau 1.5-1
Nvidia VDPAU library
local/libxnvctrl 515.65.01-1
NVIDIA NV-CONTROL X extension
local/mhwd-nvidia 515.43.04-2
mhwd-nvidia pci id
local/mhwd-nvidia-390xx 390.143-1
mhwd-nvidia-390xx PCI ID database
local/mhwd-nvidia-470xx 470.94-1
mhwd-nvidia-470xx PCI ID database
local/nvidia-dkms 515.65.01-2
NVIDIA drivers - module sources
local/nvidia-prime 1.0-4
NVIDIA Prime Render Offload configuration and utilities
local/nvidia-settings 515.65.01-1
Tool for configuring the NVIDIA graphics driver
local/nvidia-utils 515.65.01-2
NVIDIA drivers utilities
local/opencl-nvidia 515.65.01-2
OpenCL implemention for NVIDIA
local/xf86-video-nouveau 1.0.17-2 (xorg-drivers)
Open Source 3D acceleration driver for nVidia cards

'mhwd -li'
> Installed PCI configs:
--------------------------------------------------------------------------------
NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
video-linux            2021.08.29                true            PCI
video-nvidia-prime-render-offload            2022.08.17               false            PCI


Warning: No installed USB configs!

'dkms status'
nvidia/515.65.01: added

Regarding that last command, it shows as "nvidia/515.65.01: added". I was wondering if it should also show as "built/installed"?

I sure appreciate anyone taking the time to have a look at this issue. Thank you.

P.S. I have enjoyed Garuda so much that I have happily given a few donations since I started using it. Thanks again and keep up the great work!

1 Like

Could you try adding ibt=off to your kernel parameters? This shouldn't be necessary because you use LTS kernel but it is still worth trying.

This is not the full output. Please don't trim or truncate output.

Run dkms autoinstall and read the output carefully.

3 Likes

Dear jonathon,

Your recommnded command 'dkms autoinstall' (which had to be done as sudo/root) appears to have delivered the proper fix for the issue. I will need to reboot to be certain.

Here is the output:

[[email protected] ~]#dkms autoinstall
Sign command: /usr/lib/modules/5.15.62-1-lts/build/scripts/sign-file
Signing key: /var/lib/dkms/mok.key
Public certificate (MOK): /var/lib/dkms/mok.pub

Building module:
Cleaning build area...
'make' -j16 IGNORE_PREEMPT_RT_PRESENCE=1 NV_EXCLUDE_BUILD_MODULES='__EXCLUDE_MODULES' KERNEL_UNAME=5.15.62-1-lts modules........
Signing module /var/lib/dkms/nvidia/515.65.01/build/nvidia.ko
Signing module /var/lib/dkms/nvidia/515.65.01/build/nvidia-uvm.ko
Signing module /var/lib/dkms/nvidia/515.65.01/build/nvidia-modeset.ko
Signing module /var/lib/dkms/nvidia/515.65.01/build/nvidia-drm.ko
Signing module /var/lib/dkms/nvidia/515.65.01/build/nvidia-peermem.ko
Cleaning build area...

nvidia.ko.zst:
Running module version sanity check.
- Original module
- No original module exists within this kernel
- Installation
- Installing to /usr/lib/modules/5.15.62-1-lts/updates/dkms/

nvidia-uvm.ko.zst:
Running module version sanity check.
- Original module
- No original module exists within this kernel
- Installation
- Installing to /usr/lib/modules/5.15.62-1-lts/updates/dkms/

nvidia-modeset.ko.zst:
Running module version sanity check.
- Original module
- No original module exists within this kernel
- Installation
- Installing to /usr/lib/modules/5.15.62-1-lts/updates/dkms/

nvidia-drm.ko.zst:
Running module version sanity check.
- Original module
- No original module exists within this kernel
- Installation
- Installing to /usr/lib/modules/5.15.62-1-lts/updates/dkms/

nvidia-peermem.ko.zst:
Running module version sanity check.
- Original module
- No original module exists within this kernel
- Installation
- Installing to /usr/lib/modules/5.15.62-1-lts/updates/dkms/
depmod...

Thank you very much!

There is something elese I would like to address....

One thing that we all know is our own experiences. On my system and linux installation, the command 'dkms status' showed exactly what I posted, nothing more and nothing less. And it didn't matter whether it was run as regular user or as root user. I did not truncate anything. Please keep in mind that you do not know what I have experienced, only what you have experienced. I do not appreciate being accused of something that I have not done or am not doing. I hope you can understand that. Anyway...

Again, thank you jonathon! Take care and cheers

That would not be output that I'm used to - dkms status should show the kernel version too, e.g.

nvidia/515.49.10, 5.19.3-lqx1-1-lqx, x86_64: installed
nvidia/515.49.10, 5.19.3-zen1-1-zen, x86_64: installed

I guess "added" is some intermediate broken state...

1 Like

Looks like DKMS itself is broken:

Install regular linux kernel and the precompiled driver for it:

sudo pacman -S linux linux-headers nvidia

then reboot and choose "linux" kernel, not "linux-lts"

The problem was fixed by triggering a DKMS moduyle build - there's no need to install another kernel and the pre-compiled driver module (which will conflict with nvidia-dkms and so break NVIDIA for the other kernels)

Well jonathon, from the looks of the result of your command, I thought the fix was in. But no, the issue persists. However, the 'dkms status' command now has full output and states that it is now 'installed' rather than 'added'.

'nvidia-settings'
ERROR: NVIDIA driver is not loaded

'nvidia-smi'
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

[[email protected] ~]# modprobe nvidia
modprobe: ERROR: could not insert 'nvidia': Invalid argument

'dkms status'
nvidia/515.65.01, 5.15.62-1-lts, x86_64: installed

Thank you for your efforts

Hello mrvictory,

The reason I am using the LTS Kernel is because there has been some "issues" on my end ever since the introduction of the 5.18.x kernel.

Here is what I will do. I already have the latest Zen Kernel installed. I will boot into it, which works ok. But I have no 3D Accelerated output when using that kernel, and when I exit Steam there is a strange issue. We will see what happens with this new installation I am running, but I expect to have the same problem(s) I was having the last time I tried using it.

I will let you know what happens fairly soon. Thank you for trying to help!

Then run this:

sudo pacman -S --needed linux-lts-headers nvidia-lts

Hello again mrvictory,

I was just looking over that bug report. After getting the kernel module built thanks to jonathon's help, this bug report looks like it is the bug I have encountered.

Now that the kernel module is installed, when I do 'modprobe nvidia' as root, I get...

[[email protected] ~]# modprobe nvidia
modprobe: ERROR: could not insert 'nvidia': Invalid argument

I will need to take some time to try the bug reporter's[Matt M. (9numbernine9)] script/fix.

Cheers

The fix was to update pahole, so just update your system.

2 Likes

Ok jonathon,

I will do that and be right back. Thanks

Isn't it downgrade pahole?

1 Like

OT :smiley:

egal = no matter :grin:

2 Likes

Hello again folks,

The issue still persists.

You are correct mrvictory, it was a downgrade. pahole went from version 1.24 to 1.23.


I am wondering if I need to uninstall dkms nvidia kernel module, and build it again since the change to the pahole package.

It looks like Arch devs did timely fix the bug​:partying_face:
I am glad I didn't set off the alarms this time :sweat_smile:

1 Like

Howdy,

It's getting a little hairy, but nothing I can't handle. Here's what's going on at this time:

After the downgrade to pahole, I rebooted.
Then I ran the following command as root...

[[email protected] pkg]# dkms remove nvidia/515.65.01
Module nvidia-515.65.01 for kernel 5.15.62-1-lts (x86_64).
Before uninstall, this module version was ACTIVE on this kernel.

nvidia.ko.zst:
- Uninstallation
- Deleting from: /usr/lib/modules/5.15.62-1-lts/updates/dkms/
- Original module
- No original module was found for this module on this kernel.
- Use the dkms install command to reinstall any previous module version.

nvidia-uvm.ko.zst:
- Uninstallation
- Deleting from: /usr/lib/modules/5.15.62-1-lts/updates/dkms/
- Original module
- No original module was found for this module on this kernel.
- Use the dkms install command to reinstall any previous module version.

nvidia-modeset.ko.zst:
- Uninstallation
- Deleting from: /usr/lib/modules/5.15.62-1-lts/updates/dkms/
- Original module
- No original module was found for this module on this kernel.
- Use the dkms install command to reinstall any previous module version.

nvidia-drm.ko.zst:
- Uninstallation
- Deleting from: /usr/lib/modules/5.15.62-1-lts/updates/dkms/
- Original module
- No original module was found for this module on this kernel.
- Use the dkms install command to reinstall any previous module version.

nvidia-peermem.ko.zst:
- Uninstallation
- Deleting from: /usr/lib/modules/5.15.62-1-lts/updates/dkms/
- Original module
- No original module was found for this module on this kernel.
- Use the dkms install command to reinstall any previous module version.
depmod...
Deleting module nvidia-515.65.01 completely from the DKMS tree.

I rebooted.
When I ran the commands 'dkms status' and 'dkms autoinstall'(as root), they returned nothing.
So, I went to 'Garuda Settings Manager' --> 'Hardware Configuration', right-clicked on 'video-nvidia-prime-render-offload' and chose 'Reinstall'.
Rebooted.
Then I ran 'dkms status' and it returned...

'dkms status'
nvidia/515.65.01: added

I have now run the 'sudo dkms autoinstall' command which completed sucessfully and am going to reboot.

Cheers

2 Likes

OMG! OMG! :grinning:



'dkms status'
nvidia/515.65.01, 5.15.62-1-lts, x86_64: installed

╭─[email protected] in ~ took 132ms
╰─λ nvidia-smi
Wed Aug 24 09:54:02 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 515.65.01    Driver Version: 515.65.01    CUDA Version: 11.7     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA GeForce ...  Off  | 00000000:01:00.0 Off |                  N/A |
| N/A   36C    P0    N/A /  N/A |      5MiB / 16384MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      1412      G   /usr/lib/Xorg                       4MiB |
+-----------------------------------------------------------------------------+

╭─[email protected] in ~ took 1s
╰─λ prime-run glxinfo -B | grep "OpenGL vendor string"
OpenGL vendor string: NVIDIA Corporation

And the command 'prime-run glxgears' works!

Looks like success. Thank you jonathon and mrvictory for your help!

Cheers,

Jimmy Richards
Proud and Happy to use Garuda :slightly_smiling_face:

4 Likes

Thanks so much for this thread! Fixed my issues also! :slight_smile:

1 Like