Nvidia-settings ERROR: NVIDIA driver is not loaded

Greetings and Salutations Garuda Friends,

I have happily been using Garuda for 9 months now. Thank you! I am having an issue with the nvidia driver being installed, but not it's not loading or functioning. I think perhaps the kernel module did not get built, and then installed by dkms. Here is my garuda-inxi:

System:
Kernel: 5.15.62-1-lts arch: x86_64 bits: 64 compiler: gcc v: 12.2.0
parameters: BOOT_IMAGE=/vmlinuz-linux-lts
root=UUID=272ad521-3b3f-459e-a9a0-0ce5504a8d22 rw rootflags=subvol=@
quiet quiet splash rd.udev.log_priority=3 vt.global_cursor_default=0
loglevel=3
Desktop: KDE Plasma v: 5.25.4 tk: Qt v: 5.15.5 info: latte-dock
wm: kwin_x11 vt: 1 dm: SDDM Distro: Garuda Linux base: Arch Linux
Machine:
Type: Laptop System: Micro-Star product: GE76 Raider 11UH v: REV:1.0
serial: <superuser required> Chassis: type: 10 serial: <superuser required>
Mobo: Micro-Star model: MS-17K3 v: REV:1.0 serial: <superuser required>
UEFI: American Megatrends LLC. v: E17K3IMS.11D date: 01/10/2022
Battery:
ID-1: BAT1 charge: 66.8 Wh (75.5%) condition: 88.5/95.0 Wh (93.2%)
volts: 15.1 min: 15.2 model: MSI BIF0_9 type: Li-ion serial: N/A
status: discharging
CPU:
Info: model: 11th Gen Intel Core i7-11800H bits: 64 type: MT MCP
arch: Tiger Lake gen: core 11 built: 2020 process: Intel 10nm family: 6
model-id: 0x8D (141) stepping: 1 microcode: 0x40
Topology: cpus: 1x cores: 8 tpc: 2 threads: 16 smt: enabled cache:
L1: 640 KiB desc: d-8x48 KiB; i-8x32 KiB L2: 10 MiB desc: 8x1.2 MiB
L3: 24 MiB desc: 1x24 MiB
Speed (MHz): avg: 979 high: 3526 min/max: 800/4600 scaling:
driver: intel_pstate governor: powersave cores: 1: 800 2: 800 3: 800
4: 3526 5: 802 6: 879 7: 863 8: 800 9: 800 10: 800 11: 801 12: 801
13: 800 14: 800 15: 801 16: 801 bogomips: 73728
Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
Vulnerabilities:
Type: itlb_multihit status: Not affected
Type: l1tf status: Not affected
Type: mds status: Not affected
Type: meltdown status: Not affected
Type: mmio_stale_data status: Not affected
Type: retbleed status: Not affected
Type: spec_store_bypass mitigation: Speculative Store Bypass disabled via
prctl and seccomp
Type: spectre_v1 mitigation: usercopy/swapgs barriers and __user pointer
sanitization
Type: spectre_v2 mitigation: Enhanced IBRS, IBPB: conditional, RSB
filling, PBRSB-eIBRS: SW sequence
Type: srbds status: Not affected
Type: tsx_async_abort status: Not affected
Graphics:
Device-1: Intel TigerLake-H GT1 [UHD Graphics] vendor: Micro-Star MSI
driver: i915 v: kernel arch: Gen-12.1 process: Intel 10nm built: 2020-21
ports: active: DP-2 off: eDP-1 empty: DP-1 bus-ID: 00:02.0
chip-ID: 8086:9a60 class-ID: 0300
Device-2: NVIDIA GA104M [GeForce RTX 3080 Mobile / Max-Q 8GB/16GB]
vendor: Micro-Star MSI driver: N/A alternate: nouveau non-free: 515.xx+
status: current (as of 2022-07) arch: Ampere code: GAxxx process: TSMC n7
(7nm) built: 2020-22 pcie: gen: 4 speed: 16 GT/s lanes: 16
bus-ID: 01:00.0 chip-ID: 10de:249c class-ID: 0300
Display: x11 server: X.Org v: 21.1.4 with: Xwayland v: 22.1.3
compositor: kwin_x11 driver: X: loaded: modesetting
alternate: fbdev,intel,vesa gpu: i915 display-ID: :0 screens: 1
Screen-1: 0 s-res: 3840x2160 s-dpi: 96 s-size: 1015x571mm (39.96x22.48")
s-diag: 1165mm (45.85")
Monitor-1: DP-2 pos: primary model: LG (GoldStar) HDR 4K serial: <filter>
built: 2020 res: 3840x2160 hz: 60 dpi: 140 gamma: 1.2
size: 697x392mm (27.44x15.43") diag: 806mm (31.7") ratio: 16:9 modes:
max: 3840x2160 min: 640x480
Monitor-2: eDP-1 note: disabled model: AU Optronics 0xe295 built: 2020
res: 3840x2160 dpi: 128 gamma: 1.2 size: 382x215mm (15.04x8.46")
diag: 438mm (17.3") ratio: 16:9 modes: 1920x1080
OpenGL: renderer: Mesa Intel UHD Graphics (TGL GT1) v: 4.6 Mesa 22.1.6
direct render: Yes
Audio:
Device-1: Intel Tiger Lake-H HD Audio vendor: Micro-Star MSI
driver: sof-audio-pci-intel-tgl
alternate: snd_hda_intel,snd_sof_pci_intel_tgl bus-ID: 00:1f.3
chip-ID: 8086:43c8 class-ID: 0401
Device-2: NVIDIA GA104 High Definition Audio vendor: Micro-Star MSI
driver: snd_hda_intel v: kernel pcie: gen: 4 speed: 16 GT/s lanes: 16
bus-ID: 01:00.1 chip-ID: 10de:228b class-ID: 0403
Sound Server-1: ALSA v: k5.15.62-1-lts running: yes
Sound Server-2: PulseAudio v: 16.1 running: no
Sound Server-3: PipeWire v: 0.3.56 running: yes
Network:
Device-1: Realtek Killer E3000 2.5GbE vendor: Micro-Star MSI driver: r8169
v: kernel pcie: gen: 2 speed: 5 GT/s lanes: 1 port: 4000 bus-ID: 2f:00.0
chip-ID: 10ec:3000 class-ID: 0200
IF: enp47s0 state: up speed: 1000 Mbps duplex: full mac: <filter>
Device-2: Intel Wi-Fi 6 AX210/AX211/AX411 160MHz vendor: Rivet Networks
driver: iwlwifi v: kernel pcie: gen: 2 speed: 5 GT/s lanes: 1
bus-ID: 31:00.0 chip-ID: 8086:2725 class-ID: 0280
IF: wlp49s0 state: down mac: <filter>
Bluetooth:
Device-1: Intel AX210 Bluetooth type: USB driver: btusb v: 0.8
bus-ID: 3-14:9 chip-ID: 8087:0032 class-ID: e001
Report: bt-adapter ID: hci0 rfk-id: 1 state: up address: <filter>
Drives:
Local Storage: total: 2.79 TiB used: 232.65 GiB (8.1%)
SMART Message: Unable to run smartctl. Root privileges required.
ID-1: /dev/nvme0n1 maj-min: 259:9 vendor: Micron
model: 3400 MTFDKBA1T0TFH size: 953.87 GiB block-size: physical: 512 B
logical: 512 B speed: 63.2 Gb/s lanes: 4 type: SSD serial: <filter>
rev: P7MA000 temp: 41.9 C scheme: GPT
ID-2: /dev/nvme1n1 maj-min: 259:0 model: PCIe SSD size: 1.86 TiB
block-size: physical: 512 B logical: 512 B speed: 31.6 Gb/s lanes: 4
type: SSD serial: <filter> rev: ECFM22.6 temp: 26.9 C scheme: GPT
Partition:
ID-1: / raw-size: 66.4 GiB size: 66.4 GiB (100.00%) used: 968.6 MiB (1.4%)
fs: btrfs dev: /dev/nvme1n1p1 maj-min: 259:1
ID-2: /boot raw-size: 6.6 GiB size: 6.6 GiB (100.00%) used: 257.3 MiB
(3.8%) fs: btrfs dev: /dev/nvme1n1p2 maj-min: 259:2
ID-3: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%) used: 608 KiB
(0.2%) fs: vfat dev: /dev/nvme1n1p3 maj-min: 259:3
ID-4: /home raw-size: 150 GiB size: 150 GiB (100.00%) used: 116.1 MiB
(0.1%) fs: btrfs dev: /dev/nvme1n1p4 maj-min: 259:4
ID-5: /opt raw-size: 1.04 TiB size: 1.04 TiB (100.00%) used: 224.4 GiB
(21.2%) fs: btrfs dev: /dev/nvme1n1p8 maj-min: 259:8
ID-6: /usr raw-size: 53.2 GiB size: 53.2 GiB (100.00%) used: 6.79 GiB
(12.8%) fs: btrfs dev: /dev/nvme1n1p5 maj-min: 259:5
ID-7: /var raw-size: 8.6 GiB size: 8.6 GiB (100.00%) used: 148 MiB (1.7%)
fs: btrfs dev: /dev/nvme1n1p6 maj-min: 259:6
ID-8: /var/log raw-size: 66.4 GiB size: 66.4 GiB (100.00%) used: 968.6
MiB (1.4%) fs: btrfs dev: /dev/nvme1n1p1 maj-min: 259:1
ID-9: /var/tmp raw-size: 66.4 GiB size: 66.4 GiB (100.00%) used: 968.6
MiB (1.4%) fs: btrfs dev: /dev/nvme1n1p1 maj-min: 259:1
Swap:
Kernel: swappiness: 133 (default 60) cache-pressure: 100 (default)
ID-1: swap-1 type: zram size: 31.06 GiB used: 0 KiB (0.0%) priority: 100
dev: /dev/zram0
Sensors:
System Temperatures: cpu: 45.0 C mobo: N/A
Fan Speeds (RPM): N/A
Info:
Processes: 408 Uptime: 1h 34m wakeups: 3 Memory: 31.06 GiB used: 5.24 GiB
(16.9%) Init: systemd v: 251 default: graphical tool: systemctl
Compilers: gcc: 12.2.0 Packages: pacman: 1228 lib: 309 Shell: fish v: 3.5.1
default: Bash v: 5.1.16 running-in: konsole inxi: 3.3.20
Garuda (2.6.6-1):
System install date:     2022-08-23
Last full system update: 2022-08-23 ā†»
Is partially upgraded:   No
Relevant software:       NetworkManager
Windows dual boot:       Probably (Run as root to verify)
Snapshots:               Snapper
Failed units:

Some System Configuration info:

I verified that "Secure Boot" is disabled in the BIOS.
There were a number of reboots of my laptop involved here.
I did a brand new installation of Garuda using the latest .iso, and then fully updated with 'garuda-update'. Reboot.
I then installed the linux-lts kernel and headers, which is the kernel I would like to use at this time. Reboot.
Then, I went to 'Garuda Settings Manager' --> 'Hardware Configuration', right-clicked on 'video-nvidia-prime-render-offload' and chose 'Install'. Reboot.
After having installed the Nvidia Driver and then discovering that it was not functioning, I went back to the 'Garuda Settings Manager' --> 'Hardware Configuration', right-clicked on 'video-nvidia-prime-render-offload' and chose 'Reinstall'. Reboot. But there was no change, the issue still persists.

Here are some commands and results for you to examine:

'nvidia-settings'
ERROR: NVIDIA driver is not loaded

'nvidia-smi'
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

'modprobe nvidia'
modprobe: FATAL: Module nvidia not found in directory /lib/modules/5.15.62-1-lts

'pacman -Qs nvidia'
local/egl-wayland 2:1.1.10-1
EGLStream-based Wayland external platform
local/garuda-nvidia-config 1:r62.889aefa-7
Meta configuration package for nvidia systems on Garuda Linux
local/garuda-nvidia-prime-config 1:r62.889aefa-7
Meta configuration package for nvidia prime systems on Garuda Linux:
local/gwe 0.15.5-3
A system utility for controlling NVIDIA GPUs
local/lib32-libvdpau 1.5-1
Nvidia VDPAU library
local/lib32-nvidia-utils 515.65.01-1
NVIDIA drivers utilities (32-bit)
local/lib32-opencl-nvidia 515.65.01-1
OpenCL implemention for NVIDIA (32-bit)
local/libvdpau 1.5-1
Nvidia VDPAU library
local/libxnvctrl 515.65.01-1
NVIDIA NV-CONTROL X extension
local/mhwd-nvidia 515.43.04-2
mhwd-nvidia pci id
local/mhwd-nvidia-390xx 390.143-1
mhwd-nvidia-390xx PCI ID database
local/mhwd-nvidia-470xx 470.94-1
mhwd-nvidia-470xx PCI ID database
local/nvidia-dkms 515.65.01-2
NVIDIA drivers - module sources
local/nvidia-prime 1.0-4
NVIDIA Prime Render Offload configuration and utilities
local/nvidia-settings 515.65.01-1
Tool for configuring the NVIDIA graphics driver
local/nvidia-utils 515.65.01-2
NVIDIA drivers utilities
local/opencl-nvidia 515.65.01-2
OpenCL implemention for NVIDIA
local/xf86-video-nouveau 1.0.17-2 (xorg-drivers)
Open Source 3D acceleration driver for nVidia cards

'mhwd -li'
> Installed PCI configs:
--------------------------------------------------------------------------------
NAME               VERSION          FREEDRIVER           TYPE
--------------------------------------------------------------------------------
video-linux            2021.08.29                true            PCI
video-nvidia-prime-render-offload            2022.08.17               false            PCI


Warning: No installed USB configs!

'dkms status'
nvidia/515.65.01: added

Regarding that last command, it shows as "nvidia/515.65.01: added". I was wondering if it should also show as "built/installed"?

I sure appreciate anyone taking the time to have a look at this issue. Thank you.

P.S. I have enjoyed Garuda so much that I have happily given a few donations since I started using it. Thanks again and keep up the great work!

1 Like

Could you try adding ibt=off to your kernel parameters? This shouldn't be necessary because you use LTS kernel but it is still worth trying.

This is not the full output. Please donā€™t trim or truncate output.

Run dkms autoinstall and read the output carefully.

3 Likes

Dear jonathon,

Your recommnded command 'dkms autoinstall' (which had to be done as sudo/root) appears to have delivered the proper fix for the issue. I will need to reboot to be certain.

Here is the output:

[root@msi-ge76raider ~]#dkms autoinstall
Sign command: /usr/lib/modules/5.15.62-1-lts/build/scripts/sign-file
Signing key: /var/lib/dkms/mok.key
Public certificate (MOK): /var/lib/dkms/mok.pub

Building module:
Cleaning build area...
'make' -j16 IGNORE_PREEMPT_RT_PRESENCE=1 NV_EXCLUDE_BUILD_MODULES='__EXCLUDE_MODULES' KERNEL_UNAME=5.15.62-1-lts modules........
Signing module /var/lib/dkms/nvidia/515.65.01/build/nvidia.ko
Signing module /var/lib/dkms/nvidia/515.65.01/build/nvidia-uvm.ko
Signing module /var/lib/dkms/nvidia/515.65.01/build/nvidia-modeset.ko
Signing module /var/lib/dkms/nvidia/515.65.01/build/nvidia-drm.ko
Signing module /var/lib/dkms/nvidia/515.65.01/build/nvidia-peermem.ko
Cleaning build area...

nvidia.ko.zst:
Running module version sanity check.
- Original module
- No original module exists within this kernel
- Installation
- Installing to /usr/lib/modules/5.15.62-1-lts/updates/dkms/

nvidia-uvm.ko.zst:
Running module version sanity check.
- Original module
- No original module exists within this kernel
- Installation
- Installing to /usr/lib/modules/5.15.62-1-lts/updates/dkms/

nvidia-modeset.ko.zst:
Running module version sanity check.
- Original module
- No original module exists within this kernel
- Installation
- Installing to /usr/lib/modules/5.15.62-1-lts/updates/dkms/

nvidia-drm.ko.zst:
Running module version sanity check.
- Original module
- No original module exists within this kernel
- Installation
- Installing to /usr/lib/modules/5.15.62-1-lts/updates/dkms/

nvidia-peermem.ko.zst:
Running module version sanity check.
- Original module
- No original module exists within this kernel
- Installation
- Installing to /usr/lib/modules/5.15.62-1-lts/updates/dkms/
depmod...

Thank you very much!

There is something elese I would like to address....

One thing that we all know is our own experiences. On my system and linux installation, the command 'dkms status' showed exactly what I posted, nothing more and nothing less. And it didn't matter whether it was run as regular user or as root user. I did not truncate anything. Please keep in mind that you do not know what I have experienced, only what you have experienced. I do not appreciate being accused of something that I have not done or am not doing. I hope you can understand that. Anyway...

Again, thank you jonathon! Take care and cheers

That would not be output that Iā€™m used to - dkms status should show the kernel version too, e.g.

nvidia/515.49.10, 5.19.3-lqx1-1-lqx, x86_64: installed
nvidia/515.49.10, 5.19.3-zen1-1-zen, x86_64: installed

I guess ā€œaddedā€ is some intermediate broken stateā€¦

1 Like

Looks like DKMS itself is broken:

Install regular linux kernel and the precompiled driver for it:

sudo pacman -S linux linux-headers nvidia

then reboot and choose "linux" kernel, not "linux-lts"

The problem was fixed by triggering a DKMS moduyle build - there's no need to install another kernel and the pre-compiled driver module (which will conflict with nvidia-dkms and so break NVIDIA for the other kernels)

Well jonathon, from the looks of the result of your command, I thought the fix was in. But no, the issue persists. However, the 'dkms status' command now has full output and states that it is now 'installed' rather than 'added'.

'nvidia-settings'
ERROR: NVIDIA driver is not loaded

'nvidia-smi'
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

[root@msi-ge76raider ~]# modprobe nvidia
modprobe: ERROR: could not insert 'nvidia': Invalid argument

'dkms status'
nvidia/515.65.01, 5.15.62-1-lts, x86_64: installed

Thank you for your efforts

Hello mrvictory,

The reason I am using the LTS Kernel is because there has been some "issues" on my end ever since the introduction of the 5.18.x kernel.

Here is what I will do. I already have the latest Zen Kernel installed. I will boot into it, which works ok. But I have no 3D Accelerated output when using that kernel, and when I exit Steam there is a strange issue. We will see what happens with this new installation I am running, but I expect to have the same problem(s) I was having the last time I tried using it.

I will let you know what happens fairly soon. Thank you for trying to help!

Then run this:

sudo pacman -S --needed linux-lts-headers nvidia-lts

Hello again mrvictory,

I was just looking over that bug report. After getting the kernel module built thanks to jonathon's help, this bug report looks like it is the bug I have encountered.

Now that the kernel module is installed, when I do 'modprobe nvidia' as root, I get...

[root@msi-ge76raider ~]# modprobe nvidia
modprobe: ERROR: could not insert 'nvidia': Invalid argument

I will need to take some time to try the bug reporter's[Matt M. (9numbernine9)] script/fix.

Cheers

The fix was to update pahole, so just update your system.

2 Likes

Ok jonathon,

I will do that and be right back. Thanks

Isnā€™t it downgrade pahole?

1 Like

OT :smiley:

egal = no matter :grin:

2 Likes

Hello again folks,

The issue still persists.

You are correct mrvictory, it was a downgrade. pahole went from version 1.24 to 1.23.


I am wondering if I need to uninstall dkms nvidia kernel module, and build it again since the change to the pahole package.

It looks like Arch devs did timely fix the bugā€‹:partying_face:
I am glad I didnā€™t set off the alarms this time :sweat_smile:

1 Like

Howdy,

It's getting a little hairy, but nothing I can't handle. Here's what's going on at this time:

After the downgrade to pahole, I rebooted.
Then I ran the following command as root...

[root@msi-ge76raider pkg]# dkms remove nvidia/515.65.01
Module nvidia-515.65.01 for kernel 5.15.62-1-lts (x86_64).
Before uninstall, this module version was ACTIVE on this kernel.

nvidia.ko.zst:
- Uninstallation
- Deleting from: /usr/lib/modules/5.15.62-1-lts/updates/dkms/
- Original module
- No original module was found for this module on this kernel.
- Use the dkms install command to reinstall any previous module version.

nvidia-uvm.ko.zst:
- Uninstallation
- Deleting from: /usr/lib/modules/5.15.62-1-lts/updates/dkms/
- Original module
- No original module was found for this module on this kernel.
- Use the dkms install command to reinstall any previous module version.

nvidia-modeset.ko.zst:
- Uninstallation
- Deleting from: /usr/lib/modules/5.15.62-1-lts/updates/dkms/
- Original module
- No original module was found for this module on this kernel.
- Use the dkms install command to reinstall any previous module version.

nvidia-drm.ko.zst:
- Uninstallation
- Deleting from: /usr/lib/modules/5.15.62-1-lts/updates/dkms/
- Original module
- No original module was found for this module on this kernel.
- Use the dkms install command to reinstall any previous module version.

nvidia-peermem.ko.zst:
- Uninstallation
- Deleting from: /usr/lib/modules/5.15.62-1-lts/updates/dkms/
- Original module
- No original module was found for this module on this kernel.
- Use the dkms install command to reinstall any previous module version.
depmod...
Deleting module nvidia-515.65.01 completely from the DKMS tree.

I rebooted.
When I ran the commands 'dkms status' and 'dkms autoinstall'(as root), they returned nothing.
So, I went to 'Garuda Settings Manager' --> 'Hardware Configuration', right-clicked on 'video-nvidia-prime-render-offload' and chose 'Reinstall'.
Rebooted.
Then I ran 'dkms status' and it returned...

'dkms status'
nvidia/515.65.01: added

I have now run the 'sudo dkms autoinstall' command which completed sucessfully and am going to reboot.

Cheers

2 Likes

OMG! OMG! :grinning:



'dkms status'
nvidia/515.65.01, 5.15.62-1-lts, x86_64: installed

ā•­ā”€onetinsoldier@msi in ~ took 132ms
ā•°ā”€Ī» nvidia-smi
Wed Aug 24 09:54:02 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 515.65.01    Driver Version: 515.65.01    CUDA Version: 11.7     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA GeForce ...  Off  | 00000000:01:00.0 Off |                  N/A |
| N/A   36C    P0    N/A /  N/A |      5MiB / 16384MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      1412      G   /usr/lib/Xorg                       4MiB |
+-----------------------------------------------------------------------------+

ā•­ā”€onetinsoldier@msi in ~ took 1s
ā•°ā”€Ī» prime-run glxinfo -B | grep "OpenGL vendor string"
OpenGL vendor string: NVIDIA Corporation

And the command 'prime-run glxgears' works!

Looks like success. Thank you jonathon and mrvictory for your help!

Cheers,

Jimmy Richards
Proud and Happy to use Garuda :slightly_smiling_face:

4 Likes

Thanks so much for this thread! Fixed my issues also! :slight_smile:

1 Like