Sometimes when I boot it works fine for hours (or the entire time it is running), but some times it becomes rather choppy (also pretty much all the time after initial boot for a few minutes). Loading and using websites is sluggish/delayed.
At times, while watching a video, the entire screen will freeze up. Sometimes it will correct itself after 10-20 seconds and other times it will remain locked (I can still hear the audio from the playing video).
In the log I see a bunch of stuff (hardware & software) being reset/restarting/reinitializing. I got this CPU crash in the journal today:
May 28 07:13:52 jim-zbf15 kernel: NETDEV WATCHDOG: enp0s20f0u2u3 (r8152): transmit queue 0 timed out
May 28 07:13:52 jim-zbf15 kernel: WARNING: CPU: 6 PID: 0 at net/sched/sch_generic.c:529 dev_watchdog+0x28d/0x2a0
May 28 07:13:52 jim-zbf15 kernel: Modules linked in: dm_mod snd_seq_dummy snd_hrtimer snd_seq rfcomm nvidia_drm(POE) nvidia_modeset(POE) nvidia(POE) cmac algif_hash algif_skcipher af_alg bnep btusb btrtl btbcm cdc_ether uvcvideo btintel rtsx_usb_ms usbnet rtsx_usb_sdmmc videobuf2_vmalloc videobuf2_memops mmc_core videobuf2_v4l2 btmtk memstick bluetooth videobuf2_common r8152 rtsx_usb videodev ecdh_generic mii crc16 qrtr hid_sensor_custom_intel_hinge snd_soc_skl_hda_dsp snd_soc_intel_hda_dsp_common hid_sensor_accel_3d snd_soc_hdac_hdmi hid_sensor_trigger industrialio_triggered_buffer kfifo_buf hid_sensor_iio_common industrialio hid_sensor_custom hid_sensor_hub intel_ishtp_hid vfat fat snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic snd_soc_dmic snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_soc_hdac_hda snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi joydev mousedev
May 28 07:13:52 jim-zbf15 kernel: soundwire_bus ledtrig_audio intel_tcc_cooling snd_soc_core x86_pkg_temp_thermal iwlmvm snd_compress intel_powerclamp iTCO_wdt ac97_bus coretemp tps6598x typec snd_pcm_dmaengine intel_pmc_bxt pmt_telemetry mac80211 roles hid_multitouch kvm_intel pmt_class iTCO_vendor_support libarc4 intel_rapl_msr snd_hda_intel kvm snd_intel_dspcfg snd_intel_sdw_acpi irqbypass snd_usb_audio snd_hda_codec crct10dif_pclmul iwlwifi crc32_pclmul snd_usbmidi_lib ghash_clmulni_intel snd_rawmidi snd_hda_core asus_nb_wmi aesni_intel intel_spi_pci snd_seq_device iwlmei asus_wmi intel_spi snd_hwdep crypto_simd cryptd intel_cstate intel_uncore mc cfg80211 platform_profile wmi_bmof snd_pcm spi_nor intel_lpss_pci rfkill snd_timer i2c_i801 intel_lpss mtd i2c_smbus idma64 mei snd intel_ish_ipc thunderbolt i2c_hid_acpi intel_vsec intel_ishtp usbhid soundcore i2c_multi_instantiate i2c_hid tpm_crb int3403_thermal processor_thermal_device_pci_legacy tpm_tis processor_thermal_device tpm_tis_core
May 28 07:13:52 jim-zbf15 kernel: processor_thermal_rfim tpm processor_thermal_mbox rng_core processor_thermal_rapl intel_hid intel_rapl_common sparse_keymap int3400_thermal soc_button_array int340x_thermal_zone acpi_thermal_rel mac_hid acpi_pad acpi_tad intel_soc_dts_iosf igen6_edac uinput ipmi_devintf ipmi_msghandler crypto_user fuse acpi_call(OE) zram bpf_preload ip_tables x_tables btrfs blake2b_generic libcrc32c crc32c_generic xor raid6_pq serio_raw atkbd libps2 mxm_wmi nvme xhci_pci nvme_core xhci_pci_renesas i8042 serio wmi radeon amdgpu gpu_sched drm_ttm_helper intel_agp crc32c_intel i915 video ttm intel_gtt
May 28 07:13:52 jim-zbf15 kernel: CPU: 6 PID: 0 Comm: swapper/6 Tainted: P OE 5.17.9-zen1-1-zen #1 b447c96d85e73a9127528593906685354a7b7c5d
May 28 07:13:52 jim-zbf15 kernel: Hardware name: ASUSTeK COMPUTER INC. ZenBook UX564EH_Q528EH/UX564EH, BIOS UX564EH.312 03/09/2022
May 28 07:13:52 jim-zbf15 kernel: RIP: 0010:dev_watchdog+0x28d/0x2a0
May 28 07:13:52 jim-zbf15 kernel: Code: ff e9 cf fe ff ff 48 89 ef c6 05 de 07 48 01 01 e8 d8 f5 f8 ff 44 89 e9 48 89 ee 48 c7 c7 50 1a 76 ac 48 89 c2 e8 45 62 1b 00 <0f> 0b e9 b1 fe ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 90 0f 1f 44
May 28 07:13:52 jim-zbf15 kernel: RSP: 0018:ffffb1c5c02dce70 EFLAGS: 00010286
May 28 07:13:52 jim-zbf15 kernel: RAX: 0000000000000000 RBX: ffff9519012d041c RCX: 0000000000000027
May 28 07:13:52 jim-zbf15 kernel: RDX: ffff951c877a16e8 RSI: 0000000000000001 RDI: ffff951c877a16e0
May 28 07:13:52 jim-zbf15 kernel: RBP: ffff9519012d0000 R08: 0000000000000001 R09: 00000000ffffffea
May 28 07:13:52 jim-zbf15 kernel: R10: fffffffface5abc0 R11: 0000000000000002 R12: ffff9519012d04c8
May 28 07:13:52 jim-zbf15 kernel: R13: 0000000000000000 R14: ffff951c877a1dc0 R15: ffffffffabb48ef0
May 28 07:13:52 jim-zbf15 kernel: FS: 0000000000000000(0000) GS:ffff951c87780000(0000) knlGS:0000000000000000
May 28 07:13:52 jim-zbf15 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 28 07:13:52 jim-zbf15 kernel: CR2: 00001b78a001f000 CR3: 00000003a1876002 CR4: 0000000000770ee0
May 28 07:13:52 jim-zbf15 kernel: PKRU: 55555554
May 28 07:13:52 jim-zbf15 kernel: Call Trace:
May 28 07:13:52 jim-zbf15 kernel: <IRQ>
May 28 07:13:52 jim-zbf15 kernel: ? mq_change_real_num_tx+0xd0/0xd0
May 28 07:13:52 jim-zbf15 kernel: ? mq_change_real_num_tx+0xd0/0xd0
May 28 07:13:52 jim-zbf15 kernel: call_timer_fn+0x24/0x130
May 28 07:13:52 jim-zbf15 kernel: run_timer_softirq+0x931/0xb20
May 28 07:13:52 jim-zbf15 kernel: ? timerqueue_add+0x91/0xb0
May 28 07:13:52 jim-zbf15 kernel: __do_softirq+0xcd/0x2c4
May 28 07:13:52 jim-zbf15 kernel: ? sched_clock_cpu+0x9/0x120
May 28 07:13:52 jim-zbf15 kernel: irq_exit_rcu+0x91/0xc0
May 28 07:13:52 jim-zbf15 kernel: sysvec_apic_timer_interrupt+0x6e/0x90
May 28 07:13:52 jim-zbf15 kernel: </IRQ>
May 28 07:13:52 jim-zbf15 kernel: <TASK>
May 28 07:13:52 jim-zbf15 kernel: asm_sysvec_apic_timer_interrupt+0x12/0x20
May 28 07:13:52 jim-zbf15 kernel: RIP: 0010:cpuidle_enter_state+0xda/0x780
May 28 07:13:52 jim-zbf15 kernel: Code: 31 ff e8 19 dd 6c ff 45 84 ff 74 17 9c 58 0f 1f 44 00 00 f6 c4 02 0f 85 48 05 00 00 31 ff e8 2d cd 72 ff fb 66 0f 1f 44 00 00 <45> 85 ed 0f 88 6f 01 00 00 49 63 f5 4c 89 f2 48 8d 04 76 48 8d 04
May 28 07:13:52 jim-zbf15 kernel: RSP: 0018:ffffb1c5c0193ea8 EFLAGS: 00000246
May 28 07:13:52 jim-zbf15 kernel: RAX: ffff951c877b2a00 RBX: ffff951c877bd100 RCX: 0000000000000000
May 28 07:13:52 jim-zbf15 kernel: RDX: 00002cb4c1156f76 RSI: fffffffe225231d1 RDI: 0000000000000000
May 28 07:13:52 jim-zbf15 kernel: RBP: 0000000000000001 R08: 0000000000000000 R09: 000000002da97f6a
May 28 07:13:52 jim-zbf15 kernel: R10: 0000000000000042 R11: 000000000000008e R12: ffffffffacf49aa0
May 28 07:13:52 jim-zbf15 kernel: R13: 0000000000000001 R14: 00002cb4c1156f76 R15: 0000000000000000
May 28 07:13:52 jim-zbf15 kernel: ? cpuidle_enter_state+0xb7/0x780
May 28 07:13:52 jim-zbf15 kernel: cpuidle_enter+0x29/0x40
May 28 07:13:52 jim-zbf15 kernel: do_idle+0x1bd/0x220
May 28 07:13:52 jim-zbf15 kernel: cpu_startup_entry+0x19/0x20
May 28 07:13:52 jim-zbf15 kernel: secondary_startup_64_no_verify+0xd5/0xdb
May 28 07:13:52 jim-zbf15 kernel: </TASK>
May 28 07:13:52 jim-zbf15 kernel: ---[ end trace 0000000000000000 ]---
When I open the System Monitor it shows core 8 as being almost maxed out (about 92% usage) all the time even though nothing should be doing that much work, and the processes list shows no processes using more than a fraction of a percent of CPU.
Here is inxi:
System:
Kernel: 5.17.9-zen1-1-zen arch: x86_64 bits: 64 compiler: gcc v: 12.1.0
parameters: BOOT_IMAGE=/@/boot/vmlinuz-linux-zen
root=UUID=a55c4a54-0d20-415e-93d9-23fd6938b438 rw rootflags=subvol=@
quiet quiet splash rd.udev.log_priority=3 vt.global_cursor_default=0
resume=UUID=f2faf51c-db74-457c-bb9d-eaf707594e6f loglevel=3
Desktop: KDE Plasma v: 5.24.5 tk: Qt v: 5.15.4 wm: kwin_x11 vt: 1
dm: SDDM Distro: Garuda Linux base: Arch Linux
Machine:
Type: Convertible System: ASUSTeK product: ZenBook UX564EH_Q528EH v: 1.0
serial: <superuser required>
Mobo: ASUSTeK model: UX564EH v: 1.0 serial: <superuser required>
UEFI: American Megatrends LLC. v: UX564EH.312 date: 03/09/2022
Battery:
ID-1: BAT0 charge: 91.7 Wh (100.0%) condition: 91.7/96.0 Wh (95.6%)
volts: 11.7 min: 11.7 model: ASUSTeK ASUS Battery type: Li-ion serial: N/A
status: not charging cycles: 4
Device-1: hid-0018:04F3:2C26.0003-battery model: ELAN9009:00 04F3:2C26
serial: N/A charge: N/A status: N/A
CPU:
Info: model: 11th Gen Intel Core i7-1165G7 bits: 64 type: MT MCP
arch: Tiger Lake family: 6 model-id: 0x8C (140) stepping: 1 microcode: 0xA4
Topology: cpus: 1x cores: 4 tpc: 2 threads: 8 smt: enabled cache:
L1: 320 KiB desc: d-4x48 KiB; i-4x32 KiB L2: 5 MiB desc: 4x1.2 MiB
L3: 12 MiB desc: 1x12 MiB
Speed (MHz): avg: 3003 high: 4655 min/max: 400/4700 scaling:
driver: intel_pstate governor: powersave cores: 1: 1646 2: 3640 3: 976
4: 4613 5: 2433 6: 3688 7: 2380 8: 4655 bogomips: 44851
Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx
Vulnerabilities:
Type: itlb_multihit status: Not affected
Type: l1tf status: Not affected
Type: mds status: Not affected
Type: meltdown status: Not affected
Type: spec_store_bypass
mitigation: Speculative Store Bypass disabled via prctl
Type: spectre_v1
mitigation: usercopy/swapgs barriers and __user pointer sanitization
Type: spectre_v2
mitigation: Enhanced IBRS, IBPB: conditional, RSB filling
Type: srbds status: Not affected
Type: tsx_async_abort status: Not affected
Graphics:
Device-1: Intel TigerLake-LP GT2 [Iris Xe Graphics] vendor: ASUSTeK
driver: i915 v: kernel ports: active: DP-2,eDP-1 empty: DP-1,HDMI-A-1
bus-ID: 00:02.0 chip-ID: 8086:9a49 class-ID: 0300
Device-2: NVIDIA TU117M vendor: ASUSTeK driver: nvidia v: 515.43.04
alternate: nouveau,nvidia_drm non-free: 515.xx+
status: current (as of 2022-05) arch: Turing pcie: gen: 3 speed: 8 GT/s
lanes: 4 link-max: lanes: 16 bus-ID: 58:00.0 chip-ID: 10de:1f99
class-ID: 0302
Device-3: IMC Networks USB2.0 HD UVC WebCam type: USB driver: uvcvideo
bus-ID: 3-5:5 chip-ID: 13d3:56eb class-ID: fe01 serial: <filter>
Display: x11 server: X.Org v: 21.1.3 with: Xwayland v: 22.1.2
compositor: kwin_x11 driver: X: loaded: modesetting,nvidia gpu: i915
display-ID: :0 screens: 1
Screen-1: 0 s-res: 3840x1521 s-dpi: 96 s-size: 1013x401mm (39.88x15.79")
s-diag: 1089mm (42.89")
Monitor-1: DP-2 pos: top-right model: Asus ML238 serial: <filter>
built: 2011 res: 1920x1080 hz: 60 dpi: 96 gamma: 1.2
size: 509x286mm (20.04x11.26") diag: 584mm (23") ratio: 16:9 modes:
max: 1920x1080 min: 720x400
Monitor-2: eDP-1 pos: primary,bottom-l model: BOE Display 0x07d8
built: 2018 res: 1920x1080 hz: 60 dpi: 142 gamma: 1.2
size: 344x194mm (13.54x7.64") diag: 395mm (15.5") ratio: 16:9
modes: 1920x1080
OpenGL: renderer: Mesa Intel Xe Graphics (TGL GT2) v: 4.6 Mesa 22.1.0
direct render: Yes
Audio:
Device-1: Intel Tiger Lake-LP Smart Sound Audio vendor: ASUSTeK
driver: sof-audio-pci-intel-tgl
alternate: snd_hda_intel,snd_sof_pci_intel_tgl bus-ID: 00:1f.3
chip-ID: 8086:a0c8 class-ID: 0401
Device-2: Logitech H390 headset with microphone type: USB
driver: hid-generic,snd-usb-audio,usbhid bus-ID: 3-2.4.1:6
chip-ID: 046d:0a8f class-ID: 0300
Sound Server-1: ALSA v: k5.17.9-zen1-1-zen running: yes
Sound Server-2: sndio v: N/A running: no
Sound Server-3: PulseAudio v: 15.0 running: no
Sound Server-4: PipeWire v: 0.3.51 running: yes
Network:
Device-1: Intel Wi-Fi 6 AX201 driver: iwlwifi v: kernel bus-ID: 00:14.3
chip-ID: 8086:a0f0 class-ID: 0280
IF: wlo1 state: down mac: <filter>
Device-2: TP-Link UE300 10/100/1000 LAN (ethernet mode) [Realtek RTL8153]
type: USB driver: r8152 bus-ID: 4-2.3:3 chip-ID: 2357:0601 class-ID: 0000
serial: <filter>
IF: enp0s20f0u2u3 state: up speed: 1000 Mbps duplex: full mac: <filter>
Bluetooth:
Device-1: Intel AX201 Bluetooth type: USB driver: btusb v: 0.8
bus-ID: 3-10:8 chip-ID: 8087:0026 class-ID: e001
Report: bt-adapter ID: hci0 rfk-id: 3 state: up address: <filter>
Drives:
Local Storage: total: 504.19 GiB used: 41.22 GiB (8.2%)
SMART Message: Unable to run smartctl. Root privileges required.
ID-1: /dev/nvme0n1 maj-min: 259:0 vendor: Intel model: HBRPEKNX0202A
size: 476.94 GiB block-size: physical: 512 B logical: 512 B
speed: 15.8 Gb/s lanes: 2 type: SSD serial: <filter> rev: G002
temp: 30.9 C scheme: GPT
ID-2: /dev/nvme1n1 maj-min: 259:4 vendor: Intel model: HBRPEKNX0202AO
size: 27.25 GiB block-size: physical: 512 B logical: 512 B speed: 15.8 Gb/s
lanes: 2 type: SSD serial: <filter> rev: K5110440 temp: 38.9 C
scheme: GPT
Partition:
ID-1: / raw-size: 459.78 GiB size: 459.78 GiB (100.00%)
used: 41.22 GiB (9.0%) fs: btrfs dev: /dev/nvme0n1p2 maj-min: 259:2
ID-2: /boot/efi raw-size: 300 MiB size: 299.4 MiB (99.80%)
used: 576 KiB (0.2%) fs: vfat dev: /dev/nvme0n1p1 maj-min: 259:1
ID-3: /home raw-size: 459.78 GiB size: 459.78 GiB (100.00%)
used: 41.22 GiB (9.0%) fs: btrfs dev: /dev/nvme0n1p2 maj-min: 259:2
ID-4: /var/log raw-size: 459.78 GiB size: 459.78 GiB (100.00%)
used: 41.22 GiB (9.0%) fs: btrfs dev: /dev/nvme0n1p2 maj-min: 259:2
ID-5: /var/tmp raw-size: 459.78 GiB size: 459.78 GiB (100.00%)
used: 41.22 GiB (9.0%) fs: btrfs dev: /dev/nvme0n1p2 maj-min: 259:2
Swap:
Kernel: swappiness: 133 (default 60) cache-pressure: 100 (default)
ID-1: swap-1 type: zram size: 15.32 GiB used: 0 KiB (0.0%) priority: 100
dev: /dev/zram0
ID-2: swap-2 type: partition size: 16.86 GiB used: 0 KiB (0.0%)
priority: -2 dev: /dev/nvme0n1p3 maj-min: 259:3
Sensors:
System Temperatures: cpu: 61.0 C mobo: N/A
Fan Speeds (RPM): cpu: 2800
Info:
Processes: 285 Uptime: 14h 34m wakeups: 6882 Memory: 15.32 GiB
used: 4.68 GiB (30.5%) Init: systemd v: 251 tool: systemctl Compilers:
gcc: 12.1.0 clang: 13.0.1 Packages: pacman: 1558 lib: 341 Shell: fish
v: 3.4.1 default: Bash v: 5.1.16 running-in: konsole inxi: 3.3.16
Garuda (2.6.3-2):
System install date: 2022-05-15
Last full system update: 2022-05-27
Is partially upgraded: No
Relevant software: NetworkManager
Windows dual boot: No/Undetected
Snapshots: Snapper
Failed units:
I wrote this guide in the FAQ section just for you
As you have not mentioned trying, the first thing to do is testing alternate kernels. As your hardware is rather new the first one to test is the linux-mainline
kernel.
Lots of other suggestions in that FAQ tutorial for you to read and try as well.
2 Likes
Could you upload a journal file of a sluggish system?
Yes that would be helpful as this is not a complete freeze at times there should be some indication in your logs as to the cause. I'm guessing your video drivers are the culprit, but too soon to say.
I've already tried some of the things in your guide. I tried LTS and mainline kernels, the frequency of issues seems to be about the same. I do noticed that going earlier than LTS the issues become more frequent and sometimes won't completely boot.
I tried some suggestions for kernel parameters for Nvidia cards, but none of them made any difference and one caused it to fail to boot. Don't know if there are any other parameters that I should try.
The BIOS is up-to-date (there was an update last month for a security issue).
The memory and disk seem to be fine, never saw anything odd related to them. It doesn't seem to let me run any tests on the drive (maybe because it's ssd), but the info states everything is fine - no errors or issues.
One odd thing - not sure if it has anything to do with it: Kernels earlier than 5.17 cause the active program window to freeze up for a few seconds when I press the function modifier key.
Here is a link to download the log that includes the CPU crash: Send (Pastebin said it was too big)
1 Like
If you look on the top right hand side of the forum there is a link for a private bin that is very useful for a pastebin type replacement service with an expiry date. This is preferable to making assistants download logs. I am only on my cell ATM, and downloading logs is not very convenient when using a cell.
There was an update to kernel 5.18 (I figured that was going to happen soon).
Only difference I've seen so far is that the nvidia driver segfaults along with optimius manager. So the nvidia card is no longer usable.
May 28 18:01:42 jim-zbf15 kernel: nvidia: module license 'NVIDIA' taints kernel.
May 28 18:01:42 jim-zbf15 kernel: Disabling lock debugging due to kernel taint
May 28 18:01:42 jim-zbf15 kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 506
May 28 18:01:42 jim-zbf15 kernel:
May 28 18:01:42 jim-zbf15 kernel: traps: Missing ENDBR: _nv011430rm+0x0/0x10 [nvidia]
May 28 18:01:42 jim-zbf15 kernel: ------------[ cut here ]------------
May 28 18:01:42 jim-zbf15 kernel: kernel BUG at arch/x86/kernel/traps.c:252!
May 28 18:01:42 jim-zbf15 kernel: fbcon: Taking over console
May 28 18:01:42 jim-zbf15 kernel: invalid opcode: 0000 [#1] PREEMPT SMP NOPTI
May 28 18:01:42 jim-zbf15 kernel: CPU: 4 PID: 753 Comm: modprobe Tainted: P OE 5.18.0-zen1-1-zen #1 8c1b4772d057e8d6ef1ec6c49ac9700bcd2a2e4e
May 28 18:01:42 jim-zbf15 kernel: Hardware name: ASUSTeK COMPUTER INC. ZenBook UX564EH_Q528EH/UX564EH, BIOS UX564EH.312 03/09/2022
May 28 18:01:42 jim-zbf15 kernel: RIP: 0010:exc_control_protection+0xc2/0xd0
May 28 18:01:42 jim-zbf15 kernel: Code: 8b 93 80 00 00 00 be f9 00 00 00 48 c7 c7 5e 8c 26 bc e8 71 80 30 ff e9 72 ff ff ff 48 c7 c7 45 8c 26 bc e8 35 2f fa ff 0f 0b <0f> 0b 66 66 2e 0f 1f 84 00 00 00 00 00 90 66 0f 1f 00 41 54 55 53
May 28 18:01:42 jim-zbf15 kernel: RSP: 0018:ffffbddc46c87b08 EFLAGS: 00010002
May 28 18:01:42 jim-zbf15 kernel: RAX: 0000000000000033 RBX: ffffbddc46c87b28 RCX: 0000000000000027
May 28 18:01:42 jim-zbf15 kernel: RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffff9ba6077216a0
May 28 18:01:42 jim-zbf15 kernel: RBP: 0000000000000003 R08: 0000000000000001 R09: 00000000ffffffea
May 28 18:01:42 jim-zbf15 kernel: R10: ffffffffbca5aa20 R11: 0000000000000002 R12: 0000000000000000
May 28 18:01:42 jim-zbf15 kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
May 28 18:01:42 jim-zbf15 kernel: FS: 00007fba60e36740(0000) GS:ffff9ba607700000(0000) knlGS:0000000000000000
May 28 18:01:42 jim-zbf15 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 28 18:01:42 jim-zbf15 kernel: CR2: 00005645ceb60000 CR3: 00000001122b8001 CR4: 0000000000f70ee0
May 28 18:01:42 jim-zbf15 kernel: PKRU: 55555554
May 28 18:01:42 jim-zbf15 kernel: Call Trace:
May 28 18:01:42 jim-zbf15 kernel: <TASK>
May 28 18:01:42 jim-zbf15 kernel: asm_exc_control_protection+0x22/0x30
May 28 18:01:42 jim-zbf15 kernel: RIP: 0010:_nv011430rm+0x0/0x10 [nvidia]
May 28 18:01:42 jim-zbf15 kernel: Code: 66 2e 0f 1f 84 00 00 00 00 00 48 83 ec 08 e8 07 0f 1e 00 48 83 c4 08 48 89 c7 e9 bb ff ff ff 66 2e 0f 1f 84 00 00 00 00 00 90 <48> 89 f7 e9 18 08 00 00 0f 1f 84 00 00 00 00 00 48 89 f7 e9 18 08
May 28 18:01:42 jim-zbf15 kernel: RSP: 0018:ffffbddc46c87bd0 EFLAGS: 00010202
May 28 18:01:42 jim-zbf15 kernel: RAX: ffffffffc2c3ecb0 RBX: ffffffffc4d38b50 RCX: 0000000000000000
May 28 18:01:42 jim-zbf15 kernel: RDX: 000000000003bd10 RSI: 0000000000000010 RDI: ffffffffc4d38b50
May 28 18:01:42 jim-zbf15 kernel: RBP: ffff9ba2d0d0afe0 R08: 0000000000000020 R09: ffffffffc4d38b90
May 28 18:01:42 jim-zbf15 kernel: R10: ffffffffc4cef890 R11: 0000000000000000 R12: 0000000000000010
May 28 18:01:42 jim-zbf15 kernel: R13: ffff9ba2d0d08000 R14: ffffbddc46c87cee R15: 0000000000000000
May 28 18:01:42 jim-zbf15 kernel: ? _nv034888rm+0x20/0x20 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: _nv011428rm+0x24/0xe0 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: ? nvidia_init_module+0x627/0x627 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: _nv034889rm+0xe/0xa0 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: _nv034892rm+0x1d/0x30 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: _nv034894rm+0x2f/0x40 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: _nv015562rm+0x15/0x70 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: _nv000644rm+0x9/0x20 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: ? cdev_add+0x4d/0x60
May 28 18:01:42 jim-zbf15 kernel: rm_init_rm+0x17/0x60 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: ? nvidia_init_module+0x627/0x627 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: nvidia_init_module+0x22e/0x627 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: ? nvidia_init_module+0x627/0x627 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: nvidia_frontend_init_module+0x50/0x91 [nvidia 5872414adba7d3d23e38cc42a85d4c2fbe012243]
May 28 18:01:42 jim-zbf15 kernel: do_one_initcall+0x118/0x2d0
May 28 18:01:42 jim-zbf15 kernel: do_init_module+0x4a/0x240
May 28 18:01:42 jim-zbf15 kernel: __x64_sys_init_module+0x7c/0xd0
May 28 18:01:42 jim-zbf15 kernel: do_syscall_64+0x5c/0x90
May 28 18:01:42 jim-zbf15 kernel: ? do_syscall_64+0x6b/0x90
May 28 18:01:42 jim-zbf15 kernel: ? exc_page_fault+0x74/0x170
May 28 18:01:42 jim-zbf15 kernel: entry_SYSCALL_64_after_hwframe+0x44/0xae
May 28 18:01:42 jim-zbf15 kernel: RIP: 0033:0x7fba60f4bc3e
May 28 18:01:42 jim-zbf15 kernel: Code: 48 8b 0d 5d b1 0e 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 2a b1 0e 00 f7 d8 64 89 01 48
May 28 18:01:42 jim-zbf15 kernel: RSP: 002b:00007ffd5ac58e28 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
May 28 18:01:42 jim-zbf15 kernel: RAX: ffffffffffffffda RBX: 000056437b699bf0 RCX: 00007fba60f4bc3e
May 28 18:01:42 jim-zbf15 kernel: RDX: 000056437b699ef0 RSI: 0000000003bb8588 RDI: 00007fba58ad4010
May 28 18:01:42 jim-zbf15 kernel: RBP: 00007fba58ad4010 R08: 0000000002061000 R09: 0000000000000000
May 28 18:01:42 jim-zbf15 kernel: R10: 00007fba60f409fb R11: 0000000000000246 R12: 000056437b699ef0
May 28 18:01:42 jim-zbf15 kernel: R13: 000056437b699b70 R14: 000056437b699bf0 R15: 000056437b699f60
May 28 18:01:42 jim-zbf15 kernel: </TASK>
May 28 18:01:42 jim-zbf15 kernel: Modules linked in: nvidia(POE+) cmac algif_hash algif_skcipher af_alg bnep btusb btrtl uvcvideo btbcm cdc_ether btintel videobuf2_vmalloc btmtk usbnet videobuf2_memops bluetooth rtsx_usb_sdmmc videobuf2_v4l2 rtsx_usb_ms r8152 mmc_core videobuf2_common memstick mii ecdh_generic videodev rtsx_usb crc16 qrtr snd_soc_skl_hda_dsp snd_soc_intel_hda_dsp_common snd_soc_hdac_hdmi snd_sof_probes hid_sensor_custom_intel_hinge hid_sensor_accel_3d hid_sensor_trigger industrialio_triggered_buffer kfifo_buf hid_sensor_iio_common industrialio hid_sensor_custom snd_hda_codec_hdmi hid_sensor_hub snd_hda_codec_realtek snd_hda_codec_generic intel_ishtp_hid snd_soc_dmic snd_sof_pci_intel_tgl snd_sof_intel_hda_common vfat soundwire_intel fat soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp joydev mousedev snd_sof snd_sof_utils snd_soc_hdac_hda iwlmvm snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi soundwire_bus intel_tcc_cooling ledtrig_audio
May 28 18:01:42 jim-zbf15 kernel: mac80211 tps6598x x86_pkg_temp_thermal intel_powerclamp snd_soc_core typec iTCO_wdt spi_nor coretemp libarc4 snd_compress intel_pmc_bxt mtd roles ac97_bus iTCO_vendor_support pmt_telemetry hid_multitouch kvm_intel intel_rapl_msr pmt_class snd_pcm_dmaengine iwlwifi kvm snd_hda_intel snd_usb_audio iwlmei processor_thermal_device_pci_legacy asus_nb_wmi snd_intel_dspcfg irqbypass snd_usbmidi_lib snd_intel_sdw_acpi crct10dif_pclmul asus_wmi processor_thermal_device crc32_pclmul snd_hda_codec snd_rawmidi cfg80211 ghash_clmulni_intel snd_hda_core processor_thermal_rfim platform_profile aesni_intel snd_seq_device i2c_i801 spi_intel_pci processor_thermal_mbox crypto_simd cryptd intel_cstate intel_uncore rfkill wmi_bmof spi_intel i2c_smbus snd_hwdep mc processor_thermal_rapl mei intel_lpss_pci intel_ish_ipc intel_rapl_common intel_lpss thunderbolt intel_soc_dts_iosf igen6_edac intel_ishtp idma64 snd_pcm intel_vsec i2c_hid_acpi i2c_hid snd_timer serial_multi_instantiate tpm_crb snd
May 28 18:01:42 jim-zbf15 kernel: int3403_thermal usbhid soundcore int340x_thermal_zone tpm_tis tpm_tis_core tpm rng_core intel_hid mac_hid int3400_thermal sparse_keymap acpi_tad acpi_pad acpi_thermal_rel soc_button_array uinput ipmi_devintf ipmi_msghandler crypto_user fuse acpi_call(OE) zram bpf_preload ip_tables x_tables btrfs blake2b_generic libcrc32c crc32c_generic xor raid6_pq serio_raw atkbd libps2 vivaldi_fmap mxm_wmi nvme nvme_core xhci_pci xhci_pci_renesas i8042 serio wmi radeon amdgpu gpu_sched drm_ttm_helper intel_agp crc32c_intel i915 drm_buddy video ttm drm_dp_helper intel_gtt
May 28 18:01:42 jim-zbf15 kernel: ---[ end trace 0000000000000000 ]---
May 28 18:01:42 jim-zbf15 kernel: RIP: 0010:exc_control_protection+0xc2/0xd0
May 28 18:01:42 jim-zbf15 kernel: Code: 8b 93 80 00 00 00 be f9 00 00 00 48 c7 c7 5e 8c 26 bc e8 71 80 30 ff e9 72 ff ff ff 48 c7 c7 45 8c 26 bc e8 35 2f fa ff 0f 0b <0f> 0b 66 66 2e 0f 1f 84 00 00 00 00 00 90 66 0f 1f 00 41 54 55 53
May 28 18:01:42 jim-zbf15 kernel: RSP: 0018:ffffbddc46c87b08 EFLAGS: 00010002
May 28 18:01:42 jim-zbf15 kernel: RAX: 0000000000000033 RBX: ffffbddc46c87b28 RCX: 0000000000000027
May 28 18:01:42 jim-zbf15 kernel: RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffff9ba6077216a0
May 28 18:01:42 jim-zbf15 kernel: RBP: 0000000000000003 R08: 0000000000000001 R09: 00000000ffffffea
May 28 18:01:42 jim-zbf15 kernel: R10: ffffffffbca5aa20 R11: 0000000000000002 R12: 0000000000000000
May 28 18:01:42 jim-zbf15 kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
May 28 18:01:42 jim-zbf15 kernel: FS: 00007fba60e36740(0000) GS:ffff9ba607700000(0000) knlGS:0000000000000000
May 28 18:01:42 jim-zbf15 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 28 18:01:42 jim-zbf15 kernel: CR2: 00005645ceb60000 CR3: 00000001122b8001 CR4: 0000000000f70ee0
May 28 18:01:42 jim-zbf15 kernel: PKRU: 55555554
May 28 18:01:42 jim-zbf15 python3[661]: [2781] ERROR: Xorg pre-start setup error
May 28 18:01:42 jim-zbf15 python3[661]: Traceback (most recent call last):
May 28 18:01:42 jim-zbf15 python3[661]: File "/usr/lib/python3.10/site-packages/optimus_manager/kernel.py", line 245, in _load_module
May 28 18:01:42 jim-zbf15 python3[661]: subprocess.check_call(
May 28 18:01:42 jim-zbf15 python3[661]: File "/usr/lib/python3.10/subprocess.py", line 369, in check_call
May 28 18:01:42 jim-zbf15 python3[661]: raise CalledProcessError(retcode, cmd)
May 28 18:01:42 jim-zbf15 python3[661]: subprocess.CalledProcessError: Command 'modprobe nvidia NVreg_UsePageAttributeTable=1 NVreg_DynamicPowerManagement=0x02' died with <Signals.SIGSEGV: 11>.
May 28 18:01:42 jim-zbf15 python3[661]: The above exception was the direct cause of the following exception:
May 28 18:01:42 jim-zbf15 python3[661]: Traceback (most recent call last):
May 28 18:01:42 jim-zbf15 python3[661]: File "/usr/lib/python3.10/site-packages/optimus_manager/hooks/pre_xorg_start.py", line 51, in main
May 28 18:01:42 jim-zbf15 python3[661]: setup_kernel_state(config, prev_state, requested_mode)
May 28 18:01:42 jim-zbf15 python3[661]: File "/usr/lib/python3.10/site-packages/optimus_manager/kernel.py", line 22, in setup_kernel_state
May 28 18:01:42 jim-zbf15 python3[661]: _nvidia_up(config, hybrid=(requested_mode == "hybrid"))
May 28 18:01:42 jim-zbf15 python3[661]: File "/usr/lib/python3.10/site-packages/optimus_manager/kernel.py", line 95, in _nvidia_up
May 28 18:01:42 jim-zbf15 python3[661]: _load_nvidia_modules(config, available_modules)
May 28 18:01:42 jim-zbf15 python3[661]: File "/usr/lib/python3.10/site-packages/optimus_manager/kernel.py", line 164, in _load_nvidia_modules
May 28 18:01:42 jim-zbf15 python3[661]: _load_module(available_modules, "nvidia", options=nvidia_options)
May 28 18:01:42 jim-zbf15 python3[661]: File "/usr/lib/python3.10/site-packages/optimus_manager/kernel.py", line 249, in _load_module
May 28 18:01:42 jim-zbf15 python3[661]: raise KernelSetupError(f"Error running modprobe for {module}: {e.stderr}") from e
May 28 18:01:42 jim-zbf15 python3[661]: optimus_manager.kernel.KernelSetupError: Error running modprobe for nvidia: None
May 28 18:01:42 jim-zbf15 python3[661]: [2782] INFO: Removing /etc/X11/xorg.conf.d/10-optimus-manager.conf (if present)
May 28 18:01:42 jim-zbf15 python3[661]: [2782] INFO: Writing state {'type': 'pre_xorg_start_failed', 'switch_id': '20220528T180139', 'requested_mode': 'hybrid'}
I guess the current Nvidia driver doesn't work with 5.18.
I'll let you know if I notice any other changes.
I guess the current Nvidia driver doesn’t work with 5.18.
opened 07:12AM - 25 May 22 UTC
closed 05:28PM - 10 Nov 22 UTC
bug
NV-Triaged
### NVIDIA Open GPU Kernel Modules Version
515.43.04
### Does this happen with… the proprietary driver (of the same version) as well?
Yes
### Operating System and Version
Arch Linux
### Kernel Release
5.18.0-arch1-1
### Hardware: GPU
RTX 3070 laptop (System 76 Oryx 8)
### Describe the bug
Since upgrading to Kernel 5.18, loading the nvidia driver (Or proprietary one) fails with the same kernel log:
```
[ 5.429675] nvidia-nvlink: Nvlink Core is being initialized, major device number 510
[ 5.429718] traps: Missing ENDBR: _portMemAllocatorAllocNonPagedWrapper+0x0/0x10 [nvidia]
[ 5.429816] ------------[ cut here ]------------
[ 5.429817] kernel BUG at arch/x86/kernel/traps.c:252!
[ 5.429828] invalid opcode: 0000 [#1] PREEMPT SMP NOPTI
[ 5.429830] CPU: 9 PID: 948 Comm: modprobe Tainted: G OE 5.18.0-arch1-1 #1 b71a70fe104889aac2f32556bc52f649da2881d2
[ 5.429832] Hardware name: System76 Oryx Pro/Oryx Pro, BIOS 2021-09-23_b9b0e89 09/23/2021
[ 5.429833] RIP: 0010:exc_control_protection+0xc2/0xd0
[ 5.429837] Code: 8b 93 80 00 00 00 be f9 00 00 00 48 c7 c7 d3 ab 66 b5 e8 d1 01 50 ff e9 72 ff ff ff 48 c7 c7 ba ab 66 b5 e8 c7 31 fb ff 0f 0b <0f> 0b 66 66 2e 0f 1f 84 00 00 00 00 00 90 66 0f 1f 00 55 53 48 89
[ 5.429838] RSP: 0018:ffffa9c3413b3bb8 EFLAGS: 00010002
[ 5.429839] RAX: 000000000000004d RBX: ffffa9c3413b3bd8 RCX: 0000000000000027
[ 5.429840] RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffff9d195fa616a0
[ 5.429841] RBP: 0000000000000003 R08: 0000000000000000 R09: ffffa9c3413b39d8
[ 5.429842] R10: 0000000000000003 R11: ffffffffb5ecaa08 R12: 0000000000000000
[ 5.429842] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 5.429843] FS: 00007f0aa9bbe740(0000) GS:ffff9d195fa40000(0000) knlGS:0000000000000000
[ 5.429844] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 5.429845] CR2: 00007f0aa8382000 CR3: 00000001063ce002 CR4: 0000000000f70ee0
[ 5.429846] PKRU: 55555554
[ 5.429847] Call Trace:
[ 5.429848] <TASK>
[ 5.429849] asm_exc_control_protection+0x22/0x30
[ 5.429852] RIP: 0010:_portMemAllocatorAllocNonPagedWrapper+0x0/0x10 [nvidia]
[ 5.429920] Code: 08 48 89 d0 48 89 0f 48 c1 e0 17 48 31 c2 48 89 c8 48 c1 e8 05 48 31 c8 48 31 d0 48 c1 ea 12 48 31 d0 48 89 47 08 01 c8 c3 90 <48> 89 f7 e9 38 0f 00 00 0f 1f 84 00 00 00 00 00 48 89 f7 e9 88 0f
[ 5.429921] RSP: 0018:ffffa9c3413b3c80 EFLAGS: 00010202
[ 5.429922] RAX: ffffffffc1eae5f0 RBX: 0000000000000010 RCX: 0000000000000000
[ 5.429923] RDX: 0000000000000000 RSI: 000000000000002c RDI: ffffffffc20f7b70
[ 5.429923] RBP: ffffa9c3413b3c98 R08: 0000000000000020 R09: ffffffffc20f7bf0
[ 5.429924] R10: ffffffffc20f55d0 R11: 0000000000000000 R12: ffffffffc20f7b70
[ 5.429925] R13: 00007f0aa8382dc0 R14: 000055916224ef30 R15: ffffa9c3413b3e20
[ 5.429926] ? portCryptoPseudoRandomGeneratorGetU32+0x30/0x30 [nvidia 5737a4bc014c2c47af46ebdec30e9ee078e09f14]
[ 5.429991] _portMemAllocatorAlloc+0x2e/0x170 [nvidia 5737a4bc014c2c47af46ebdec30e9ee078e09f14]
[ 5.430054] portCryptoPseudoRandomGeneratorCreate+0x16/0xb0 [nvidia 5737a4bc014c2c47af46ebdec30e9ee078e09f14]
[ 5.430117] portCryptoInitialize+0x2a/0x40 [nvidia 5737a4bc014c2c47af46ebdec30e9ee078e09f14]
[ 5.430182] portInitialize+0x2b/0x40 [nvidia 5737a4bc014c2c47af46ebdec30e9ee078e09f14]
[ 5.430246] coreInitializeRm+0x24/0x90 [nvidia 5737a4bc014c2c47af46ebdec30e9ee078e09f14]
[ 5.430324] RmInitRm+0x9/0x20 [nvidia 5737a4bc014c2c47af46ebdec30e9ee078e09f14]
[ 5.430399] rm_init_rm+0x9/0x10 [nvidia 5737a4bc014c2c47af46ebdec30e9ee078e09f14]
[ 5.430472] nvidia_init_module+0x22e/0x5b0 [nvidia 5737a4bc014c2c47af46ebdec30e9ee078e09f14]
[ 5.430517] ? nvidia_init_module+0x5b0/0x5b0 [nvidia 5737a4bc014c2c47af46ebdec30e9ee078e09f14]
[ 5.430565] nvidia_frontend_init_module+0x50/0x91 [nvidia 5737a4bc014c2c47af46ebdec30e9ee078e09f14]
[ 5.430616] ? nvidia_init_module+0x5b0/0x5b0 [nvidia 5737a4bc014c2c47af46ebdec30e9ee078e09f14]
[ 5.430663] do_one_initcall+0x5a/0x220
[ 5.430667] do_init_module+0x4a/0x240
[ 5.430670] __do_sys_init_module+0x138/0x1b0
[ 5.430672] do_syscall_64+0x5c/0x90
[ 5.430674] ? exc_page_fault+0x74/0x170
[ 5.430676] entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 5.430677] RIP: 0033:0x7f0aa9512c3e
[ 5.430679] Code: 48 8b 0d 5d b1 0e 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 2a b1 0e 00 f7 d8 64 89 01 48
[ 5.430680] RSP: 002b:00007fff39f3cc58 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
[ 5.430681] RAX: ffffffffffffffda RBX: 000055916224ebd0 RCX: 00007f0aa9512c3e
[ 5.430682] RDX: 000055916224ef30 RSI: 00000000008f1db0 RDI: 00007f0aa7a91010
[ 5.430682] RBP: 00007f0aa7a91010 R08: 000055916224eae0 R09: 0000000000000000
[ 5.430683] R10: 0000000000000005 R11: 0000000000000246 R12: 000055916224ef30
[ 5.430684] R13: 000055916224ed00 R14: 000055916224ebd0 R15: 000055916224ef60
[ 5.430685] </TASK>
[ 5.430685] Modules linked in: pcc_cpufreq(-) nvidia(OE+) acpi_cpufreq(-) bnep bridge stp llc btusb btrtl btbcm btintel uvcvideo btmtk videobuf2_vmalloc bluetooth videobuf2_memops videobuf2_v4l2 videobuf2_common ecdh_generic videodev mc snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_hda_codec_realtek snd_sof_intel_hda snd_hda_codec_generic snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_soc_hdac_hda iwlmvm snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi joydev intel_tcc_cooling soundwire_bus mousedev ledtrig_audio mac80211 x86_pkg_temp_thermal intel_powerclamp snd_soc_core coretemp snd_compress ac97_bus kvm_intel libarc4 hid_multitouch snd_hda_codec_hdmi 8250_dw spi_nor mei_pxp snd_pcm_dmaengine mei_hdcp ee1004 mtd i915 iTCO_wdt snd_hda_intel kvm intel_pmc_bxt snd_intel_dspcfg iTCO_vendor_support intel_rapl_msr iwlwifi irqbypass snd_intel_sdw_acpi snd_hda_codec crct10dif_pclmul crc32_pclmul
[ 5.430709] ghash_clmulni_intel snd_hda_core iwlmei vfat aesni_intel processor_thermal_device_pci_legacy processor_thermal_device pmt_telemetry snd_hwdep crypto_simd pmt_class cryptd fat intel_cstate r8169 drm_buddy cfg80211 intel_uncore snd_pcm processor_thermal_rfim realtek psmouse ttm processor_thermal_mbox mei_me snd_timer rfkill pcspkr i2c_i801 mdio_devres processor_thermal_rapl intel_lpss_pci spi_intel_pci intel_rapl_common snd libphy intel_lpss drm_dp_helper spi_intel i2c_smbus soundcore int340x_thermal_zone thunderbolt mei i2c_hid_acpi idma64 intel_gtt intel_vsec intel_soc_dts_iosf i2c_hid intel_hid video intel_scu_pltdrv sparse_keymap system76_acpi mac_hid coreboot_table dm_multipath dm_mod ipmi_devintf ipmi_msghandler crypto_user acpi_call(OE) fuse bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 serio_raw atkbd uas libps2 usb_storage usbhid vivaldi_fmap nvme xhci_pci nvme_core crc32c_intel i8042 xhci_pci_renesas serio
[ 5.430736] ---[ end trace 0000000000000000 ]---
```
### To Reproduce
1. Upgrade to kernel 5.18
2. Reboot
3. Observe nvidia module won't load and check kernel logs for the same error
### Bug Incidence
Always
### nvidia-bug-report.log.gz
[nvidia-bug-report.log.gz](https://github.com/NVIDIA/open-gpu-kernel-modules/files/8768756/nvidia-bug-report.log.gz)
### More Info
Originally I thought this issue was to do with optimus-manager (As I am using a hybrid setup I use that utility to switch between intel and nvidia mode), but after uninstalling optimus manager the same issue occurs
Add ibt=off kernel parameter.
Thanks.
I will not add the parameter for a few days and see if I am still experiencing the other issues. If not, it may be the Nvidia driver causing the issues. Not sure why this would be a direct issue though since it seems to be a 12th gen processor feature. I have an 11th gen processor.
Since upgrading to 5.18 final, I've not seen a crash or major issue in the past few days. I will set this as the solution. I can't pinpoint what exact change fixed the issues or what the cause was. Since I tried and had the same issues in an earlier beta of 5.18, it could be something changed or added in a later beta. I don't believe it had anything to do with the Nvidia driver, since it didn't work in the 5.18 beta I had tried, yet the issues were still present then.
I am still seeing a few of the minor "symptoms" I was seeing before, but they don't get progressively worse like they used to... or at least nowhere near as quickly as they did.
system
Closed
2 June 2022 13:55
12
This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.