BUG: kernel NULL pointer dereference after SCALE 22.02.2 upgrade

Description

[ 12.475172] BUG: kernel NULL pointer dereference, address: 0000000000000058
[ 12.487143] #PF: supervisor read access in kernel mode
[ 12.497696] #PF: error_code(0x0000) - not-present page
[ 12.507923] PGD 0 P4D 0
[ 12.516087] Oops: 0000 #1 SMP NOPTI
[ 12.525900] CPU: 9 PID: 283 Comm: systemd-udevd Tainted: G I E 5.10.120+truenas #1
[ 12.539921] Hardware name: Dell Inc. PowerEdge R740xd2/0C2PJH, BIOS 2.13.3 12/17/2021
[ 12.553286] RIP: 0010:ses_match_to_enclosure+0xd/0xd0 [ses]
[ 12.564584] Code: 01 c0 41 8d 58 0c 48 63 db 48 01 fb e9 80 fd ff ff 49 63 f1 e9 05 fe ff ff 0f 1f 40 00 0f 1f 44 00 00 53 48 89 f3 48 83 ec 18 <48> 8b 77 58 65 48 8b 04 25 28 00 00 00 48 89 44 24 10 31 c0 48 c7
[ 12.597979] RSP: 0018:ffffa37bc13f3d28 EFLAGS: 00010296
[ 12.610190] RAX: 0000000000000000 RBX: ffff8be881103000 RCX: 0000000000000000
[ 12.624516] RDX: 0000000000000001 RSI: ffff8be881103000 RDI: 0000000000000000
[ 12.637913] RBP: 0000000000000000 R08: ffff8bea88eeb6a0 R09: ffff8bea88eeb668
[ 12.652577] R10: 0000000000000001 R11: 0000000000000000 R12: ffff8bea88eeb600
[ 12.666123] R13: ffff8be881103470 R14: ffff8be881103000 R15: 00000000ffffffed
[ 12.682233] FS: 00007f6653f458c0(0000) GS:ffff8bffbfa40000(0000) knlGS:0000000000000000
[ 12.696810] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 12.709270] CR2: 0000000000000058 CR3: 0000000113a36005 CR4: 00000000007706e0
[ 12.723234] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 12.737128] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 12.751200] PKRU: 55555554
[ 12.760786] Call Trace:
[ 12.771087] ? enclosure_find+0x8f/0xa0 [enclosure]
[ 12.784117] ses_intf_add+0x619/0x61e [ses]
[ 12.795337] class_interface_register+0x116/0x130
[ 12.808720] ? 0xffffffffc01bc000
[ 12.819435] ses_init+0x13/0x1000 [ses]
[ 12.832375] do_one_initcall+0x44/0x1d0
[ 12.843769] ? do_init_module+0x23/0x240
[ 12.854300] ? kmem_cache_alloc_trace+0xf5/0x200
[ 12.868173] do_init_module+0x4c/0x240
[ 12.879424] __do_sys_finit_module+0xb1/0x110
[ 12.890944] do_syscall_64+0x33/0x80
[ 12.903142] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 12.914999] RIP: 0033:0x7f66543fe9b9
[ 12.928395] Code: 00 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a7 54 0c 00 f7 d8 64 89 01 48
[ 12.961319] RSP: 002b:00007ffe9e649698 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
[ 12.975801] RAX: ffffffffffffffda RBX: 0000560a3e461c00 RCX: 00007f66543fe9b9
[ 12.990446] RDX: 0000000000000000 RSI: 00007f6654589e2d RDI: 000000000000000d
[ 13.004875] RBP: 0000000000020000 R08: 0000000000000000 R09: 0000560a3e4932e0
[ 13.019817] R10: 000000000000000d R11: 0000000000000246 R12: 00007f6654589e2d
[ 13.034698] R13: 0000000000000000 R14: 0000560a3e460390 R15: 0000560a3e461c00
[ 13.048659] Modules linked in: ses(E+) enclosure(E) scsi_transport_sas(E) usbhid(E) hid(E) sd_mod(E) t10_pi(E) crc_t10dif(E) crct10dif_generic(E) crct10dif_pclmul(E) crct10dif_common(E) crc32_pclmul(E) ahci(E) crc32c_intel(E) ahciem(E) l ibahci(E) xhci_pci(E) ghash_clmulni_intel(E) tg3(E) xhci_hcd(E) libata(E) megaraid_sas(E) aesni_intel(E) libphy(E) libaes(E) crypto_simd(E) usbcore(E) i2c_i801(E) cryptd(E) ptp(E) bnxt_en(E) glue_helper(E) i2c_smbus(E) lpc_ich(E) pps_core(E ) scsi_mod(E) usb_common(E) wmi(E)
[ 13.139987] CR2: 0000000000000058
[ 13.150665] ---[ end trace 897dad6b0f4b1f0f ]---
[ 13.205819] RIP: 0010:ses_match_to_enclosure+0xd/0xd0 [ses]
[ 13.219183] Code: 01 c0 41 8d 58 0c 48 63 db 48 01 fb e9 80 fd ff ff 49 63 f1 e9 05 fe ff ff 0f 1f 40 00 0f 1f 44 00 00 53 48 89 f3 48 83 ec 18 <48> 8b 77 58 65 48 8b 04 25 28 00 00 00 48 89 44 24 10 31 c0 48 c7
[ 13.253708] RSP: 0018:ffffa37bc13f3d28 EFLAGS: 00010296
[ 13.270294] RAX: 0000000000000000 RBX: ffff8be881103000 RCX: 0000000000000000
[ 13.286908] RDX: 0000000000000001 RSI: ffff8be881103000 RDI: 0000000000000000
[ 13.301928] RBP: 0000000000000000 R08: ffff8bea88eeb6a0 R09: ffff8bea88eeb668
[ 13.317076] R10: 0000000000000001 R11: 0000000000000000 R12: ffff8bea88eeb600
[ 13.331583] R13: ffff8be881103470 R14: ffff8be881103000 R15: 00000000ffffffed
[ 13.346994] FS: 00007f6653f458c0(0000) GS:ffff8bffbfa40000(0000) knlGS:0000000000000000
[ 13.362687] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 13.377991] CR2: 0000000000000058 CR3: 0000000113a36005 CR4: 00000000007706e0
[ 13.392648] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 13.407129] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 13.421472] PKRU: 55555554

I imagine this is related to the new enclosure management feature. This is a fairly standard Dell R740xd2 server with only the internal enclosure on a PERC H730P Mini controller. Let me know if there's any other information I can provide.

Problem/Justification

None

Impact

None

Activity

Bug Clerk 
June 22, 2022 at 7:07 PM

Bug Clerk 
June 22, 2022 at 1:32 PM

Richard Stanway 
June 21, 2022 at 9:44 PM

Thanks for the quick response, I can confirm that with the workaround there is no longer an Oops on boot and systemd-udevd is behaving normally.

Bug Clerk 
June 21, 2022 at 9:27 PM

Ryan Moeller 
June 21, 2022 at 9:25 PM

I see the issue, thanks for the report. For the time being you may disable the AHCI enclosure support from the 22.02.1 boot environment:

Complete

Details

Assignee

Reporter

Labels

Impact

Time remaining

0m

Components

Affects versions

Priority

Katalon Platform

Created June 21, 2022 at 8:04 PM
Updated July 15, 2022 at 3:47 PM
Resolved June 22, 2022 at 7:53 PM