From: Hannes Reinecke <hare@suse.com>
To: Sagi Grimberg <sagi@grimberg.me>
Cc: "linux-nvme@lists.infradead.org" <linux-nvme@lists.infradead.org>,
	"linux-block@vger.kernel.org" <linux-block@vger.kernel.org>
Subject: Kernel oops with 6.14 when enabling TLS
Date: Fri, 28 Feb 2025 11:47:23 +0100	[thread overview]
Message-ID: <08c29e4b-2f71-4b6d-8046-27e407214d8c@suse.com> (raw)

Hi Sagi,

enabling TLS on latest linus tree reliably crashes my system:

[  487.018058] ------------[ cut here ]------------
[  487.024046] WARNING: CPU: 9 PID: 6159 at mm/slub.c:4719 
free_large_kmalloc+0x15/0xa0
[  487.033549] Modules linked in: tls(E) nvme_tcp(E) af_packet(E) 
iscsi_ibft(E) iscsi_boot_sysfs(E) amd_atl(E) intel_rapl_msr(E) 
intel_rapl_common(E) amd64_edac(E) edac_mce_amd(E) nls_iso8859_1(E) 
nls_cp437(E) dax_hmem(E) vfat(E) cxl_acpi(E) fat(E) kvm_amd(E) 
ipmi_ssif(E) cxl_port(E) xfs(E) tg3(E) cxl_core(E) ipmi_si(E) i40e(E) 
kvm(E) einj(E) wmi_bmof(E) acpi_cpufreq(E) ipmi_devintf(E) libphy(E) 
k10temp(E) libie(E) i2c_piix4(E) i2c_smbus(E) ipmi_msghandler(E) 
i2c_designware_platform(E) i2c_designware_core(E) button(E) 
nvme_fabrics(E) nvme_keyring(E) fuse(E) efi_pstore(E) configfs(E) 
dmi_sysfs(E) ip_tables(E) x_tables(E) ahci(E) libahci(E) 
ghash_clmulni_intel(E) libata(E) sha512_ssse3(E) sd_mod(E) 
sha256_ssse3(E) ast(E) sha1_ssse3(E) drm_client_lib(E) scsi_dh_emc(E) 
i2c_algo_bit(E) aesni_intel(E) drm_shmem_helper(E) scsi_dh_rdac(E) 
crypto_simd(E) cryptd(E) xhci_pci(E) drm_kms_helper(E) scsi_dh_alua(E) 
nvme(E) sg(E) xhci_hcd(E) nvme_core(E) scsi_mod(E) drm(E) nvme_auth(E) 
scsi_common(E) usbcore(E) sp5100_tco(E) ccp(E)
[  487.033696]  wmi(E) btrfs(E) blake2b_generic(E) xor(E) raid6_pq(E) 
efivarfs(E)
[  487.033707] CPU: 9 UID: 0 PID: 6159 Comm: nvme Kdump: loaded Tainted: 
G        W   E      6.14.0-rc4-default+ #292 
f1e35f01b401c038558e67f3c2d644747de50dbd
[  487.033713] Tainted: [W]=WARN, [E]=UNSIGNED_MODULE
[  487.033715] Hardware name: Lenovo ThinkSystem SR655V3/SB27B09914, 
BIOS KAE111E-2.10 04/11/2023
[  487.033717] RIP: 0010:free_large_kmalloc+0x15/0xa0
[  487.033722] Code: 80 00 00 00 00 90 90 90 90 90 90 90 90 90 90 90 90 
90 90 90 90 0f 1f 44 00 00 55 53 48 89 fb 48 83 ec 08 48 8b 07 a8 40 75 
4b <0f> 0b 80 3d 24 4e 8e 01 00 ba 00 f0 ff ff 74 5d 9c 58 0f 1f 40 00
[  487.205280] RSP: 0018:ff4de44e432a7b70 EFLAGS: 00010246
[  487.205284] RAX: 000fffffc0000000 RBX: ffd659b280210b80 RCX: 
ff42118489e0cd80
[  487.205286] RDX: 0000000000000000 RSI: ff4211848842e000 RDI: 
ffd659b280210b80
[  487.205288] RBP: ff4de44e432a7be0 R08: 0000000000000001 R09: 
0000000000000002
[  487.205289] R10: ff4de44e432a7c00 R11: 0000000000000104 R12: 
ffd659b280210b80
[  487.205291] R13: ff4211848842e000 R14: ff421186d0696520 R15: 
ff421186e19c4000
[  487.205292] FS:  00007f66b8ffd800(0000) GS:ff4211874d980000(0000) 
knlGS:0000000000000000
[  487.205294] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  487.205296] CR2: 00007f66b9150d7e CR3: 0000000281a5e005 CR4: 
0000000000771ef0
[  487.205298] PKRU: 55555554
[  487.205306] Call Trace:
[  487.205309]  <TASK>
[  487.205314]  ? __warn+0x85/0x130
[  487.296763]  ? free_large_kmalloc+0x15/0xa0
[  487.296772]  ? report_bug+0xf8/0x1e0
[  487.296779]  ? handle_bug+0x50/0xa0
[  487.296783]  ? exc_invalid_op+0x13/0x60
[  487.296786]  ? asm_exc_invalid_op+0x16/0x20
[  487.296796]  ? free_large_kmalloc+0x15/0xa0
[  487.296801]  kfree+0x234/0x320
[  487.332065]  ? srso_alias_return_thunk+0x5/0xfbef5
[  487.332075]  ? nvmf_connect_admin_queue+0x105/0x1a0 [nvme_fabrics 
34d997d53c805aa2fae8e8baee6a736e8da38358]
[  487.332081]  ? nvmf_connect_admin_queue+0xa1/0x1a0 [nvme_fabrics 
34d997d53c805aa2fae8e8baee6a736e8da38358]
[  487.332084]  nvmf_connect_admin_queue+0x105/0x1a0 [nvme_fabrics 
34d997d53c805aa2fae8e8baee6a736e8da38358]
[  487.332093]  nvme_tcp_start_queue+0x18f/0x310 [nvme_tcp 
68f6be106f52ac467179f8a0922f02aeb6fa1f1c]
[  487.332102]  nvme_tcp_setup_ctrl+0xf8/0x700 [nvme_tcp 
68f6be106f52ac467179f8a0922f02aeb6fa1f1c]
[  487.394479]  ? nvme_change_ctrl_state+0x99/0x1b0 [nvme_core 
22f0ce18ead628230226a9b87ebf48eb576bf299]
[  487.394495]  nvme_tcp_create_ctrl+0x2e3/0x4d0 [nvme_tcp 
68f6be106f52ac467179f8a0922f02aeb6fa1f1c]
[  487.394503]  nvmf_dev_write+0x323/0x3d0 [nvme_fabrics 
34d997d53c805aa2fae8e8baee6a736e8da38358]
[  487.394508]  ? srso_alias_return_thunk+0x5/0xfbef5
[  487.394514]  vfs_write+0xd9/0x430
[  487.394521]  ? srso_alias_return_thunk+0x5/0xfbef5
[  487.394523]  ? __handle_mm_fault+0x7da/0xca0
[  487.394531]  ksys_write+0x68/0xe0
[  487.394536]  do_syscall_64+0x74/0x160
[  487.394543]  ? srso_alias_return_thunk+0x5/0xfbef5
[  487.394545]  ? __count_memcg_events+0x98/0x130
[  487.394550]  ? srso_alias_return_thunk+0x5/0xfbef5
[  487.476947]  ? count_memcg_events.constprop.163+0x1a/0x30
[  487.476956]  ? srso_alias_return_thunk+0x5/0xfbef5
[  487.476960]  ? handle_mm_fault+0xa1/0x290
[  487.476966]  ? srso_alias_return_thunk+0x5/0xfbef5
[  487.476968]  ? do_user_addr_fault+0x56b/0x830
[  487.476975]  ? srso_alias_return_thunk+0x5/0xfbef5
[  487.476977]  ? exc_page_fault+0x68/0x150
[  487.476983]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
[  487.476989] RIP: 0033:0x7f66b91216f0
[  487.476994] Code: 66 2e 0f 1f 84 00 00 00 00 00 90 90 90 90 90 90 90 
90 90 90 90 90 90 90 90 90 80 3d 19 c3 0e 00 00 74 17 b8 01 00 00 00 0f 
05 <48> 3d 00 f0 ff ff 77 58 c3 0f 1f 80 00 00 00 00 48 83 ec 28 48 89
[  487.551619] RSP: 002b:00007ffef5393ed8 EFLAGS: 00000202 ORIG_RAX: 
0000000000000001
[  487.551623] RAX: ffffffffffffffda RBX: 000055e61c2113b0 RCX: 
00007f66b91216f0
[  487.551625] RDX: 00000000000000ed RSI: 000055e61c2113b0 RDI: 
0000000000000003
[  487.551627] RBP: 0000000000000003 R08: 00000000000000ed R09: 
000055e61c2113b0
[  487.551628] R10: 0000000000000000 R11: 0000000000000202 R12: 
00000000000000ed
[  487.551630] R13: 00007f66b9380008 R14: 000055e61c20a980 R15: 
000055e61c20b100
[  487.551637]  </TASK>
[  487.551639] ---[ end trace 0000000000000000 ]---
[  487.551642] object pointer: 0x00000000346cb6fc
[  487.554112] nvme nvme1: creating 32 I/O queues.
[  489.396262] nvme nvme1: mapped 32/0/0 default/read/poll queues.
[  489.405197] Oops: general protection fault, probably for 
non-canonical address 0xdead000000000100: 0000 [#1] PREEMPT SMP NOPTI
[  489.418790] CPU: 9 UID: 0 PID: 6159 Comm: nvme Kdump: loaded Tainted: 
G        W   E      6.14.0-rc4-default+ #292 
f1e35f01b401c038558e67f3c2d644747de50dbd
[  489.435212] Tainted: [W]=WARN, [E]=UNSIGNED_MODULE
[  489.441381] Hardware name: Lenovo ThinkSystem SR655V3/SB27B09914, 
BIOS KAE111E-2.10 04/11/2023
[  489.451841] RIP: 0010:__rmqueue_pcplist+0xe1/0xc80
[  489.458016] Code: 06 48 83 e8 08 48 89 44 24 70 48 8b 45 00 48 39 c5 
0f 84 72 01 00 00 48 8b 55 00 48 8b 32 48 8b 4a 08 48 8d 42 f8 48 89 4e 
08 <48> 89 31 48 b9 00 01 00 00 00 00 ad de 48 89 0a 48 8b 4c 24 20 48
[  489.479905] RSP: 0018:ff4de44e432a7688 EFLAGS: 00010293
[  489.486567] RAX: ffd659b280210a00 RBX: ff4211874d9bf400 RCX: 
dead000000000100
[  489.495370] RDX: ffd659b280210a08 RSI: ffd659b281ed7e08 RDI: 
ff421184fffd60c0
[  489.504174] RBP: ff4211874d9bf4b0 R08: ff4211874d9bf400 R09: 
ff4211874d9bf4b0
[  489.512976] R10: 0000000000002acd R11: 0000000000000002 R12: 
0000000000000003
[  489.521779] R13: 0000000000000003 R14: 0000000000252800 R15: 
ff421184fffd60c0
[  489.530584] FS:  00007f66b8ffd800(0000) GS:ff4211874d980000(0000) 
knlGS:0000000000000000
[  489.540460] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  489.547700] CR2: 00007f66b9150d7e CR3: 0000000281a5e005 CR4: 
0000000000771ef0
[  489.556502] PKRU: 55555554
[  489.560337] Call Trace:
[  489.563876]  <TASK>
[  489.567022]  ? __die_body+0x1a/0x60
[  489.571728]  ? die_addr+0x38/0x60
[  489.576241]  ? exc_general_protection+0x19e/0x430
[  489.582325]  ? asm_exc_general_protection+0x22/0x30
[  489.588603]  ? __rmqueue_pcplist+0xe1/0xc80
[  489.594092]  ? __rmqueue_pcplist+0x51b/0xc80
[  489.599687]  get_page_from_freelist+0xe10/0x1680
[  489.605675]  __alloc_frozen_pages_noprof+0x171/0x340
[  489.612048]  new_slab+0x90/0x4d0
[  489.616466]  ___slab_alloc+0x6f3/0xb20
[  489.621469]  ? sbitmap_init_node+0x77/0x1a0
[  489.626961]  ? srso_alias_return_thunk+0x5/0xfbef5
[  489.633134]  ? _get_random_bytes.part.18+0x90/0x120
[  489.639409]  ? __slab_alloc.isra.98+0x22/0x40
[  489.645101]  __slab_alloc.isra.98+0x22/0x40
[  489.650597]  __kmalloc_node_noprof+0x218/0x510
[  489.656380]  ? sbitmap_init_node+0x77/0x1a0
[  489.661874]  ? sbitmap_init_node+0x77/0x1a0
[  489.667362]  ? srso_alias_return_thunk+0x5/0xfbef5
[  489.673534]  sbitmap_init_node+0x77/0x1a0
[  489.678830]  sbitmap_queue_init_node+0x24/0x150
[  489.684709]  blk_mq_init_tags+0x7e/0x110
[  489.689915]  blk_mq_alloc_map_and_rqs+0x44/0x320
[  489.695898]  __blk_mq_alloc_map_and_rqs+0x3b/0x60
[  489.701973]  blk_mq_alloc_tag_set+0x1f1/0x380
[  489.707662]  nvme_alloc_io_tag_set+0xc2/0x1a0 [nvme_core 
22f0ce18ead628230226a9b87ebf48eb576bf299]
[  489.718534]  ? srso_alias_return_thunk+0x5/0xfbef5
[  489.724707]  ? nvme_tcp_alloc_queue+0x293/0x7b0 [nvme_tcp 
68f6be106f52ac467179f8a0922f02aeb6fa1f1c]
[  489.735676]  ? __pfx_nvme_tcp_tls_done+0x10/0x10 [nvme_tcp 
68f6be106f52ac467179f8a0922f02aeb6fa1f1c]
[  489.746737]  nvme_tcp_setup_ctrl+0x3ee/0x700 [nvme_tcp 
68f6be106f52ac467179f8a0922f02aeb6fa1f1c]
[  489.757401]  nvme_tcp_create_ctrl+0x2e3/0x4d0 [nvme_tcp 
68f6be106f52ac467179f8a0922f02aeb6fa1f1c]
[  489.768165]  nvmf_dev_write+0x323/0x3d0 [nvme_fabrics 
34d997d53c805aa2fae8e8baee6a736e8da38358]
[  489.778732]  ? srso_alias_return_thunk+0x5/0xfbef5
[  489.784906]  vfs_write+0xd9/0x430
[  489.789422]  ? srso_alias_return_thunk+0x5/0xfbef5
[  489.795591]  ? __handle_mm_fault+0x7da/0xca0
[  489.801183]  ksys_write+0x68/0xe0
[  489.805700]  do_syscall_64+0x74/0x160
[  489.810616]  ? srso_alias_return_thunk+0x5/0xfbef5
[  489.816794]  ? __count_memcg_events+0x98/0x130
[  489.822580]  ? srso_alias_return_thunk+0x5/0xfbef5
[  489.828750]  ? count_memcg_events.constprop.163+0x1a/0x30
[  489.835606]  ? srso_alias_return_thunk+0x5/0xfbef5
[  489.841777]  ? handle_mm_fault+0xa1/0x290
[  489.847070]  ? srso_alias_return_thunk+0x5/0xfbef5
[  489.853241]  ? do_user_addr_fault+0x56b/0x830
[  489.858928]  ? srso_alias_return_thunk+0x5/0xfbef5
[  489.865101]  ? exc_page_fault+0x68/0x150
[  489.870302]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
[  489.876769] RIP: 0033:0x7f66b91216f0
[  489.881573] Code: 66 2e 0f 1f 84 00 00 00 00 00 90 90 90 90 90 90 90 
90 90 90 90 90 90 90 90 90 80 3d 19 c3 0e 00 00 74 17 b8 01 00 00 00 0f 
05 <48> 3d 00 f0 ff ff 77 58 c3 0f 1f 80 00 00 00 00 48 83 ec 28 48 89
[  489.903462] RSP: 002b:00007ffef5393ed8 EFLAGS: 00000202 ORIG_RAX: 
0000000000000001
[  489.912761] RAX: ffffffffffffffda RBX: 000055e61c2113b0 RCX: 
00007f66b91216f0
[  489.921564] RDX: 00000000000000ed RSI: 000055e61c2113b0 RDI: 
0000000000000003
[  489.930366] RBP: 0000000000000003 R08: 00000000000000ed R09: 
000055e61c2113b0
[  489.939169] R10: 0000000000000000 R11: 0000000000000202 R12: 
00000000000000ed
[  489.947970] R13: 00007f66b9380008 R14: 000055e61c20a980 R15: 
000055e61c20b100
[  489.956783]  </TASK>
[  489.960024] Modules linked in: tls(E) nvme_tcp(E) af_packet(E) 
iscsi_ibft(E) iscsi_boot_sysfs(E) amd_atl(E) intel_rapl_msr(E) 
intel_rapl_common(E) amd64_edac(E) edac_mce_amd(E) nls_iso8859_1(E) 
nls_cp437(E) dax_hmem(E) vfat(E) cxl_acpi(E) fat(E) kvm_amd(E) 
ipmi_ssif(E) cxl_port(E) xfs(E) tg3(E) cxl_core(E) ipmi_si(E) i40e(E) 
kvm(E) einj(E) wmi_bmof(E) acpi_cpufreq(E) ipmi_devintf(E) libphy(E) 
k10temp(E) libie(E) i2c_piix4(E) i2c_smbus(E) ipmi_msghandler(E) 
i2c_designware_platform(E) i2c_designware_core(E) button(E) 
nvme_fabrics(E) nvme_keyring(E) fuse(E) efi_pstore(E) configfs(E) 
dmi_sysfs(E) ip_tables(E) x_tables(E) ahci(E) libahci(E) 
ghash_clmulni_intel(E) libata(E) sha512_ssse3(E) sd_mod(E) 
sha256_ssse3(E) ast(E) sha1_ssse3(E) drm_client_lib(E) scsi_dh_emc(E) 
i2c_algo_bit(E) aesni_intel(E) drm_shmem_helper(E) scsi_dh_rdac(E) 
crypto_simd(E) cryptd(E) xhci_pci(E) drm_kms_helper(E) scsi_dh_alua(E) 
nvme(E) sg(E) xhci_hcd(E) nvme_core(E) scsi_mod(E) drm(E) nvme_auth(E) 
scsi_common(E) usbcore(E) sp5100_tco(E) ccp(E)
[  489.960207]  wmi(E) btrfs(E) blake2b_generic(E) xor(E) raid6_pq(E) 
efivarfs(E)

Haven't found a culprit for that one for now, started bisecting.
Just wanted to report that as a heads-up, maybe you have some idea.

Cheers,

Hannes
-- 
Dr. Hannes Reinecke                  Kernel Storage Architect
hare@suse.com                               +49 911 74053 688
SUSE Software Solutions GmbH, Frankenstr. 146, 90461 Nürnberg
HRB 36809 (AG Nürnberg), GF: I. Totev, A. McDonald, W. Knoblich


             reply	other threads:[~2025-02-28 10:47 UTC|newest]

Thread overview: 38+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-02-28 10:47 Hannes Reinecke [this message]
2025-02-28 21:49 ` Kernel oops with 6.14 when enabling TLS Sagi Grimberg
2025-03-03  7:48 ` Hannes Reinecke
2025-03-03 11:06   ` Hannes Reinecke
2025-03-03 12:57     ` Hannes Reinecke
2025-03-03 13:57     ` Matthew Wilcox
2025-03-03 14:05       ` Hannes Reinecke
2025-03-03 14:27   ` Matthew Wilcox
2025-03-03 14:42     ` Matthew Wilcox
2025-03-03 15:12       ` Vlastimil Babka
2025-03-03 15:39       ` Hannes Reinecke
2025-03-03 15:48         ` Matthew Wilcox
2025-03-03 16:15           ` Vlastimil Babka
2025-03-03 22:02             ` Vlastimil Babka
2025-03-04  7:58               ` Hannes Reinecke
2025-03-04  8:18                 ` Vlastimil Babka
2025-03-04 10:20                   ` Hannes Reinecke
2025-03-04 10:26                     ` Vlastimil Babka
2025-03-04 15:11                       ` Hannes Reinecke
2025-03-04 15:29                       ` Vlastimil Babka
2025-03-04 16:20                         ` Hannes Reinecke
2025-03-04 16:14                       ` Matthew Wilcox
2025-03-04 16:32                         ` Hannes Reinecke
2025-03-04 16:53                           ` Matthew Wilcox
2025-03-04 18:05                             ` Matthew Wilcox
2025-03-04 18:31                               ` Vlastimil Babka
2025-03-04 19:39                               ` Hannes Reinecke
2025-03-04 19:44                                 ` Vlastimil Babka
2025-03-05  7:14                                   ` Hannes Reinecke
2025-03-05  8:20                                   ` Hannes Reinecke
2025-03-05  8:58                                     ` Vlastimil Babka
2025-03-05 11:43                                       ` Hannes Reinecke
2025-03-05 18:11                                         ` Networking people smell funny and make poor life choices Matthew Wilcox
2025-03-06  0:46                                           ` Cong Wang
2025-03-12 15:09                                           ` Christoph Hellwig
2025-03-12 18:28                                             ` James R. Bergsten
2025-03-13  9:43                                           ` David Laight
2025-03-06  9:15                                         ` Kernel oops with 6.14 when enabling TLS Vlastimil Babka

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=08c29e4b-2f71-4b6d-8046-27e407214d8c@suse.com \
    --to=hare@suse.com \
    --cc=linux-block@vger.kernel.org \
    --cc=linux-nvme@lists.infradead.org \
    --cc=sagi@grimberg.me \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.