View Issue Details

IDProjectCategoryView StatusLast Update
0001494channel: elrepo/el9kmod-ib_qibpublic2024-11-26 14:31
Reporternscfreny Assigned Totqhoang  
PrioritynormalSeveritycrashReproducibilityalways
Status acknowledgedResolutionopen 
Platform5.14.0-503.14.1.el9_5.x86_64OSRocky LinuxOS Version9.5
Summary0001494: Kernel panic loading ib_qib with EL 9.5 kernel
DescriptionKernel panic when loading ib_qib from kmod-ib_qib-1.11-12.el9_5.elrepo.x86_64.rpm with Rocky Linux 9.5 kernel.

See console log here: https://rpa.st/UW5Q

(Failed to attach as text file here? Or perhaps timeout?)
Steps To ReproduceInstall kmod-ib_qib + modprobe ib_qib:

dnf install https://elrepo.org/linux/elrepo/el9/x86_64/RPMS/kmod-ib_qib-1.11-12.el9_5.elrepo.x86_64.rpm
modprobe ib_qib
Additional InformationInformation about HCA:

lspci -vnn | grep -A1 "InfiniBand \["
02:00.0 InfiniBand [0c06]: QLogic Corp. IBA7322 QDR InfiniBand HCA [1077:7322] (rev 02)
        Subsystem: QLogic Corp. IBA7322 QDR InfiniBand HCA [1077:7322]
TagsNo tags attached.

Activities

nscfreny

2024-11-26 09:46

reporter   ~0010207

Works with Rocky 9.5 by downgrading kernel + kmod-ib_qib to latest 9.4 versions.

toracat

2024-11-26 10:46

administrator   ~0010208

Acknowledged.

toracat

2024-11-26 12:28

administrator   ~0010210

We cannot reproduce the issue most likely because we do not have proper hardware to test on.

Can you try installing the current kernel-ml package? By this, you'll be testing the latest code available from kernel.org.

$ sudo dnf --enablerepo=elrepo-kernel install kernel-ml

nscfreny

2024-11-26 14:31

reporter   ~0010211

We don't boot from disk but I did manage to kexec kernel-ml, and ib_qib seems to work.

# Staring point is a stripped compute node image with kernel-5.14.0-503.14.1.el9_5.x86_64 without kmod-ib_qib.
dnf install elrepo-release
dnf --enablerepo=elrepo-kernel install kernel-ml
kexec -l /boot/vmlinuz-6.12.1-1.el9.elrepo.x86_64 --append='ro root=/dev/sda1 console=ttyS1,57600' --initrd=/boot/initramfs-6.12.1-1.el9.elrepo.x86_64.img
kexec -e
# Wait...

uname -r
6.12.1-1.el9.elrepo.x86_64

ibstat
CA 'qib0'
        CA type: InfiniPath_QLE7340
        Number of ports: 1
        Firmware version:
        Hardware version: 2
        Node GUID: 0x00117500006f85e8
        System image GUID: 0x00117500006f85e8
        Port 1:
                State: Active
                Physical state: LinkUp
                Rate: 40
                Base lid: 10
                LMC: 0
                SM lid: 1
                Capability mask: 0x07610868
                Port GUID: 0x00117500006f85e8
                Link layer: InfiniBand

Issue History

Date Modified Username Field Change
2024-11-26 09:36 nscfreny New Issue
2024-11-26 09:36 nscfreny Status new => assigned
2024-11-26 09:36 nscfreny Assigned To => tqhoang
2024-11-26 09:46 nscfreny Note Added: 0010207
2024-11-26 10:46 toracat Status assigned => acknowledged
2024-11-26 10:46 toracat Note Added: 0010208
2024-11-26 12:28 toracat Note Added: 0010210
2024-11-26 14:31 nscfreny Note Added: 0010211