Thread: 33 messages, 8 authors, 2023-06-02

Re: system hang on start-up (mlx5?)

From: Linux regression tracking #update (Thorsten Leemhuis) <hidden>
Date: 2023-06-02 11:05:49
Also in: linux-rdma, regressions

[TLDR: This mail is primarily relevant for Linux regression tracking. A
change or fix related to the regression discussed in this thread was
posted or applied, but it did not use a Link: tag to point to the
report, as Linus and the documentation call for. Things happen, no
worries -- but now the regression tracking bot needs to be told manually
about the fix. See link in footer if these mails annoy you.]

On 08.05.23 14:29, Linux regression tracking #adding (Thorsten Leemhuis)
wrote:
On 03.05.23 03:03, Chuck Lever III wrote:
I have a Supermicro X10SRA-F/X10SRA-F with a ConnectX®-5 EN network
interface card, 100GbE single-port QSFP28, PCIe3.0 x16, tall bracket;
MCX515A-CCAT

When booting a v6.3+ kernel, the boot process stops cold after a
few seconds. The last message on the console is the MLX5 driver
note about "PCIe slot advertised sufficient power (27W)".

bisect reports that bbac70c74183 ("net/mlx5: Use newer affinity
descriptor") is the first bad commit.

I've trolled lore a couple of times and haven't found any discussion
of this issue.
#regzbot ^introduced bbac70c74183
#regzbot title system hang on start-up (irq or mlx5 problem?)
#regzbot ignore-activity

#regzbot fix: 368591995d010e6
#regzbot ignore-activity

Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat)
--
Everything you wanna know about Linux kernel regression tracking:
https://linux-regtracking.leemhuis.info/about/#tldr
That page also explains what to do if mails like this annoy you.