Thread (7 messages) 7 messages, 3 authors, 2024-12-08

Re: deploying both NFS client and server on the same machine trigger hungtask

From: Jeff Layton via Bugspray Bot <bugbot@kernel.org>
Date: 2024-12-04 15:44:52

Jeff Layton writes via Kernel.org Bugzilla:

Li Lingfeng's analysis looks basically correct to me, though I think the nfsd_mutex is mostly a red-herring here.

The client holds the shrinker rwsem and is trying to write back data. The server is trying to unregister a shrinker as part of server shutdown and has almost certainly stopped responding to requests at that point. The client is using hard RPCs, so it's going to retry the writeback indefinitely while holding the shrinker mutex, which will block server shutdown.

I don't see a great way to fix this right offhand, though I wonder if localio might help mitigate this problem.

View: https://bugzilla.kernel.org/show_bug.cgi?id=219550#c4
You can reply to this message to join the discussion.
-- 
Deet-doot-dot, I am a bot.
Kernel.org Bugzilla (bugspray 0.1-dev)
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help