Re: stalling IO regression since linux 5.12, through 5.18
From: Ming Lei <hidden>
Date: 2022-08-17 15:35:18
Also in:
linux-block, linux-btrfs, lkml
From: Ming Lei <hidden>
Date: 2022-08-17 15:35:18
Also in:
linux-block, linux-btrfs, lkml
On Wed, Aug 17, 2022 at 11:02:25AM -0400, Chris Murphy wrote:
On Wed, Aug 17, 2022, at 10:53 AM, Ming Lei wrote:quoted
On Wed, Aug 17, 2022 at 10:34:38AM -0400, Chris Murphy wrote:quoted
On Wed, Aug 17, 2022, at 8:06 AM, Ming Lei wrote:quoted
blk-mq debugfs log is usually helpful for io stall issue, care to post the blk-mq debugfs log: (cd /sys/kernel/debug/block/$disk && find . -type f -exec grep -aH . {} \;)This is only sda https://drive.google.com/file/d/1aAld-kXb3RUiv_ShAvD_AGAFDRS03Lr0/view?usp=sharingFrom the log, there isn't any in-flight IO request. So please confirm that it is collected after the IO stall is triggered.Yes, iotop reports no reads or writes at the time of collection. IO pressure 99% for auditd, systemd-journald, rsyslogd, and postgresql, with increasing pressure from all the qemu processes. Keep in mind this is a raid10, so maybe it's enough for just one block device IO to stall and the whole thing stops? That's why I included all block devices.
From the 2nd log of blockdebugfs-all.txt, still not see any in-flight IO on request based block devices, but sda is _not_ included in this log, and only sdi, sdg and sdf are collected, is that expected? BTW, all request based block devices should be observed in blk-mq debugfs. thanks, Ming