Re: [PATCH V4] scsi: core: only re-run queue in scsi_end_request() if device queue is busy
From: Ming Lei <hidden>
Date: 2020-09-02 07:02:21
Also in:
linux-scsi
On Tue, Sep 01, 2020 at 07:40:54PM -0700, Bart Van Assche wrote:
On 2020-08-17 03:08, Ming Lei wrote:quoted
diff --git a/drivers/scsi/scsi_lib.c b/drivers/scsi/scsi_lib.c index 7c6dd6f75190..a62c29058d26 100644 --- a/drivers/scsi/scsi_lib.c +++ b/drivers/scsi/scsi_lib.c@@ -551,8 +551,27 @@ static void scsi_run_queue_async(struct scsi_device *sdev) if (scsi_target(sdev)->single_lun || !list_empty(&sdev->host->starved_list)) kblockd_schedule_work(&sdev->requeue_work); - else - blk_mq_run_hw_queues(sdev->request_queue, true); + else {Has this patch been verified with checkpatch? Checkpatch should have warned about the unbalanced braces.
[linux]$ ./scripts/checkpatch.pl -g HEAD
total: 0 errors, 0 warnings, 71 lines checked
Commit 0cbe51645b54 ("scsi: core: only re-run queue in scsi_end_request() if device queue is busy") has no obvious style problems and is ready for submission.
quoted
+ /* + * smp_mb() implied in either rq->end_io or blk_mq_free_request + * is for ordering writing .device_busy in scsi_device_unbusy() + * and reading sdev->restarts. + */Hmm ... I don't see what orders the atomic_dec(&sdev->device_busy) from scsi_device_unbusy() and the atomic_read() below? I don't think that the block layer guarantees ordering of these two memory accesses since both accesses happen in the request completion path.
__blk_mq_end_request() is called between scsi_device_unbusy() and scsi_run_queue_async(). When __blk_mq_end_request() is called, this request is actually ended really because SCMD_STATE_COMPLETE is covered race between timeout and normal completion, so: 1) either __blk_mq_free_request() is called, smp_mb__after_atomic() is implied in sbitmap_queue_clear() called from blk_mq_put_tag() 2) or rq->end_io() is called. We don't have too many ->end_io() implemented. Either wake_up_process() or blk_mq_free_request() is called in ->end_io(), so memory barrier is implied.
quoted
+ int old = atomic_read(&sdev->restarts); + + if (old) { + /* + * ->restarts has to be kept as non-zero if there is + * new budget contention comes.There are two verbs in the above sentence ("is" and "comes"). Please remove "comes" such that the sentence becomes grammatically correct.quoted
+ * + * No need to run queue when either another re-run + * queue wins in updating ->restarts or one new budget + * contention comes. + */ + if (atomic_cmpxchg(&sdev->restarts, old, 0) == old) + blk_mq_run_hw_queues(sdev->request_queue, true); + } + }Please combine the two if-statements into a single if-statement using "&&" to keep the indentation level low.quoted
@@ -1611,8 +1630,34 @@ static void scsi_mq_put_budget(struct request_queue *q) static bool scsi_mq_get_budget(struct request_queue *q) { struct scsi_device *sdev = q->queuedata; + int ret = scsi_dev_queue_ready(q, sdev); + + if (ret) + return true; + + atomic_inc(&sdev->restarts); - return scsi_dev_queue_ready(q, sdev); + /* + * Order writing .restarts and reading .device_busy, and make sure + * .restarts is visible to scsi_end_request(). Its pair is implied by + * __blk_mq_end_request() in scsi_end_request() for ordering + * writing .device_busy in scsi_device_unbusy() and reading .restarts. + * + */ + smp_mb__after_atomic();Barriers do not guarantee "is visible to". Barriers enforce ordering of memory accesses performed by a certain CPU core. Did you perhaps mean that sdev->restarts must be incremented before the code below reads sdev->device busy?
Right, ->restart has to be incremented before reading sdev->device_busy. Thanks, Ming