[PATCH openEuler-5.10 066/101] nbd: Aovid double completion of a request

19 Oct 2021

From: Xie Yongji xieyongji@bytedance.com
stable inclusion
from stable-5.10.60
commit e0ee8d9c31b5a670f35a4ff7e2daf59967f2f27a
bugzilla: 177018 https://gitee.com/openeuler/kernel/issues/I4EAUG
Reference: https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/commit/?id=...
--------------------------------
[ Upstream commit cddce01160582a5f52ada3da9626c052d852ec42 ]
There is a race between iterating over requests in
nbd_clear_que() and completing requests in recv_work(),
which can lead to double completion of a request.
To fix it, flush the recv worker before iterating over
the requests and don't abort the completed request
while iterating.
Fixes: 96d97e17828f ("nbd: clear_sock on netlink disconnect")
Reported-by: Jiang Yadong jiangyadong@bytedance.com
Signed-off-by: Xie Yongji xieyongji@bytedance.com
Reviewed-by: Josef Bacik josef@toxicpanda.com
Link: https://lore.kernel.org/r/20210813151330.96-1-xieyongji@bytedance.com
Signed-off-by: Jens Axboe axboe@kernel.dk
Signed-off-by: Sasha Levin sashal@kernel.org
Signed-off-by: Chen Jun chenjun102@huawei.com
Acked-by: Weilong Chen chenweilong@huawei.com
Signed-off-by: Chen Jun chenjun102@huawei.com
---
 drivers/block/nbd.c | 14 +++++++++++---
 1 file changed, 11 insertions(+), 3 deletions(-)

diff --git a/drivers/block/nbd.c b/drivers/block/nbd.c
index 9a70eab7edbf..59c452fff835 100644
--- a/drivers/block/nbd.c
+++ b/drivers/block/nbd.c
@@ -812,6 +812,10 @@ static bool nbd_clear_req(struct request *req, void *data, bool reserved)
 {
    struct nbd_cmd *cmd = blk_mq_rq_to_pdu(req);
+	/* don't abort one completed request */
+	if (blk_mq_request_completed(req))
+		return true;
+
    mutex_lock(&cmd->lock);
    cmd->status = BLK_STS_IOERR;
    mutex_unlock(&cmd->lock);
@@ -2024,15 +2028,19 @@ static void nbd_disconnect_and_put(struct nbd_device *nbd)
 {
    mutex_lock(&nbd->config_lock);
    nbd_disconnect(nbd);
-	nbd_clear_sock(nbd);
-	mutex_unlock(&nbd->config_lock);
+	sock_shutdown(nbd);
    /*
     * Make sure recv thread has finished, so it does not drop the last
     * config ref and try to destroy the workqueue from inside the work
-	 * queue.
+	 * queue. And this also ensure that we can safely call nbd_clear_que()
+	 * to cancel the inflight I/Os.
     */
    if (nbd->recv_workq)
    	flush_workqueue(nbd->recv_workq);
+	nbd_clear_que(nbd);
+	nbd->task_setup = NULL;
+	mutex_unlock(&nbd->config_lock);
+
    if (test_and_clear_bit(NBD_RT_HAS_CONFIG_REF,
    		       &nbd->config->runtime_flags))
    	nbd_config_put(nbd);
-- 
2.20.1

    

2025

2024

2023

2022

2021

2020

2019

[PATCH openEuler-5.10 066/101] nbd: Aovid double completion of a request