RDMA/addr: Fix race with netevent_callback()/rdma_addr_cancel()

commit 2ee9bf346fbfd1dad0933b9eb3a4c2c0979b633e upstream. This three thread race can result in the work being run once the callback becomes NULL: CPU1 CPU2 CPU3 netevent_callback() process_one_req() rdma_addr_cancel() [..] spin_lock_bh() set_timeout() spin_unlock_bh() spin_lock_bh() list_del_init(&req->list); spin_unlock_bh() req->callback = NULL spin_lock_bh() if (!list_empty(&req->list)) // Skipped! // cancel_delayed_work(&req->work); spin_unlock_bh() process_one_req() // again req->callback() // BOOM cancel_delayed_work_sync() The solution is to always cancel the work once it is completed so any in between set_timeout() does not result in it running again. Cc: stable@vger.kernel.org Fixes: 44e75052bc2a ("RDMA/rdma_cm: Make rdma_addr_cancel into a fence") Link: https://lore.kernel.org/r/20200930072007.1009692-1-leon@kernel.org Reported-by: Dan Aloni <dan@kernelim.com> Signed-off-by: Leon Romanovsky <leonro@nvidia.com> Signed-off-by: Jason Gunthorpe <jgg@nvidia.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
author: Jason Gunthorpe <jgg@nvidia.com> 2020-09-30 10:20:07 +0300
committer: Greg Kroah-Hartman <gregkh@linuxfoundation.org> 2020-11-01 12:47:08 +0100
commit: 35a1d6270d451d5e7fffca87f965be5e266a7916 (patch)
tree: 4e38bd5eeff8bdf4d55ade44d6dc460b982fed1f
parent: 0f6dd1b5f0fa9962c9d8f207138d86f6675f04d4 (diff)
download: linux-stable-35a1d6270d451d5e7fffca87f965be5e266a7916.tar.gz
linux-stable-35a1d6270d451d5e7fffca87f965be5e266a7916.tar.bz2
linux-stable-35a1d6270d451d5e7fffca87f965be5e266a7916.zip
1 files changed, 5 insertions, 6 deletions
diff --git a/drivers/infiniband/core/addr.c b/drivers/infiniband/core/addr.c
index 3a98439bba83..0abce004a959 100644
--- a/drivers/infiniband/core/addr.c
+++ b/drivers/infiniband/core/addr.c
@@ -647,13 +647,12 @@ static void process_one_req(struct work_struct *_work)
 	req->callback = NULL;
 
 	spin_lock_bh(&lock);
+	/*
+	 * Although the work will normally have been canceled by the workqueue,
+	 * it can still be requeued as long as it is on the req_list.
+	 */
+	cancel_delayed_work(&req->work);
 	if (!list_empty(&req->list)) {
-		/*
-		 * Although the work will normally have been canceled by the
-		 * workqueue, it can still be requeued as long as it is on the
-		 * req_list.
-		 */
-		cancel_delayed_work(&req->work);
 		list_del_init(&req->list);
 		kfree(req);
 	}
author	Jason Gunthorpe <jgg@nvidia.com>	2020-09-30 10:20:07 +0300
committer	Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2020-11-01 12:47:08 +0100
commit	35a1d6270d451d5e7fffca87f965be5e266a7916 (patch)
tree	4e38bd5eeff8bdf4d55ade44d6dc460b982fed1f
parent	0f6dd1b5f0fa9962c9d8f207138d86f6675f04d4 (diff)
download	linux-stable-35a1d6270d451d5e7fffca87f965be5e266a7916.tar.gz linux-stable-35a1d6270d451d5e7fffca87f965be5e266a7916.tar.bz2 linux-stable-35a1d6270d451d5e7fffca87f965be5e266a7916.zip