summaryrefslogtreecommitdiffstats
path: root/block
diff options
context:
space:
mode:
authorJosef Bacik <josef@toxicpanda.com>2019-03-07 21:37:18 +0000
committerGreg Kroah-Hartman <gregkh@linuxfoundation.org>2019-07-31 07:24:52 +0200
commit65dfe3fafd1ce222e68b7821cae5bbc928aeb199 (patch)
tree6e04e88088df0fa9dc0054e40a2fe9b9119de488 /block
parent14deb9df67c3544cc6a975dd0e3d2a88d3333085 (diff)
downloadlinux-stable-65dfe3fafd1ce222e68b7821cae5bbc928aeb199.tar.gz
linux-stable-65dfe3fafd1ce222e68b7821cae5bbc928aeb199.tar.bz2
linux-stable-65dfe3fafd1ce222e68b7821cae5bbc928aeb199.zip
block: init flush rq ref count to 1
[ Upstream commit b554db147feea39617b533ab6bca247c91c6198a ] We discovered a problem in newer kernels where a disconnect of a NBD device while the flush request was pending would result in a hang. This is because the blk mq timeout handler does if (!refcount_inc_not_zero(&rq->ref)) return true; to determine if it's ok to run the timeout handler for the request. Flush_rq's don't have a ref count set, so we'd skip running the timeout handler for this request and it would just sit there in limbo forever. Fix this by always setting the refcount of any request going through blk_init_rq() to 1. I tested this with a nbd-server that dropped flush requests to verify that it hung, and then tested with this patch to verify I got the timeout as expected and the error handling kicked in. Thanks, Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Sasha Levin <sashal@kernel.org>
Diffstat (limited to 'block')
-rw-r--r--block/blk-core.c1
1 files changed, 1 insertions, 0 deletions
diff --git a/block/blk-core.c b/block/blk-core.c
index 8340f69670d8..5183fca0818a 100644
--- a/block/blk-core.c
+++ b/block/blk-core.c
@@ -117,6 +117,7 @@ void blk_rq_init(struct request_queue *q, struct request *rq)
rq->internal_tag = -1;
rq->start_time_ns = ktime_get_ns();
rq->part = NULL;
+ refcount_set(&rq->ref, 1);
}
EXPORT_SYMBOL(blk_rq_init);