author		Hugh Dickins <hughd@google.com>		2023-09-29 20:42:45 -0700
committer	Andrew Morton <akpm@linux-foundation.org>	2023-10-18 14:34:14 -0700
commit		beb9868628445306958fd7b2da1cd369a4a381cc (patch)
tree		52d12a5752afdbde62d0f55ed3230c1eac56864b /mm/shmem.c
parent		3022fd7af9604d44ec43da8a4398872989599b18 (diff)
shmem,percpu_counter: add _limited_add(fbc, limit, amount)

Percpu counter's compare and add are separate functions: without locking
around them (which would defeat their purpose), it has been possible to
overflow the intended limit. Imagine all the other CPUs fallocating tmpfs
huge pages to the limit, in between this CPU's compare and its add.
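
For illustration, here is the racy pre-patch pattern from mm/shmem.c (it is the "-" side of the diff below): nothing serializes the compare against the add, so any number of CPUs can pass the compare before any of them has added.

	/* Pre-patch: compare and add are two separate, unlocked steps. */
	if (percpu_counter_compare(&sbinfo->used_blocks,
				   sbinfo->max_blocks - pages) > 0)
		goto unacct;	/* looks over the limit: refuse */
	/* ... other CPUs can pass the same compare right here ... */
	percpu_counter_add(&sbinfo->used_blocks, pages);	/* may overshoot */
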
I have not seen reports of that happening; but tmpfs's recent addition of
dquot_alloc_block_nodirty() in between the compare and the add makes it
even more likely, and I'd be uncomfortable leaving it unfixed.

Introduce percpu_counter_limited_add(fbc, limit, amount) to prevent it.
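
Its intended contract can be modelled as one atomic compare-and-add step. A minimal single-threaded sketch of the semantics (not the lib/percpu_counter.c implementation, which works on per-CPU batches under fbc->lock):

	/* Model: add "amount" only if the counter would stay within "limit";
	 * return true and add on success; return false without adding. */
	static bool limited_add_model(s64 *count, s64 limit, s64 amount)
	{
		if (*count + amount > limit)
			return false;
		*count += amount;
		return true;
	}
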
I believe this implementation is correct, and slightly more efficient than
the combination of compare and add (taking the lock once rather than twice
when nearing full - the last 128MiB of a tmpfs volume on a machine with
128 CPUs and 4KiB pages); but it does beg for a better design - when
nearing full, there is no new batching, but the costly percpu counter sum
across CPUs still has to be done, while locked.
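
A hedged sketch of that near-full slow path, assuming only the structure described above (simplified: the real lib/percpu_counter.c code also keeps a lock-free per-CPU fast path and runs with interrupts disabled):

	bool limited_add_slowpath_sketch(struct percpu_counter *fbc,
					 s64 limit, s64 amount)
	{
		s64 count;
		int cpu;
		bool good;

		raw_spin_lock(&fbc->lock);
		/* Project the true value: global count plus every CPU's
		 * unflushed delta - the costly sum, done while locked. */
		count = fbc->count + amount;
		for_each_online_cpu(cpu)
			count += *per_cpu_ptr(fbc->counters, cpu);
		good = count <= limit;
		if (good)
			fbc->count += amount;	/* compare and add under one lock */
		raw_spin_unlock(&fbc->lock);
		return good;
	}
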
Follow __percpu_counter_sum()'s example, including cpu_dying_mask as well
as cpu_online_mask: but shouldn't __percpu_counter_compare() and
__percpu_counter_limited_add() then be adding a num_dying_cpus() to
num_online_cpus(), when they calculate the maximum which could be held
across CPUs? But the times when it matters would be vanishingly rare.
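
For reference, a sketch of that summing loop in __percpu_counter_sum()'s style; the num_dying_cpus() adjustment mentioned above is hypothetical, written here as cpumask_weight(cpu_dying_mask):

	/* Include CPUs on their way down as well as online ones, so a
	 * dying CPU's not-yet-folded delta is not missed by the sum. */
	for_each_cpu_or(cpu, cpu_online_mask, cpu_dying_mask)
		count += *per_cpu_ptr(fbc->counters, cpu);

	/* The worst-case slack held across CPUs would then arguably be
	 * (with "batch" the per-CPU batch size): */
	unknown = batch * (num_online_cpus() + cpumask_weight(cpu_dying_mask));
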
Link: https://lkml.kernel.org/r/bb817848-2d19-bcc8-39ca-ea179af0f0b4@google.com
Signed-off-by: Hugh Dickins <hughd@google.com>
Reviewed-by: Jan Kara <jack@suse.cz>
Cc: Tim Chen <tim.c.chen@intel.com>
Cc: Dave Chinner <dchinner@redhat.com>
Cc: Darrick J. Wong <djwong@kernel.org>
Cc: Axel Rasmussen <axelrasmussen@google.com>
Cc: Carlos Maiolino <cem@kernel.org>
Cc: Christian Brauner <brauner@kernel.org>
Cc: Chuck Lever <chuck.lever@oracle.com>
Cc: Johannes Weiner <hannes@cmpxchg.org>
Cc: Matthew Wilcox (Oracle) <willy@infradead.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Diffstat (limited to 'mm/shmem.c')
-rw-r--r--	mm/shmem.c	10
1 file changed, 5 insertions, 5 deletions
diff --git a/mm/shmem.c b/mm/shmem.c
index 269cd3c1110f..61b170324e5c 100644
--- a/mm/shmem.c
+++ b/mm/shmem.c
@@ -217,15 +217,15 @@ static int shmem_inode_acct_blocks(struct inode *inode, long pages)
 
 	might_sleep();	/* when quotas */
 	if (sbinfo->max_blocks) {
-		if (percpu_counter_compare(&sbinfo->used_blocks,
-					   sbinfo->max_blocks - pages) > 0)
+		if (!percpu_counter_limited_add(&sbinfo->used_blocks,
+						sbinfo->max_blocks, pages))
 			goto unacct;
 
 		err = dquot_alloc_block_nodirty(inode, pages);
-		if (err)
+		if (err) {
+			percpu_counter_sub(&sbinfo->used_blocks, pages);
 			goto unacct;
-
-		percpu_counter_add(&sbinfo->used_blocks, pages);
+		}
 	} else {
 		err = dquot_alloc_block_nodirty(inode, pages);
 		if (err)