Linux - Linux kernel mainline tree

	Commit message (Collapse)	Author	Age	Files	Lines
*	Merge branch 'for-linus' into for-next	Al Viro	2015-04-11	4	-15/+34
\|\
\| *	ocfs2: _really_ sync the right range	Al Viro	2015-04-09	1	-4/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	"ocfs2 syncs the wrong range" had been broken; prior to it the code was doing the wrong thing in case of O_APPEND, all right, but _after_ it we were syncing the wrong range in 100% cases. *ppos, aka iocb->ki_pos is incremented prior to that point, so we are always doing sync on the area _after_ the one we'd written to. Spotted by Joseph Qi <joseph.qi@huawei.com> back in January; unfortunately, I'd missed his mail back then ;-/ Cc: stable@vger.kernel.org Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| *	ocfs2_file_write_iter: keep return value and current position update in sync	Al Viro	2015-04-08	1	-1/+1
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| *	[regression] ocfs2: do not increment ->ki_pos twice	Al Viro	2015-04-08	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	generic_file_direct_write() already does that. Broken by "ocfs2: do not fallback to buffer I/O write if appending" Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| *	ioctx_alloc(): fix vma (and file) leak on failure	Al Viro	2015-04-06	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we fail past the aio_setup_ring(), we need to destroy the mapping. We don't need to care about anybody having found ctx, or added requests to it, since the last failure exit is exactly the failure to make ctx visible to lookups. Reproducer (based on one by Joe Mario <jmario@redhat.com>): void count(char p) { char s[80]; printf("%s: ", p); fflush(stdout); sprintf(s, "/bin/cat /proc/%d/maps\|/bin/fgrep -c '/[aio] (deleted)'", getpid()); system(s); } int main() { io_context_t ctx; int created, limit, i, destroyed; FILE *f; count("before"); if ((f = fopen("/proc/sys/fs/aio-max-nr", "r")) == NULL) perror("opening aio-max-nr"); else if (fscanf(f, "%d", &limit) != 1) fprintf(stderr, "can't parse aio-max-nr\n"); else if ((ctx = calloc(limit, sizeof(io_context_t))) == NULL) perror("allocating aio_context_t array"); else { for (i = 0, created = 0; i < limit; i++) { if (io_setup(1000, ctx + created) == 0) created++; } for (i = 0, destroyed = 0; i < created; i++) if (io_destroy(ctx[i]) == 0) destroyed++; printf("created %d, failed %d, destroyed %d\n", created, limit - created, destroyed); count("after"); } } Found-by: Joe Mario <jmario@redhat.com> Cc: stable@vger.kernel.org Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| *	fix mremap() vs. ioctx_kill() race	Al Viro	2015-04-06	3	-9/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	teach ->mremap() method to return an error and have it fail for aio mappings in process of being killed Note that in case of ->mremap() failure we need to undo move_page_tables() we'd already done; we could call ->mremap() first, but then the failure of move_page_tables() would require undoing whatever _successful_ ->mremap() has done, which would be a lot more headache in general. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	udf_file_write_iter: reorder and simplify	Al Viro	2015-04-11	1	-20/+14
\| \| \| \| \| \| \| \| \| \| \| \|	it's easier to do generic_write_checks() first Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	fuse: ->direct_IO() doesn't need generic_write_checks()	Al Viro	2015-04-11	1	-8/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	already done by caller. We used to call __fuse_direct_write(), which called generic_write_checks(); now the former got expanded, bringing the latter to the surface. It used to be called all along and calling it from there had been wrong all along... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	ext4_file_write_iter: move generic_write_checks() up	Al Viro	2015-04-11	1	-19/+20
\| \| \| \| \| \| \| \| \| \| \| \|	simpler that way... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	xfs_file_aio_write_checks: switch to iocb/iov_iter	Al Viro	2015-04-11	1	-15/+16
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	generic_write_checks(): drop isblk argument	Al Viro	2015-04-11	14	-60/+36
\| \| \| \| \| \| \| \| \| \| \| \|	all remaining callers are passing 0; some just obscure that fact. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	blkdev_write_iter: expand generic_file_checks() call in there	Al Viro	2015-04-11	1	-6/+9
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	lift generic_write_checks() into callers of __generic_file_write_iter()	Al Viro	2015-04-11	5	-30/+60
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	__generic_file_write_iter: keep ->ki_pos and return value consistent	Al Viro	2015-04-11	1	-14/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A side effect worth noting: in O_APPEND case we set ->ki_pos early, so if it turns out to be an error or a zero-length write, we'll end up with ->ki_pos modified. Safe, since all callers never look at the ->ki_pos after the call of __generic_file_write_iter() returning non-positive, all the way to caller of ->write_iter() and those discard ->ki_pos when getting that. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	cifs: fold cifs_iovec_write() into the only caller	Al Viro	2015-04-11	1	-31/+16
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	ntfs: move iov_iter_truncate() closer to generic_write_checks()	Al Viro	2015-04-11	1	-52/+29
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	new_sync_write(): discard ->ki_pos unless the return value is positive	Al Viro	2015-04-11	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	That allows ->write_iter() instances much more convenient life wrt iocb->ki_pos (and fixes several filesystems with borderline POSIX violations when zero-length write succeeds and changes the current position). Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	direct_IO: remove rw from a_ops->direct_IO()	Omar Sandoval	2015-04-11	31	-59/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Now that no one is using rw, remove it completely. Signed-off-by: Omar Sandoval <osandov@osandov.com> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	direct_IO: use iov_iter_rw() instead of rw everywhere	Omar Sandoval	2015-04-11	22	-69/+69
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The rw parameter to direct_IO is redundant with iov_iter->type, and treated slightly differently just about everywhere it's used: some users do rw & WRITE, and others do rw == WRITE where they should be doing a bitwise check. Simplify this with the new iov_iter_rw() helper, which always returns either READ or WRITE. Signed-off-by: Omar Sandoval <osandov@osandov.com> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	Remove rw from dax_{do_,}io()	Omar Sandoval	2015-04-11	5	-21/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	And use iov_iter_rw() instead. Signed-off-by: Omar Sandoval <osandov@osandov.com> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	Remove rw from {,__,do_}blockdev_direct_IO()	Omar Sandoval	2015-04-11	20	-74/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Most filesystems call through to these at some point, so we'll start here. Signed-off-by: Omar Sandoval <osandov@osandov.com> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	new helper: iov_iter_rw()	Omar Sandoval	2015-04-11	1	-0/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Get either READ or WRITE out of iter->type. Signed-off-by: Omar Sandoval <osandov@osandov.com> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	->aio_read and ->aio_write removed	Al Viro	2015-04-11	8	-54/+9
\| \| \| \| \| \| \| \| \| \| \| \|	no remaining users Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	pcm: another weird API abuse	Al Viro	2015-04-11	1	-19/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	readv() and writev() should _not_ ignore all but the first ->iov_len, among other things. Really weird abuse of those syscalls - it expects a vector element per channel, with identical lengths (it actually assumes them to be identical - no checking is done). readv() and writev() are really bad match for that. Unfortunately, userland API is userland API and we can't do anything about them. Converted to ->read_iter/->write_iter. Please, _please_ don't do anything of that kind when designing new interfaces. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	infinibad: weird APIs switched to ->write_iter()	Al Viro	2015-04-11	2	-15/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Things Not To Do When Writing A Driver, part 1001st: have writev() and write() on the same file doing completely different things. As in, "interpret very different sets of commands". We _can_ handle that, but it's a bloody bad idea. Don't do that in new drivers. Ever. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	kill do_sync_read/do_sync_write	Al Viro	2015-04-11	2	-40/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	all remaining instances of aio_{read,write} (all 4 of them) have explicit ->read and ->write resp.; do_sync_read/do_sync_write is never called by __vfs_read/__vfs_write anymore and no other users had been left. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	fuse: use iov_iter_get_pages() for non-splice path	Al Viro	2015-04-11	1	-24/+17
\| \| \| \| \| \| \| \| \| \| \| \|	store reference to iter instead of that to iovec Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	fuse: switch to ->read_iter/->write_iter	Al Viro	2015-04-11	1	-12/+14
\| \| \| \| \| \| \| \| \| \| \| \|	we just change the calling conventions here; more work to follow. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	switch drivers/char/mem.c to ->read_iter/->write_iter	Al Viro	2015-04-11	1	-9/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Note that _these_ guys have ->read() and ->write() left in place - they are eqiuvalent to what we'd get if we replaced those with NULL, but we are talking about hot paths here. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	make new_sync_{read,write}() static	Al Viro	2015-04-11	59	-153/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	All places outside of core VFS that checked ->read and ->write for being NULL or called the methods directly are gone now, so NULL {read,write} with non-NULL {read,write}_iter will do the right thing in all cases. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	coredump: accept any write method	Al Viro	2015-04-11	1	-1/+1
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	switch /dev/loop to vfs_iter_write()	Al Viro	2015-04-11	1	-5/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	all writable files that might be used as backing store for /dev/loop already support ->write_iter() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	serial2002: switch to __vfs_read/__vfs_write	Al Viro	2015-04-11	1	-12/+6
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	ashmem: use __vfs_read()	Al Viro	2015-04-11	1	-1/+1
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	export __vfs_read()	Al Viro	2015-04-11	1	-8/+5
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	autofs: switch to __vfs_write()	Al Viro	2015-04-11	2	-2/+2
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	new helper: __vfs_write()	Al Viro	2015-04-11	2	-12/+17
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \|	Merge branch '9p-iov_iter' into for-next	Al Viro	2015-04-11	12	-627/+355
\|\ \
\| * \|	net/9p: remove (now-)unused helpers	Al Viro	2015-04-11	2	-43/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| * \|	p9_client_attach(): set fid->uid correctly	Al Viro	2015-04-11	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	it's almost always equal to current_fsuid(), but there's an exception - if the first writeback fid is opened by non-root and that happens before root has done any lookups in /, we end up doing attach for root. The current code leaves the resulting FID owned by root from the server POV and by non-root from the client one. Unfortunately, it means that e.g. massive dcache eviction will leave that user buggered - they'll end up redoing walks from / and picking that FID every time. As soon as they try to create something, the things will get nasty. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| * \|	9p: we are leaking glock.client_id in v9fs_file_getlock()	Al Viro	2015-04-11	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| * \|	9p: switch to ->read_iter/->write_iter	Al Viro	2015-04-11	1	-44/+39
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| * \|	9p: get rid of v9fs_direct_file_read()	Al Viro	2015-04-11	2	-51/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	do it in ->direct_IO()... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| * \|	9p: switch p9_client_read() to passing struct iov_iter *	Al Viro	2015-04-11	7	-183/+108
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	... and make it loop Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| * \|	9p: get rid of v9fs_direct_file_write()	Al Viro	2015-04-11	2	-82/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	just handle it in ->direct_IO() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| * \|	9p: fold v9fs_file_write_internal() into the caller	Al Viro	2015-04-11	2	-49/+30
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| * \|	9p: switch ->writepage() to direct use of p9_client_write()	Al Viro	2015-04-11	1	-22/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Don't mess with kmap() - just use ITER_BVEC. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| * \|	9p: switch p9_client_write() to passing it struct iov_iter *	Al Viro	2015-04-11	4	-97/+62
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	... and make it loop until it's done Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
\| * \|	net/9p: switch the guts of p9_client_{read,write}() to iov_iter	Al Viro	2015-04-11	4	-133/+147
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	... and have get_user_pages_fast() mapping fewer pages than requested to generate a short read/write. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
* \| \|	switch hugetlbfs to ->read_iter()	Al Viro	2015-04-11	1	-58/+34
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	... and fix the case when the area we are asked to read crosses a hugepage boundary Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>