linux.git - Linux kernel mainline tree

	Commit message (Collapse)	Author	Age	Files	Lines
...
\| * \|	NVMe: Fix off-by-one when filling in PRP lists	Matthew Wilcox	2011-11-04	1	-3/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the last element in the PRP list fits on the end of the page, there's no need to allocate an extra page to put that single element in. It can fit on the end of the page. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Fix interpretation of 'Number of Namespaces' field	Matthew Wilcox	2011-11-04	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The spec says this is a 0s based value. We don't need to handle the maximal value because it's reserved to mean "every namespace". Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Remove outdated comments	Matthew Wilcox	2011-11-04	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The head can never overrun the tail since we won't allocate enough command IDs to let that happen. The status codes are in sync with the spec. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Fix comment formatting	Matthew Wilcox	2011-11-04	1	-2/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Reported-by: Randy Dunlap <rdunlap@xenotime.net> Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Convert comments to kernel-doc notation	Matthew Wilcox	2011-11-04	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Reported-by: Randy Dunlap <rdunlap@xenotime.net> Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Version 0.4	Matthew Wilcox	2011-11-04	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Reduce maximum queue depth by 1	Matthew Wilcox	2011-11-04	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The spec says we're not allowed to completely fill the submission queue. Solve this by reducing the number of allocatable cmdids by 1. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Fix discontiguous accesses	Matthew Wilcox	2011-11-04	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When we submit subsequent portions of the I/O, we need to access the updated block, not start reading again from the original position. This was showing up as miscompares in the XFS randholes testcase. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Handle bios that contain non-virtually contiguous addresses	Matthew Wilcox	2011-11-04	1	-9/+29
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	NVMe scatterlists must be virtually contiguous, like almost all I/Os. However, when the filesystem lays out files with a hole, it can be that adjacent LBAs map to non-adjacent virtual addresses. Handle this by submitting one NVMe command at a time for each virtually discontiguous range. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Implement Flush	Matthew Wilcox	2011-11-04	1	-0/+40
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Linux implements Flush as a bit in the bio. That means there may also be data associated with the flush; if so the flush should be sent before the data. To avoid completing the bio twice, I add CMD_CTX_FLUSH to indicate the completion routine should do nothing. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Mark CMD_CTX_CANCELLED as being unlikely	Matthew Wilcox	2011-11-04	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Correct SQ doorbell semantics	Matthew Wilcox	2011-11-04	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The value written to the doorbell needs to be the first free index in the queue, not the most recently used index in the queue. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Let the kthread take care of devices earlier	Matthew Wilcox	2011-11-04	1	-4/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If interrupts are misconfigured, the kthread will be needed to process admin queue completions. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Rename nr_queues to nr_io_queues	Matthew Wilcox	2011-11-04	1	-11/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I got confused about whether this included the admin queue or not, and had to resort to reading the spec. It doesn't include the admin queue, so make that clear in the name. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Remove setting of 'flags' in rw command	Matthew Wilcox	2011-11-04	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This was the data transfer bit until spec rev 0.92 Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Release 0.3	Matthew Wilcox	2011-11-04	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Add a kthread to handle the congestion list	Matthew Wilcox	2011-11-04	1	-19/+67
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Instead of trying to resubmit I/Os in the I/O completion path (in interrupt context), wake up a kthread which will resubmit I/O from user context. This allows mke2fs to run to completion. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Handle failures differently in nvme_submit_bio_queue()	Matthew Wilcox	2011-11-04	1	-19/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Return -EBUSY if the queue is full or -ENOMEM if we failed to allocate memory (or map a scatterlist). Also use GFP_ATOMIC to allocate the nvme_bio and move the locking to the callers of nvme_submit_bio_queue(). In nvme_make_request(), don't permit an I/O to jump the queue -- if the congestion list already has an entry, just add to the tail, rather than trying to submit. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Handle physical merging of bvec entries	Matthew Wilcox	2011-11-04	1	-9/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In order to not overrun the sg array, we have to merge physically contiguous pages into a single sg entry. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Check for DMA mapping failure	Matthew Wilcox	2011-11-04	1	-1/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If dma_map_sg returns 0 (failure), we need to fail the I/O. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Pass the nvme_dev to nvme_free_prps and nvme_setup_prps	Matthew Wilcox	2011-11-04	1	-13/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We were passing the nvme_queue to access the q_dmadev for the dma_alloc_coherent calls, but since we moved to the dma pool API, we really only need the nvme_dev. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Optimise memory usage for I/Os between 4k and 128k	Matthew Wilcox	2011-11-04	1	-8/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add a second memory pool for smaller I/Os. We can pack 16 of these on a single page instead of using an entire page for each one. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Switch to use DMA Pool API	Matthew Wilcox	2011-11-04	1	-7/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Calling dma_free_coherent from interrupt context causes warnings. Using the DMA pools delays freeing until pool destruction, so avoids the problem. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Rename nvme_req_info to nvme_bio	Matthew Wilcox	2011-11-04	1	-24/+24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There are too many things called 'info' in this driver. This data structure is auxiliary information for a struct bio, so call it nvme_bio, or nbio when used as a variable. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Initial PRP List support	Shane Michael Matthews	2011-11-04	1	-13/+91
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add a pointer to the nvme_req_info to hold a new data structure (nvme_prps) which contains a list of the pages allocated to this particular request for holding PRP list entries. nvme_setup_prps() now returns this pointer. To allocate and free the memory used for PRP lists, we need a struct device, so we need to pass the nvme_queue pointer to many functions which didn't use to need it. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Advance the sg pointer when filling in an sg list	Matthew Wilcox	2011-11-04	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For multipage BIOs, we were always using sg[0] instead of advancing through the list. Oops :-) Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Renumber the special context values	Matthew Wilcox	2011-11-04	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If POISON_POINTER_DELTA isn't defined, ensure they're in page 0 which should never be mapped. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Handle the congestion list a little better	Matthew Wilcox	2011-11-04	1	-0/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In the bio completion handler, check for bios on the congestion list for this NVM queue. Also, lock the congestion list in the make_request function as the queue may end up being shared between multiple CPUs. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Record the timeout for each command	Matthew Wilcox	2011-11-04	1	-17/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In addition to recording the completion data for each command, record the anticipated completion time. Choose a timeout of 5 seconds for normal I/Os and 60 seconds for admin I/Os. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Need to lock queue during interrupt handling	Matthew Wilcox	2011-11-04	1	-6/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we're sharing a queue between multiple CPUs and we cancel a sync I/O, we must have the queue locked to avoid corrupting the stack of the thread that submitted the I/O. It turns out this is the same locking that's needed for the threaded irq handler, so share that code. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Detect command IDs completing that are out of range	Matthew Wilcox	2011-11-04	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the adapter completes a command ID that is outside the bounds of the array, return CMD_CTX_INVALID instead of random data, and print a message in the sync_completion handler (which is rapidly becoming the misc completion handler :-) Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Detect commands that are completed twice	Matthew Wilcox	2011-11-04	1	-3/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Set the context value to CMD_CTX_COMPLETED, and print a message in the sync_completion handler if we see it. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Use a symbolic name to represent cancelled commands instead of 0	Matthew Wilcox	2011-11-04	1	-8/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I have plans for other special values in sync_completion. Plus, this is more self-documenting, and lets us detect bogus usages. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Add a module parameter to use a threaded interrupt	Matthew Wilcox	2011-11-04	1	-0/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We're currently calling bio_endio from hard interrupt context. This is not a good idea for preemptible kernels as it will cause longer latencies. Using a threaded interrupt will run the entire queue processing mechanism (including bio_endio) in a thread, which can be preempted. Unfortuantely, it also adds about 7us of latency to the single-I/O case, so make it a module parameter for the moment. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Call put_nvmeq() before calling nvme_submit_sync_cmd()	Matthew Wilcox	2011-11-04	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We can't have preemption disabled when we call schedule(). Accept the possibility that we'll get preempted, and it'll cost us some cacheline bounces. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Allow fatal signals to interrupt I/O	Matthew Wilcox	2011-11-04	1	-6/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the user sends a fatal signal, sleeping in the TASK_KILLABLE state permits the task to be aborted. The only wrinkle is making sure that if/when the command completes later that it doesn't upset anything. Handle this by setting the data pointer to 0, and checking the value isn't NULL in the sync completion path. Eventually, bios can be cancelled through this path too. Note that the cmdid isn't freed to prevent reuse. We should also abort the command in the future, but this is a good start. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Release 0.2	Matthew Wilcox	2011-11-04	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Add download / activate firmware ioctls	Matthew Wilcox	2011-11-04	1	-0/+45
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Move sysfs entries to the right place	Matthew Wilcox	2011-11-04	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Because I wasn't setting driverfs_dev, the devices were showing up under /sys/devices/virtual/block. Now they appear underneath the PCI device which they belong to. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Disable the device before we write the admin queues	Shane Michael Matthews	2011-11-04	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In case the card has been left in a partially-configured state, write 0 to the Enable bit. Signed-off-by: Shane Michael Matthews <shane.matthews@intel.com> Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Request I/O regions	Matthew Wilcox	2011-11-04	1	-3/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Calling pci_request_selected_regions() reserves these regions for our use. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Allow queues to be allocated above 4GB	Matthew Wilcox	2011-11-04	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Need to call dma_set_coherent_mask() to allow queues to be allocated above 4GB. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Enable device DMA	Matthew Wilcox	2011-11-04	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Need to call pci_set_master() to enable device DMA Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Enable and disable the PCI device	Shane Michael Matthews	2011-11-04	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Call pci_enable_device_mem() at initialisation and pci_disable_device at exit. Signed-off-by: Shane Michael Matthews <shane.matthews@intel.com> Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Check returns from nvme_alloc_queue()	Matthew Wilcox	2011-11-04	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It can return NULL, so handle that. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Remove 'node' from nvme_dev	Matthew Wilcox	2011-11-04	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We don't keep a list of nvme_dev any more Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Read the model, serial & firmware rev from the controller	Matthew Wilcox	2011-11-04	1	-1/+9
\| \| \| \| \| \| \| \| \| \| \| \|	Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Add NVME_IOCTL_SUBMIT_IO	Matthew Wilcox	2011-11-04	1	-0/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Allow userspace to submit synchronous I/O like the SCSI sg interface does. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Create nvme_map_user_pages() and nvme_unmap_user_pages()	Matthew Wilcox	2011-11-04	1	-15/+53
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	These are generalisations of the code that was in nvme_submit_user_admin_command(). Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>
\| * \|	NVMe: Change NVME_IOCTL_GET_RANGE_TYPE to return all the ranges	Matthew Wilcox	2011-11-04	1	-30/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Factor out most of nvme_identify() into a new nvme_submit_user_admin_command() function. Change nvme_get_range_type() to call it and change nvme_ioctl to realise that it's getting back all 64 ranges. Signed-off-by: Matthew Wilcox <matthew.r.wilcox@intel.com>