Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1764321Ab3DDSHW (ORCPT ); Thu, 4 Apr 2013 14:07:22 -0400 Received: from mail-ob0-f170.google.com ([209.85.214.170]:40988 "EHLO mail-ob0-f170.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1764294Ab3DDSHU (ORCPT ); Thu, 4 Apr 2013 14:07:20 -0400 MIME-Version: 1.0 In-Reply-To: <514BAB13.3000101@gmail.com> References: <20130321043449.7229.81056.stgit@amt.stowe> <20130321043502.7229.43877.stgit@amt.stowe> <514BAB13.3000101@gmail.com> From: Bjorn Helgaas Date: Thu, 4 Apr 2013 12:06:58 -0600 Message-ID: Subject: Re: [PATCH 2/3] PCI: Handle device quirks when accessing sysfs resource entries To: Robert Hancock Cc: Myron Stowe , "linux-pci@vger.kernel.org" , Xiangliang Yu , xiangliang yu , Greg Kroah-Hartman , "alex.williamson@redhat.com" , Kay Sievers , "linux-scsi@vger.kernel.org" , "linux-kernel@vger.kernel.org" Content-Type: text/plain; charset=ISO-8859-1 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 4007 Lines: 78 On Thu, Mar 21, 2013 at 6:51 PM, Robert Hancock wrote: > On 03/20/2013 10:35 PM, Myron Stowe wrote: >> >> Sysfs includes entries to memory regions that back a PCI device's BARs. >> The pci-sysfs entries backing I/O Port BARs can be accessed by userspace, >> providing direct access to the device's registers. File permissions >> prevent random users from accessing the device's registers through these >> files, but don't stop a privileged app that chooses to ignore the purpose >> of these files from doing so. >> >> There are devices with abnormally strict restrictions with respect to >> accessing their registers; aspects that are typically handled by the >> device's driver. When these access restrictions are not followed - as >> when a userspace app such as "udevadm info --attribute-walk >> --path=/sys/..." parses though reading all the device's sysfs entries - it >> can cause such devices to fail. >> >> This patch introduces a quirking mechanism that can be used to detect >> accesses that do no meet the device's restrictions, letting a device >> specific method intervene and decide how to progress. >> >> Reported-by: Xiangliang Yu >> Signed-off-by: Myron Stowe > > > I honestly don't think there's much point in even attempting this strategy. > This list of devices in the quirk can't possibly be complete. It would > likely be easier to enumerate a white-list of devices that can deal with > their IO ports being read willy-nilly than a blacklist of those that don't, > as there's likely countless devices that fall into this category. Even if > they don't choke as badly as these ones do, it's quite likely that bad > behavior will result. I agree, I'm not inclined to add a bunch of code that only fixes one device when there are likely many others with similar problems. > I think there's a few things that need to be done: > > -Fix the bug in udevadm that caused it to trawl through these files > willy-nilly, We can argue about whether this is actually a bug in udevadm, but either way, my opinion is that the data that udevadm prints out from these files is uninteresting in the same way the data from "uevent," "dev," "modalias," "resource," etc. is uninteresting. So to me, a udevadm change seems justified for that reason. > -Fix the kernel so that access through these files complies with the > kernel's mechanisms for claiming IO/memory regions to prevent access > conflicts (i.e. opening these files should claim the resource region they > refer to, and should fail with EBUSY or something if another process or a > kernel driver is using it). > > -Reconsider whether supporting read/write on the resource files for IO port > regions like these makes any sense. Obviously mmap isn't very practical for > IO port access on x86 but you could even do something like an ioctl for this > purpose. Not very many pieces of software would need to access these files > so it's likely OK if the API is a bit ugly. That would prevent something > like grepping through sysfs from generating port accesses to random devices. This is the approach I'd like to push on for a kernel fix. I'm not a VM person, but if it were possible to support .mmap() in such a way that a handler would be called for every access to the region, we could easily support I/O port access that way. Along that line, I'm concerned that we may have a hole in the MEM BAR .mmap() path. What happens if we have BARs from two different devices in the same page, and we mmap() one of them? Does the user also get access to the second device's BAR on the same page? If so, that could also be fixed with an .mmap() trap approach. Performance would suck for these small BARs, of course. Bjorn -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/