Received: by 2002:a05:6a10:6744:0:0:0:0 with SMTP id w4csp1012171pxu; Thu, 8 Oct 2020 00:29:21 -0700 (PDT) X-Google-Smtp-Source: ABdhPJx7YyGjDYH5IBVueTw3z0BfpG84AsLicMo0NqY5MrHGWTgpIjj6kgFVxtBKhpVTZtZI5y0V X-Received: by 2002:a17:906:1744:: with SMTP id d4mr7490033eje.326.1602142161437; Thu, 08 Oct 2020 00:29:21 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1602142161; cv=none; d=google.com; s=arc-20160816; b=xiAhRBY8PSsvWda6necpfYQmekpxuDEGcXM7VuD74jJndT4/mVpk6zKaw4oGzeoT6B K+GtuvaTZZeBZ3vIq3pEvjJNSmeNjjF1FA2J0PWTdll+o/9B6S841u9zixgUvUo+exJI Z/EoyNZsv2lpx5BIerjGMlztmQ140g/6rZfUQuuOdlQu5vokW3QEhwYObiXE5n5EodKE fD917/m4uev1W5ZsaaC9HqizdXOWmr1f6J/X6MSJbfH15lTMhTWv/iD2l1VboVY1Wbr6 nSjpHQFugbnrxSW98EIMML5vUT9qdKqg9F9apm85ZWY/sXI5CbTE4i38zr4pBSdjiDrr Ksxw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=Ui8WkZ8ZsqWNVKFISnxizxdl1xBQtE0ojtcqIbBf85o=; b=OmZlS6yLcLb/MHU9Nq/2QDTI3FpoMKoNnx19t88nllL1/6ahgvNTweUm0VTBsfymih sZJNu/RfY5+QKnhQoFqt6ulb6ttACkZgAwKqm9VJGK0m22mEp5pom7DXj5ry5DBJYQv8 9bDGnYMokCdhAiHL+Yjvxe7WqP0GJ66UsHKaXOLtj3X0ieWyAtBJExM1kpnNWgNLBcUu vmmJN2L+n/zSLWsOSlQUPyUhsZ/e6VCSDLsvTwvcPSthiwIJaioJ01Wu4Th4sHsIQYWB tk7aCrPwuK/Jc1oSJe1zsjxtC9euUr0Vvb5RXPqgbV0skCIE3wJTJlVYUm8mcQqw0LVK vmQQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ffwll.ch header.s=google header.b=OLHQfD6c; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id bl4si3669230ejb.190.2020.10.08.00.28.56; Thu, 08 Oct 2020 00:29:21 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@ffwll.ch header.s=google header.b=OLHQfD6c; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726754AbgJHHX1 (ORCPT + 99 others); Thu, 8 Oct 2020 03:23:27 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:47098 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725874AbgJHHX1 (ORCPT ); Thu, 8 Oct 2020 03:23:27 -0400 Received: from mail-oo1-xc43.google.com (mail-oo1-xc43.google.com [IPv6:2607:f8b0:4864:20::c43]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id DF33AC0613D4 for ; Thu, 8 Oct 2020 00:23:26 -0700 (PDT) Received: by mail-oo1-xc43.google.com with SMTP id w7so1253413oow.7 for ; Thu, 08 Oct 2020 00:23:26 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ffwll.ch; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=Ui8WkZ8ZsqWNVKFISnxizxdl1xBQtE0ojtcqIbBf85o=; b=OLHQfD6clKQWUqpmaAK9RMlR/tsEUgzWlL24nlPViG1Rk9p8LHmyCmM7/5WLHXKEvu lI/irRP+aq5gBsvKA49uJgztAy930tPikm27PGApe4JTRXsGR0g3qvfoDhdKCEoe0v+J hUX2iSsNsi3+8JirFm0OBhWG4fOImFFw/17nk= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=Ui8WkZ8ZsqWNVKFISnxizxdl1xBQtE0ojtcqIbBf85o=; b=LPq8pX2vXK1fy5Fl633yagsi7GTaUD0h1Jm8Hj4jT/ZZb2YmlAihzH60MFb0uBQ0bz B5wV079gK0QJ+Kouv3D1PDozZTPbl94Eyx3l0+Tvxa4A8NDvDMmJaSlZM9kYg1HvY7gr SON5OVWUVoT8G03bH07cR7+sgVKK6O/AlPsDdn2SX9hD7K0lnr6N8ARSuA8bHNxuAie3 5RETmNyzNEpJDgsGF/zwMrGntF4x3r5hDPuij9rQfdcmwcfOOv1l65g5zoowFAQra72/ BZEka2VNfrr2raa7m/28u+PhTZdaNIoHS77bQ751Uvo0xgnqQOXW4jUGDBmppF92Ouj4 dMbw== X-Gm-Message-State: AOAM533u6uP9Emit5x3RgnGwr3hKsM8HcBHgjcjkobeB/DaF+5LW5X+j uK5RH7F8y1kioe/eGEmGEPGIwP71DQnNWHXqfI4fQA== X-Received: by 2002:a4a:c011:: with SMTP id v17mr4481557oop.89.1602141806038; Thu, 08 Oct 2020 00:23:26 -0700 (PDT) MIME-Version: 1.0 References: <20201007164426.1812530-1-daniel.vetter@ffwll.ch> <20201007164426.1812530-8-daniel.vetter@ffwll.ch> <852a74ec-339b-4c7f-9e29-b9736111849a@nvidia.com> In-Reply-To: <852a74ec-339b-4c7f-9e29-b9736111849a@nvidia.com> From: Daniel Vetter Date: Thu, 8 Oct 2020 09:23:14 +0200 Message-ID: Subject: Re: [PATCH 07/13] mm: close race in generic_access_phys To: John Hubbard Cc: DRI Development , LKML , kvm@vger.kernel.org, Linux MM , Linux ARM , linux-samsung-soc , "open list:DMA BUFFER SHARING FRAMEWORK" , linux-s390@vger.kernel.org, Jason Gunthorpe , Dan Williams , Kees Cook , Rik van Riel , Benjamin Herrensmidt , Dave Airlie , Hugh Dickins , Andrew Morton , =?UTF-8?B?SsOpcsO0bWUgR2xpc3Nl?= , Jan Kara , Daniel Vetter Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Oct 8, 2020 at 2:44 AM John Hubbard wrote: > > On 10/7/20 9:44 AM, Daniel Vetter wrote: > > Way back it was a reasonable assumptions that iomem mappings never > > change the pfn range they point at. But this has changed: > > > > - gpu drivers dynamically manage their memory nowadays, invalidating > > ptes with unmap_mapping_range when buffers get moved > > > > - contiguous dma allocations have moved from dedicated carvetouts to > > s/carvetouts/carveouts/ > > > cma regions. This means if we miss the unmap the pfn might contain > > pagecache or anon memory (well anything allocated with GFP_MOVEABLE) > > > > - even /dev/mem now invalidates mappings when the kernel requests that > > iomem region when CONFIG_IO_STRICT_DEVMEM is set, see 3234ac664a87 > > ("/dev/mem: Revoke mappings when a driver claims the region") > > Thanks for putting these references into the log, it's very helpful. > ... > > diff --git a/mm/memory.c b/mm/memory.c > > index fcfc4ca36eba..8d467e23b44e 100644 > > --- a/mm/memory.c > > +++ b/mm/memory.c > > @@ -4873,28 +4873,68 @@ int follow_phys(struct vm_area_struct *vma, > > return ret; > > } > > > > +/** > > + * generic_access_phys - generic implementation for iomem mmap access > > + * @vma: the vma to access > > + * @addr: userspace addres, not relative offset within @vma > > + * @buf: buffer to read/write > > + * @len: length of transfer > > + * @write: set to FOLL_WRITE when writing, otherwise reading > > + * > > + * This is a generic implementation for &vm_operations_struct.access for an > > + * iomem mapping. This callback is used by access_process_vm() when the @vma is > > + * not page based. > > + */ > > int generic_access_phys(struct vm_area_struct *vma, unsigned long addr, > > void *buf, int len, int write) > > { > > resource_size_t phys_addr; > > unsigned long prot = 0; > > void __iomem *maddr; > > + pte_t *ptep, pte; > > + spinlock_t *ptl; > > int offset = addr & (PAGE_SIZE-1); > > + int ret = -EINVAL; > > + > > + if (!(vma->vm_flags & (VM_IO | VM_PFNMAP))) > > + return -EINVAL; > > + > > +retry: > > + if (follow_pte(vma->vm_mm, addr, &ptep, &ptl)) > > + return -EINVAL; > > + pte = *ptep; > > + pte_unmap_unlock(ptep, ptl); > > > > - if (follow_phys(vma, addr, write, &prot, &phys_addr)) > > + prot = pgprot_val(pte_pgprot(pte)); > > + phys_addr = (resource_size_t)pte_pfn(pte) << PAGE_SHIFT; > > + > > + if ((write & FOLL_WRITE) && !pte_write(pte)) > > return -EINVAL; > > > > maddr = ioremap_prot(phys_addr, PAGE_ALIGN(len + offset), prot); > > if (!maddr) > > return -ENOMEM; > > > > + if (follow_pte(vma->vm_mm, addr, &ptep, &ptl)) > > + goto out_unmap; > > + > > + if (pte_same(pte, *ptep)) { > > > The ioremap area is something I'm sorta new to, so a newbie question: > is it possible for the same pte to already be there, ever? If so, we > be stuck in an infinite loop here. I'm sure that's not the case, but > it's not yet obvious to me why it's impossible. Resource reservations > maybe? It's just buggy, it should be !pte_same. And I need to figure out how to test this I guess. -Daniel -- Daniel Vetter Software Engineer, Intel Corporation http://blog.ffwll.ch