References: <20210203003134.2422308-1-surenb@google.com> <20210203015553.GX308988@casper.infradead.org>
From: Daniel Vetter
Date: Wed, 3 Feb 2021 21:29:52 +0100
Subject: Re: [Linaro-mm-sig] [PATCH 1/2] mm: replace BUG_ON in vm_insert_page with a return of an error
To: Suren Baghdasaryan
Cc: Matthew Wilcox, "moderated list:DMA BUFFER SHARING FRAMEWORK", Sandeep Patil, Christian König, Android Kernel Team, James Jones, Linux Kernel Mailing List, Liam Mark, Brian Starkey, Christoph Hellwig, Minchan Kim, Linux MM, John Stultz, dri-devel, Chris Goldsworthy, Hridya Valsaraju, Andrew Morton, Robin Murphy, "open list:DMA BUFFER SHARING FRAMEWORK"
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Feb 3, 2021 at 9:20 PM Suren Baghdasaryan wrote:
>
> On Wed, Feb 3, 2021 at 12:52 AM Daniel Vetter wrote:
> >
> > On Wed, Feb 3, 2021 at 2:57 AM Matthew Wilcox wrote:
> > >
> > > On Tue, Feb 02, 2021 at 04:31:33PM -0800, Suren Baghdasaryan wrote:
> > > > Replace BUG_ON(vma->vm_flags & VM_PFNMAP) in vm_insert_page with
> > > > WARN_ON_ONCE and returning an error. This is to ensure users of the
> > > > vm_insert_page that set VM_PFNMAP are notified of the wrong flag usage
> > > > and get an indication of an error without panicing the kernel.
> > > > This will help identifying drivers that need to clear VM_PFNMAP before
> > > > using dmabuf system heap which is moving to use vm_insert_page.
> > > >
> > > NACK.
> > >
> > > The system may not _panic_, but it is clearly now _broken_. The device
> > > doesn't work, and so the system is useless. You haven't really improved
> > > anything here. Just bloated the kernel with yet another _ONCE variable
> > > that in a normal system will never ever ever be triggered.
> >
> > Also, what the heck are you doing with your drivers? dma-buf mmap must
> > call dma_buf_mmap(), even for forwarded/redirected mmaps from driver
> > char nodes. If that doesn't work we have some issues with the calling
> > contract for that function, not in vm_insert_page.
>
> The particular issue I observed (details were posted in
> https://lore.kernel.org/patchwork/patch/1372409) is that DRM drivers
> set VM_PFNMAP flag (via a call to drm_gem_mmap_obj) before calling
> dma_buf_mmap. Some drivers clear that flag but some don't. I could not
> find the answer to why VM_PFNMAP is required for dmabuf mappings and
> maybe someone can explain that here?
> If there is a reason to set this flag other than historical use of
> carveout memory then we wanted to catch such cases and fix the drivers
> that moved to using dmabuf heaps. However maybe there are other
> reasons and if so I would be very grateful if someone could explain
> them. That would help me to come up with a better solution.
>
> > Finally why exactly do we need to make this switch for system heap?
> > I've recently looked at gup usage by random drivers, and found a lot
> > of worrying things there. gup on dma-buf is really bad idea in
> > general.
>
> The reason for the switch is to be able to account dmabufs allocated
> using dmabuf heaps to the processes that map them. The next patch in
> this series https://lore.kernel.org/patchwork/patch/1374851
> implementing the switch contains more details and there is an active
> discussion there. Would you mind joining that discussion to keep it in
> one place?

How many semi-unrelated buffer accounting schemes does google come up with?
We're at three with this one.

And also we _cannot_ require that all dma-bufs are backed by struct
page, so requiring struct page to make this work is a no-go.

Second, we do not want to allow get_user_pages and friends to work on
dma-buf, it causes all kinds of pain. Yes, on SoCs where dma-bufs are
exclusively in system memory you can maybe get away with this, but
dma-buf is supposed to work in more places than just Android SoCs.

If you want to account dma-bufs, and gpu memory in general, I'd say
the solid solution is cgroups. There are patches floating around. And
given that Google Android can't even agree internally on what exactly
you want, I'd say we just need to cut over to that and make it happen.

Cheers, Daniel
-- 
Daniel Vetter
Software Engineer, Intel Corporation
http://blog.ffwll.ch
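
For readers following the thread, the change proposed in the quoted
patch description (replace the BUG_ON on VM_PFNMAP with WARN_ON_ONCE
plus an error return) can be sketched roughly as below. This is a
userspace model, not kernel code and not the patch itself: mock_vma,
MOCK_VM_PFNMAP, mock_warn_on_once and mock_insert_page are hypothetical
stand-ins, and -EINVAL is an assumed error code, since the description
only says "returning an error".

```c
#include <errno.h>
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical stand-in for the kernel flag; the value is illustrative only. */
#define MOCK_VM_PFNMAP 0x1UL

/* Hypothetical stand-in for struct vm_area_struct, reduced to its flags. */
struct mock_vma {
    unsigned long vm_flags;
};

/*
 * Print a warning the first time the condition holds, and always report
 * whether it held. Simplification: one shared "warned" flag, whereas the
 * kernel macro keeps one flag per call site.
 */
static bool mock_warn_on_once(bool cond, const char *what)
{
    static bool warned;

    if (cond && !warned) {
        warned = true;
        fprintf(stderr, "WARNING: %s\n", what);
    }
    return cond;
}

/*
 * Models the proposed behaviour: a vma with the PFNMAP flag set is
 * rejected with an error (assumed -EINVAL) instead of halting.
 */
static int mock_insert_page(const struct mock_vma *vma)
{
    if (mock_warn_on_once(vma->vm_flags & MOCK_VM_PFNMAP, "VM_PFNMAP set"))
        return -EINVAL;
    return 0;
}
```

In this model the caller still gets an error on every call, which is the
point of the NACK quoted above: the mapping fails either way, the warning
just appears once instead of the machine going down.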