Received: by 10.223.176.5 with SMTP id f5csp4137455wra; Tue, 30 Jan 2018 02:43:01 -0800 (PST) X-Google-Smtp-Source: AH8x226ggvlNgpfg0yE2HMAKb73heZYgQo6DbQRzHFB6lVAT9RI4LE1hASKiUYANBu73CHugU1DS X-Received: by 2002:a17:902:8f95:: with SMTP id z21-v6mr25889525plo.104.1517308981153; Tue, 30 Jan 2018 02:43:01 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1517308981; cv=none; d=google.com; s=arc-20160816; b=v9no99ytsWyBkV1j6ZWw4l1vcnjyZ6j1V/bGXfPd0Eenzveayf/65QdaprQIacKcXU iJ4p9C3b0UWNEryjZ+nGcJ3t4hpztdTu5nP6Mao7mElLfWt52aJ38dz42XCaOoZ739ql sUBYPTwp2Z1NWpZH64FMs0b5K4b0ryB0FL8hSpmGQ5sSd8r7SaTPFcFFcXrH7tZxkajN 3I14S66BkS3Ro3QjcH97TYdrW4pZWUkVMl43a0MvbbJmDIT//QY4eUEMTTVllulD92Ie rmPMhP2ox5g1/eq9h7zgd9gvH77/CCY/OnjGrN8s5f62aM80G9b/RqyPbl33HgQH8kXv Owxw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-transfer-encoding:content-disposition:mime-version :references:mail-followup-to:message-id:subject:cc:to:from:date :dkim-signature:arc-authentication-results; bh=4upkCzYEtCIC2jUwB8bbKTu/QJ2vs4QNj8TUIr0USQU=; b=yp3T1N0w8wIM4WyBWh4YU25HlrxdmnomRGny5oxEc0eOAsrcv84/niMumLcOBzFyMj U3uHU+m9gH6C5BCwormcggNu5rDwrLjLnLxDeYWED3f2OXYooANcE9uJX1i1OSFkmS0b 6xNWxvnfB9DMrl5mA3tHBDIA8mMZcjnpybr+teqJQfZ5r7N4P2YJczIUCqdRv4gu6OUg +mhYpkDnpx1l+uMx2YprJqRxPypfkRSSFo5ERX7Adlqac0Kccn8mT+EwO+2kk/uSFtiH nooWB/aS9xxqQBsodjHjrE0MYjzYhFmz3YG8mgCZ54SrvrWejf5fcjFAu/d+rAqCluJZ b+sA== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@ffwll.ch header.s=google header.b=WItOj07p; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f5si9054711pgr.235.2018.01.30.02.42.46; Tue, 30 Jan 2018 02:43:01 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=fail header.i=@ffwll.ch header.s=google header.b=WItOj07p; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751574AbeA3KmW (ORCPT + 99 others); Tue, 30 Jan 2018 05:42:22 -0500 Received: from mail-wm0-f51.google.com ([74.125.82.51]:35564 "EHLO mail-wm0-f51.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751366AbeA3KmU (ORCPT ); Tue, 30 Jan 2018 05:42:20 -0500 Received: by mail-wm0-f51.google.com with SMTP id r78so155163wme.0 for ; Tue, 30 Jan 2018 02:42:20 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ffwll.ch; s=google; h=sender:date:from:to:cc:subject:message-id:mail-followup-to :references:mime-version:content-disposition :content-transfer-encoding:in-reply-to:user-agent; bh=4upkCzYEtCIC2jUwB8bbKTu/QJ2vs4QNj8TUIr0USQU=; b=WItOj07pKl9NxUZEQ+4uUXz90gVEIwTi/9OBNZUyyTFSqjobu9imS6apCnnjdYaATY +IMktExN/YHjCdGU9CanoVFbc1P5FmEWzRcDUv0SbjQ3nkI0TPlqQXWTrrS+y4rALneA gbddXJwxs2eUlohH26LmYJ2cIWYA5wUf+gheE= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:sender:date:from:to:cc:subject:message-id :mail-followup-to:references:mime-version:content-disposition :content-transfer-encoding:in-reply-to:user-agent; bh=4upkCzYEtCIC2jUwB8bbKTu/QJ2vs4QNj8TUIr0USQU=; b=A/cSnKeiKRtE590Yw97eQIgy5caSZpb4CDw0M2ZqVcRY5LvxvIGkxfwQlTSBkh9mTp BhLuC5tDfgdFcuwYBa8Jv+6EHHxzWcq8Y79/YNiMxTHFF13JCSdn7slm7ctq7IqVFxsW B4VyKgzRNAOYcbvGF4LL0OUyYMi0A2hkcTWcm4tjSk1R5JoxL3XEl89IKasGM55bPkym CIaA0Stffd8af9TX3qtmoaExnrxSG3AK57iIcp/Ef92BZaN2Do2nlY4X9kZ39v/ylgEm RUOtaIq9fxGGj6sQjF/4RmIAN036pzO6g7YXBEHhAnCb/szOzJxUh/mT27XqkE4H9847 WizQ== X-Gm-Message-State: AKwxytfP0RzVG2QRlkSDQBMvrdyY4W/r6vcuz7SuzWmA8Lis92+eckL1 OKfZXisw2WgWnD7x5jsFHij+6Q== X-Received: by 10.80.212.154 with SMTP id s26mr49415785edi.268.1517308939290; Tue, 30 Jan 2018 02:42:19 -0800 (PST) Received: from phenom.ffwll.local ([2a02:168:56e6:0:e4bc:76a0:8042:669e]) by smtp.gmail.com with ESMTPSA id u17sm8139366edm.6.2018.01.30.02.42.18 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Tue, 30 Jan 2018 02:42:18 -0800 (PST) Date: Tue, 30 Jan 2018 11:42:16 +0100 From: Daniel Vetter To: Michel =?iso-8859-1?Q?D=E4nzer?= Cc: christian.koenig@amd.com, Michal Hocko , dri-devel@lists.freedesktop.org, Roman Gushchin , linux-kernel@vger.kernel.org, linux-mm@kvack.org, amd-gfx@lists.freedesktop.org Subject: Re: [RFC] Per file OOM badness Message-ID: <20180130104216.GR25930@phenom.ffwll.local> Mail-Followup-To: Michel =?iso-8859-1?Q?D=E4nzer?= , christian.koenig@amd.com, Michal Hocko , dri-devel@lists.freedesktop.org, Roman Gushchin , linux-kernel@vger.kernel.org, linux-mm@kvack.org, amd-gfx@lists.freedesktop.org References: <20180123153631.GR1526@dhcp22.suse.cz> <20180124092847.GI1526@dhcp22.suse.cz> <583f328e-ff46-c6a4-8548-064259995766@daenzer.net> <20180124110141.GA28465@dhcp22.suse.cz> <36b49523-792d-45f9-8617-32b6d9d77418@daenzer.net> <20180124115059.GC28465@dhcp22.suse.cz> <381a868c-78fd-d0d1-029e-a2cf4ab06d37@gmail.com> <20180130093145.GE25930@phenom.ffwll.local> <3db43c1a-59b8-af86-2b87-c783c629f512@daenzer.net> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <3db43c1a-59b8-af86-2b87-c783c629f512@daenzer.net> X-Operating-System: Linux phenom 4.14.0-1-amd64 User-Agent: Mutt/1.9.2 (2017-12-15) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Jan 30, 2018 at 10:43:10AM +0100, Michel D?nzer wrote: > On 2018-01-30 10:31 AM, Daniel Vetter wrote: > > On Wed, Jan 24, 2018 at 01:11:09PM +0100, Christian K?nig wrote: > >> Am 24.01.2018 um 12:50 schrieb Michal Hocko: > >>> On Wed 24-01-18 12:23:10, Michel D?nzer wrote: > >>>> On 2018-01-24 12:01 PM, Michal Hocko wrote: > >>>>> On Wed 24-01-18 11:27:15, Michel D?nzer wrote: > >>> [...] > >>>>>> 2. If the OOM killer kills a process which is sharing BOs with another > >>>>>> process, this should result in the other process dropping its references > >>>>>> to the BOs as well, at which point the memory is released. > >>>>> OK. How exactly are those BOs mapped to the userspace? > >>>> I'm not sure what you're asking. Userspace mostly uses a GEM handle to > >>>> refer to a BO. There can also be userspace CPU mappings of the BO's > >>>> memory, but userspace doesn't need CPU mappings for all BOs and only > >>>> creates them as needed. > >>> OK, I guess you have to bear with me some more. This whole stack is a > >>> complete uknonwn. I am mostly after finding a boundary where you can > >>> charge the allocated memory to the process so that the oom killer can > >>> consider it. Is there anything like that? Except for the proposed file > >>> handle hack? > >> > >> Not that I knew of. > >> > >> As I said before we need some kind of callback that a process now starts to > >> use a file descriptor, but without anything from that file descriptor mapped > >> into the address space. > > > > For more context: With DRI3 and wayland the compositor opens the DRM fd > > and then passes it to the client, which then starts allocating stuff. That > > makes book-keeping rather annoying. > > Actually, what you're describing is only true for the buffers shared by > an X server with an X11 compositor. For the actual applications, the > buffers are created on the client side and then shared with the X server > / Wayland compositor. > > Anyway, it doesn't really matter. In all cases, the buffers are actually > used by all parties that are sharing them, so charging the memory to all > of them is perfectly appropriate. > > > > I guess a good first order approximation would be if we simply charge any > > newly allocated buffers to the process that created them, but that means > > hanging onto lots of mm_struct pointers since we want to make sure we then > > release those pages to the right mm again (since the process that drops > > the last ref might be a totally different one, depending upon how the > > buffers or DRM fd have been shared). > > > > Would it be ok to hang onto potentially arbitrary mmget references > > essentially forever? If that's ok I think we can do your process based > > account (minus a few minor inaccuracies for shared stuff perhaps, but no > > one cares about that). > > Honestly, I think you and Christian are overthinking this. Let's try > charging the memory to every process which shares a buffer, and go from > there. I'm not concerned about wrongly accounting shared buffers (they don't matter), but imbalanced accounting. I.e. allocate a buffer in the client, share it, but then the compositor drops the last reference. If we store the mm_struct pointer in drm_gem_object, we don't need any callback from the vfs when fds are shared or anything like that. We can simply account any newly allocated buffers to the current->mm, and then store that later for dropping the account for when the gem obj is released. This would entirely ignore any complications with shared buffers, which I think we can do because even when we pass the DRM fd to a different process, the actual buffer allocations are not passed around like that for private buffers. And private buffers are the only ones that really matter. -Daniel -- Daniel Vetter Software Engineer, Intel Corporation http://blog.ffwll.ch