Received: by 2002:a25:b794:0:0:0:0:0 with SMTP id n20csp5605225ybh; Wed, 7 Aug 2019 08:34:28 -0700 (PDT) X-Google-Smtp-Source: APXvYqwNgNL8yCBR+84h2S1zh9VNCCVryAlTvam6osT43ueXo2A7eIDdbf1yQ3Wb+HwkF+es8rvo X-Received: by 2002:a62:79c2:: with SMTP id u185mr10036171pfc.237.1565192068607; Wed, 07 Aug 2019 08:34:28 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1565192068; cv=none; d=google.com; s=arc-20160816; b=fVMk7M7CzXZr6g4hmzwJ/wMf7P8RQmis/ZE+11qJ4AfS72lC4raiRSZMoJt9f0AcJ6 Kr6whsJE7t9I5/zw+gT20RYZ6mqOATETLtxUmZGV0rQPmjwXkmlfH/QPhCG9SGMhMBrO qVCx+/ETzutVImyKJTn4VuRhPjM0/QgFb4WhuCR1wlAk9kEKIybH5iTRdcGMOnxRv2zC q9/bbWFFdAvP3Doe0PNXcGc0I/1SYotz5ARNDJBrF9bSeRJ3Gm/jnrdENyllSim/9djB +U1F3Ggy86JPczD1/J0RZwFml69dyMuGA18ROGcNrCLfRFLu/q//WHLq4g3u4mvjEkPO y4Wg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=zxf6XN/biAMtxrk5NqSSYToLgatnIlo8JOgL2S7Co70=; b=MeP1te711wz8bE3e/Wg3KgJ8t6i+m+kbMEOF7aFU7WyIkprDujA+aLSPafEdnElGyt +VED1kNsB9ZCPNvDQrV088GVyzpF3K12Qobz7XspC1nWimrFxMXtAJixNV0Kqzb8BhRq AoUI9aGACO7xgjipX7cV4ATRu/OtPCM/+P0sDL8OwHsgEFYka6btuUVaagEk8ipDyw31 Ex4tnuYMxrn3ecjNAPi9J8WGZHHqJnF+Q592s/KkzFzJamEUS3ZmybePTkw2yk8ekMMj l6hgv4WWDKBJf97KDXu/36qXgzpXHYbJjgwbtL4Svp+F+wY6+dFsBVnwciotJqvrX5QO QPqA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 73si49015544pld.221.2019.08.07.08.34.12; Wed, 07 Aug 2019 08:34:28 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2388603AbfHGPcz (ORCPT + 99 others); Wed, 7 Aug 2019 11:32:55 -0400 Received: from foss.arm.com ([217.140.110.172]:50338 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2387543AbfHGPcz (ORCPT ); Wed, 7 Aug 2019 11:32:55 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id B7905344; Wed, 7 Aug 2019 08:32:54 -0700 (PDT) Received: from [10.1.196.133] (e112269-lin.cambridge.arm.com [10.1.196.133]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 2C9A53F706; Wed, 7 Aug 2019 08:32:53 -0700 (PDT) Subject: Re: drm pull for v5.3-rc1 To: Matthew Wilcox Cc: Christoph Hellwig , Linus Torvalds , =?UTF-8?Q?Thomas_Hellstr=c3=b6m_=28VMware=29?= , Dave Airlie , Thomas Hellstrom , Daniel Vetter , LKML , dri-devel , Jerome Glisse , Jason Gunthorpe , Andrew Morton , Linux-MM References: <48890b55-afc5-ced8-5913-5a755ce6c1ab@shipmail.org> <20190806073831.GA26668@infradead.org> <20190806190937.GD30179@bombadil.infradead.org> <20190807064000.GC6002@infradead.org> <20190807141517.GA5482@bombadil.infradead.org> <62cbe523-e8a4-cdfd-90c2-80260cefa5de@arm.com> <20190807145601.GB5482@bombadil.infradead.org> From: Steven Price Message-ID: <4b9ea419-571b-93ab-ee52-811e52c0ae91@arm.com> Date: Wed, 7 Aug 2019 16:32:51 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 MIME-Version: 1.0 In-Reply-To: <20190807145601.GB5482@bombadil.infradead.org> Content-Type: text/plain; charset=utf-8 Content-Language: en-GB Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 07/08/2019 15:56, Matthew Wilcox wrote: > On Wed, Aug 07, 2019 at 03:30:38PM +0100, Steven Price wrote: >> On 07/08/2019 15:15, Matthew Wilcox wrote: >>> On Tue, Aug 06, 2019 at 11:40:00PM -0700, Christoph Hellwig wrote: >>>> On Tue, Aug 06, 2019 at 12:09:38PM -0700, Matthew Wilcox wrote: >>>>> Has anyone looked at turning the interface inside-out? ie something like: >>>>> >>>>> struct mm_walk_state state = { .mm = mm, .start = start, .end = end, }; >>>>> >>>>> for_each_page_range(&state, page) { >>>>> ... do something with page ... >>>>> } >>>>> >>>>> with appropriate macrology along the lines of: >>>>> >>>>> #define for_each_page_range(state, page) \ >>>>> while ((page = page_range_walk_next(state))) >>>>> >>>>> Then you don't need to package anything up into structs that are shared >>>>> between the caller and the iterated function. >>>> >>>> I'm not an all that huge fan of super magic macro loops. But in this >>>> case I don't see how it could even work, as we get special callbacks >>>> for huge pages and holes, and people are trying to add a few more ops >>>> as well. >>> >>> We could have bits in the mm_walk_state which indicate what things to return >>> and what things to skip. We could (and probably should) also use different >>> iterator names if people actually want to iterate different things. eg >>> for_each_pte_range(&state, pte) as well as for_each_page_range(). >>> >> >> The iterator approach could be awkward for the likes of my generic >> ptdump implementation[1]. It would require an iterator which returns all >> levels and allows skipping levels when required (to prevent KASAN >> slowing things down too much). So something like: >> >> start_walk_range(&state); >> for_each_page_range(&state, page) { >> switch(page->level) { >> case PTE: >> ... >> case PMD: >> if (...) >> skip_pmd(&state); >> ... >> case HOLE: >> .... >> ... >> } >> } >> end_walk_range(&state); >> >> It seems a little fragile - e.g. we wouldn't (easily) get type checking >> that you are actually treating a PTE as a pte_t. The state mutators like >> skip_pmd() also seem a bit clumsy. > > Once you're on-board with using a state structure, you can use it in all > kinds of fun ways. For example: > > struct mm_walk_state { > struct mm_struct *mm; > unsigned long start; > unsigned long end; > unsigned long curr; > p4d_t p4d; > pud_t pud; > pmd_t pmd; > pte_t pte; > enum page_entry_size size; > int flags; > }; > > For this user, I'd expect something like ... > > DECLARE_MM_WALK_FLAGS(state, mm, start, end, > MM_WALK_HOLES | MM_WALK_ALL_SIZES); > > walk_each_pte(state) { > switch (state->size) { > case PE_SIZE_PTE: > ... > case PE_SIZE_PMD: > if (...(state->pmd)) > continue; You need to be able to signal whether you want to descend into the PMD or skip the entire part of the tree. This was my skip_pmd() function above. > ... > } > } > > There's no need to have start / end walk function calls. > You've got a start walk function (it's your DECLARE_MM_WALK_FLAGS above). The end walk I agree I think you don't actually need it since struct mm_walk_state contains all the state. Steve