Received: by 2002:a25:e7d8:0:0:0:0:0 with SMTP id e207csp197444ybh; Mon, 9 Mar 2020 19:33:47 -0700 (PDT) X-Google-Smtp-Source: ADFU+vvjLYKfZzounTWLLav52+Ye4swKtP0Ddclr++7tJU4HOF//nM/9hXvYR2ES267RCe475XWD X-Received: by 2002:a05:6830:1447:: with SMTP id w7mr1903748otp.267.1583807627851; Mon, 09 Mar 2020 19:33:47 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1583807627; cv=none; d=google.com; s=arc-20160816; b=hhJ1JDVZNga7svngVl0f9tGJ7H4iNJZ9KYhHmnFwkcBNZgl9B3FtX+h367n/IflYym b/HbpFY6uTBakwd5bxH8ZeSY/kpqt8QDCbWf0Kov6vJOi0tBXApXFWLCPj1a/JgKnUxI rjnv6H/iRRIBKWabwlGC7VrORqpUVfsYloIhJbhjat/q66bfV3lmnGF3I/pKQ7ewKzNe iUZrUISm/WaqFPZWL4h7mjoLXFalGuZNawS0ESLeKgMPLPG7GQ0rv1YgrhVcrojElCRN ynAph+sg7cfgwFqYKxczoD4azyShMkRvD/is/vkZbeE6wt1hfUopZuhcKYHnUVODRfQ/ Ctmw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=3y84yqZRugayEnPuj4WmyCRbsqiJlfr9qmjJyOyF0wE=; b=oZI/EVkJSsEq99d0ashlmWybmWG8F/gtW0AsQiKVq1Ybh/6pXdNXqbtB9DlfJL1ynx RdFmN8hSA+MBv77QsxgzqRPEtt9NRM3CpUD3G9vQ3uB1DPu0p0krPYI7xRGZapy+r7ej dGf3snl6Esx5uy1V45rJTo7mStlKSGyCd6Edy+RjZE4TOuWS3XtomMrKTuxH71jFJFbH R+Q6tc1KpslHZ9VwBlYuu7jOQ1AkfIsQO96ttG4DmH9X/7m87AUj4v7eUa1u1CYA1PwK 2FYeCwB9+n6pvuyTJ3AUFPPtIJhVScoScINbQJ6sHL4J45NyK5GeewQJ+FMeWKc6UOn2 PYWA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w29si2083585oth.24.2020.03.09.19.33.36; Mon, 09 Mar 2020 19:33:47 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726703AbgCJCcD (ORCPT + 99 others); Mon, 9 Mar 2020 22:32:03 -0400 Received: from foss.arm.com ([217.140.110.172]:59254 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726464AbgCJCcD (ORCPT ); Mon, 9 Mar 2020 22:32:03 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id C538730E; Mon, 9 Mar 2020 19:32:02 -0700 (PDT) Received: from [10.163.1.203] (unknown [10.163.1.203]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id E46843F67D; Mon, 9 Mar 2020 19:31:53 -0700 (PDT) Subject: Re: [PATCH V15] mm/debug: Add tests validating architecture page table helpers To: Christophe Leroy , Qian Cai Cc: linux-mm@kvack.org, Andrew Morton , Mike Rapoport , Vineet Gupta , Catalin Marinas , Will Deacon , Benjamin Herrenschmidt , Paul Mackerras , Michael Ellerman , Heiko Carstens , Vasily Gorbik , Christian Borntraeger , Thomas Gleixner , Ingo Molnar , Borislav Petkov , "H. Peter Anvin" , "Kirill A . Shutemov" , Paul Walmsley , Palmer Dabbelt , linux-snps-arc@lists.infradead.org, linux-arm-kernel@lists.infradead.org, linuxppc-dev@lists.ozlabs.org, linux-s390@vger.kernel.org, linux-riscv@lists.infradead.org, x86@kernel.org, linux-arch@vger.kernel.org, linux-kernel@vger.kernel.org References: <61250cdc-f80b-2e50-5168-2ec67ec6f1e6@arm.com> From: Anshuman Khandual Message-ID: <2d950d8c-4b23-741e-591f-e22e857c0755@arm.com> Date: Tue, 10 Mar 2020 08:01:51 +0530 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 03/07/2020 12:35 PM, Christophe Leroy wrote: > > > Le 07/03/2020 à 01:56, Anshuman Khandual a écrit : >> >> >> On 03/07/2020 06:04 AM, Qian Cai wrote: >>> >>> >>>> On Mar 6, 2020, at 7:03 PM, Anshuman Khandual wrote: >>>> >>>> Hmm, set_pte_at() function is not preferred here for these tests. The idea >>>> is to avoid or atleast minimize TLB/cache flushes triggered from these sort >>>> of 'static' tests. set_pte_at() is platform provided and could/might trigger >>>> these flushes or some other platform specific synchronization stuff. Just >>> >>> Why is that important for this debugging option? >> >> Primarily reason is to avoid TLB/cache flush instructions on the system >> during these tests that only involve transforming different page table >> level entries through helpers. Unless really necessary, why should it >> emit any TLB/cache flush instructions ? > > What's the problem with thoses flushes ? > >> >>> >>>> wondering is there specific reason with respect to the soft lock up problem >>>> making it necessary to use set_pte_at() rather than a simple WRITE_ONCE() ? >>> >>> Looks at the s390 version of set_pte_at(), it has this comment, >>> vmaddr); >>> >>> /* >>>   * Certain architectures need to do special things when PTEs >>>   * within a page table are directly modified.  Thus, the following >>>   * hook is made available. >>>   */ >>> >>> I can only guess that powerpc  could be the same here. >> >> This comment is present in multiple platforms while defining set_pte_at(). >> Is not 'barrier()' here alone good enough ? Else what exactly set_pte_at() >> does as compared to WRITE_ONCE() that avoids the soft lock up, just trying >> to understand. >> > > > Argh ! I didn't realise that you were writing directly into the page tables. When it works, that's only by chance I guess. > > To properly set the page table entries, set_pte_at() has to be used: > - On powerpc 8xx, with 16k pages, the page table entry must be copied four times. set_pte_at() does it, WRITE_ONCE() doesn't. > - On powerpc book3s/32 (hash MMU), the flag _PAGE_HASHPTE must be preserved among writes. set_pte_at() preserves it, WRITE_ONCE() doesn't. > > set_pte_at() also does a few other mandatory things, like calling pte_mkpte() > > So, the WRITE_ONCE() must definitely become a set_pte_at() Sure, will do. These are part of the clear tests that populates a given entry with a non zero value before clearing and testing it with pxx_none(). In that context, WRITE_ONCE() seemed sufficient. But pte_clear() might be closely tied with proper page table entry update and hence a preceding set_pte_at() will be better. There are still more WRITE_ONCE() for other page table levels during these clear tests. set_pmd_at() and set_pud_at() are defined on platforms that support (and enable) THP and PUD based THP respectively. Hence they could not be used for clear tests as remaining helpers pmd_clear(), pud_clear(), p4d_clear() and pgd_clear() still need to be validated with or without THP support and enablement. We should just leave all other WRITE_ONCE() instances unchanged. Please correct me if I am missing something here. > > Christophe >