Received: by 2002:a05:6a10:1d13:0:0:0:0 with SMTP id pp19csp1488294pxb; Fri, 27 Aug 2021 09:56:34 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxwkRZqAazCMkpP3NqErhk6wpi94d4ZCMhI/YZPu1A13OBjIeM/XBCTb0/YrP/s3DaQu+wx X-Received: by 2002:a17:906:c346:: with SMTP id ci6mr10734141ejb.479.1630083394351; Fri, 27 Aug 2021 09:56:34 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1630083394; cv=none; d=google.com; s=arc-20160816; b=T6lxMAKs1uSm8UnsfKJ+cSXboJNeJF4vbOR9kltfe5Oxh6PAWjKOdeM2IzTuOPCt/O IXbeuKWdC1AHOxzUFfMmfWpNtonw7D8QvCGCQwhNwioluhZcKP5YcL7EEyBfyzT6+arH ZD//6R7c6x8jsITnaUe/cILeLLZD9/nG/RDxWaA9eRhttWDpa/Te38vM9bHDE5IZtg15 8pX/MfFlZHWc6rVR0GpgoxNWFOuC028fvQZj6TTeVEEPqr2aKZCScmace15s/iEMsBz4 0Meb4uk5mxQPiG+51bnx0PBRFlRtTqjeXm89PuTp6y10O2AON5OAl+tX03Lc0s7u6pm6 0Fkg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=nm5f4TdykV0Ij8N80OXzW6n74qiksoV2Mz6T3bNcU4c=; b=wW4gfeQbgHPIrQ4SXEcLFVtTP90v97eSHS6382nwpbAJb53BbAyuz2ry5TLvSEoJY7 kL1nZbJqsiIsEEdyUIhtW821D1WGRSVn7rZ5JA11yVu/3OQQzcrJzGc9BOn+59zHtYEB HRSHPaAokVD8BEsqXWcpWJcU+OrRTgOYT0kfB/utUKOPdQQA9NWZlHFeIklbbvN6zONe aoP3uGZLrDxhxNVbK0cQUn9EdjU3B99F3ZoHkm9akIX4rPBkY/XoCdywBzrR666VvklR QboI4yR7H6pUYkrdjwC1PEDGO1P6ou4NogYkdSHdhn4d7RS0jC0A89hiv6XEOnbHkC1u jOdw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ziepe.ca header.s=google header.b=WpMImxTP; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id kl9si6499932ejc.603.2021.08.27.09.56.11; Fri, 27 Aug 2021 09:56:34 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@ziepe.ca header.s=google header.b=WpMImxTP; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S239225AbhH0Qwy (ORCPT + 99 others); Fri, 27 Aug 2021 12:52:54 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:53582 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S239681AbhH0Qwv (ORCPT ); Fri, 27 Aug 2021 12:52:51 -0400 Received: from mail-qt1-x82d.google.com (mail-qt1-x82d.google.com [IPv6:2607:f8b0:4864:20::82d]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 54DBAC0617AE for ; Fri, 27 Aug 2021 09:52:02 -0700 (PDT) Received: by mail-qt1-x82d.google.com with SMTP id s15so2452451qta.10 for ; Fri, 27 Aug 2021 09:52:02 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ziepe.ca; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=nm5f4TdykV0Ij8N80OXzW6n74qiksoV2Mz6T3bNcU4c=; b=WpMImxTPH0oqNigcdNaPe3TalPVZxGNIypXpTMh6+4KoJHhi7xXR01+ileZaSM8UFm nmXdWe5LE6PVPClzBloI1zRgpV4Hf5SkcnnUgCWsOKNpVVNkksUvD2CJCDBOqD83sP+l IdlmSCqaJKHANuvGz6mBMIO0QO2jKhAlfZKNnxwmO8AwFT/FFU77A/jOeD7UAFa0V/bw KS6DF4fU86u9GkYKn+t6MFbRPVpjSv9+HPL7u29z9jbikqjShfG42wBcgnDvW/CUNkq7 rA3ngjetilfeeJK5lyk6Gb/wjfQjeOiLIcOw2FJ4xHexazEAp7Bo1SMmrzHwpMSzHNLW u1zw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=nm5f4TdykV0Ij8N80OXzW6n74qiksoV2Mz6T3bNcU4c=; b=hXGlQowtE9WOOlSODiaoHCK33rfLfnHd19DE48j/1yZDNTUkYcG0gDmWoCtWpocMUU HUhNxUq4v/Lz87DWQmOk6aR83vrGTPfjhrMkClE8rypS0wcUmAEKqXufrcaZr35DHhnX 6lXFyjkH5AmvWzmmhGFGYbz7O73y4J+3OLzEzdRDShzJIzayuY9u7XGyRhwVFpMJf5qa G7dOZhdviEqsvuYEK0Gi/ij1hLMMfnhg1iAc+jIIyuGtQFdg7h8o2fwcn7tdoLm8vN6f 1qsT76o4jajX5WbLQ6ghBuOUaAAB4tkpwNobFM3NHa/vEjHyNldsB2qtLu5DQnJJnAj6 eR6Q== X-Gm-Message-State: AOAM532UYO3jYEx0ypZ8LHkI14n5CvgLhxK+WURy8ie/bAD7gOUicpbJ Tkg6vYi7Fv3yASGBLwMC0GOD2w== X-Received: by 2002:ac8:7c44:: with SMTP id o4mr9135892qtv.82.1630083121426; Fri, 27 Aug 2021 09:52:01 -0700 (PDT) Received: from ziepe.ca (hlfxns017vw-142-162-113-129.dhcp-dynamic.fibreop.ns.bellaliant.net. [142.162.113.129]) by smtp.gmail.com with ESMTPSA id x3sm4855078qkx.62.2021.08.27.09.52.00 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 27 Aug 2021 09:52:00 -0700 (PDT) Received: from jgg by mlx with local (Exim 4.94) (envelope-from ) id 1mJf5E-005kOn-4S; Fri, 27 Aug 2021 13:52:00 -0300 Date: Fri, 27 Aug 2021 13:52:00 -0300 From: Jason Gunthorpe To: Dan Williams Cc: "Li, Zhijian" , "lizhijian@fujitsu.com" , "nvdimm@lists.linux.dev" , Yishai Hadas , "linux-rdma@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "yangx.jy@fujitsu.com" Subject: Re: RDMA/rpma + fsdax(ext4) was broken since 36f30e486d Message-ID: <20210827165200.GM1200268@ziepe.ca> References: <8b2514bb-1d4b-48bb-a666-85e6804fbac0@cn.fujitsu.com> <68169bc5-075f-8260-eedc-80fdf4b0accd@cn.fujitsu.com> <20210806014559.GM543798@ziepe.ca> <10c4bead-c778-8794-f916-80bf7ba3a56b@fujitsu.com> <20210827121034.GG1200268@ziepe.ca> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Aug 27, 2021 at 09:42:21AM -0700, Dan Williams wrote: > On Fri, Aug 27, 2021 at 6:05 AM Li, Zhijian wrote: > > > > > > on 2021/8/27 20:10, Jason Gunthorpe wrote: > > > On Fri, Aug 27, 2021 at 08:15:40AM +0000, lizhijian@fujitsu.com wrote: > > >> i looked over the change-log of hmm_vma_handle_pte(), and found that before > > >> 4055062 ("mm/hmm: add missing call to hmm_pte_need_fault in HMM_PFN_SPECIAL handling") > > >> > > >> hmm_vma_handle_pte() will not check pte_special(pte) if pte_devmap(pte) is true. > > >> > > >> when we reached > > >> "if (pte_special(pte) && !is_zero_pfn(pte_pfn(pte))) {" > > >> the pte have already presented and its pte's flag already fulfilled the request flags. > > >> > > >> > > >> My question is that > > >> Per https://01.org/blogs/dave/2020/linux-consumption-x86-page-table-bits, > > >> pte_devmap(pte) and pte_special(pte) could be both true in fsdax user case, right ? > > > How? what code creates that? > > > > > > I see: > > > > > > insert_pfn(): > > > /* Ok, finally just insert the thing.. */ > > > if (pfn_t_devmap(pfn)) > > > entry = pte_mkdevmap(pfn_t_pte(pfn, prot)); > > > else > > > entry = pte_mkspecial(pfn_t_pte(pfn, prot)); > > > > > > So what code path ends up setting both bits? > > > > pte_mkdevmap() will set both _PAGE_SPECIAL | PAGE_DEVMAP > > > > 395 static inline pte_t pte_mkdevmap(pte_t pte) > > 396 { > > 397 return pte_set_flags(pte, _PAGE_SPECIAL|_PAGE_DEVMAP); > > 398 } > > I can't recall why _PAGE_SPECIAL is there. I'll take a look, but I > think setting _PAGE_SPECIAL in pte_mkdevmap() is overkill. This is my feeling too, but every arch does it, so hmm should check it, at least for now as a stable fix devmap has a struct page so it should be refcounted inside the VMA and that is the main thing that PAGE_SPECIAL disabled, AFAICR.. The only places where pte_special are used that I wonder if are OK for devmap have to do with CPU cache maintenance vm_normal_page(), hmm_vma_handle_pte(), gup_pte_range() all look OK to drop the special bit Jason