Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp539602iob; Wed, 4 May 2022 02:33:21 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyDe9opBLnaLsFSxhLBgfz0CQndi7xhSdBDq0rQv+BQQTdeJqWrSA9YiPJaTrWMZv0b3DAL X-Received: by 2002:a17:906:c113:b0:6d7:7b53:9cb with SMTP id do19-20020a170906c11300b006d77b5309cbmr20083769ejc.197.1651656800967; Wed, 04 May 2022 02:33:20 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1651656800; cv=none; d=google.com; s=arc-20160816; b=VeQXeSPQmm7r3+pTQYrk/hMfUQYwnUPQhHzZY1R+jsgRAm6riUr+/yDpE0Ap/vpqjF ur0A0uGaGzm169TNVOQw4KFnlMSTOvF0G6ANlnpoxiwpr6ixMficjVAzgRwdiKWV/r5b hBhLFgyv0ZYs1Lgs4Svr2iXwEjmS0peSHO+uWE+4uSMm9+AwCnrkG4JB4PEZSAs7LW2v YV5JTZ7aPq16SqS0gj1Ju3pr+xhggzSCScBfYqwdwUHWo+LhMWbrjWEybBK451gRQyji ztlI0uyjPxveSjeAiteytjIl5/fVxgFKHq+1GhVnQSbQqqytQGFmBtRCWWQWAR9KRWun HUyw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=TZeM2sK6VsFpvnvIUwVCoNa4r1ezlmI7cS8ZgepmC84=; b=DBm/HzE0f/FAXBCm6cEFue8q0LC5RGbcMIKf4hXX3awAeqINTxuTN7h/LZ139Pp4h0 07vq0KZZnZ/WoGjLyoN9rSPHo31vUEE1PKoXaJ2yredn82YJIUD07A7X/6qAurmg65XC HXZs6/zJgq7hmfxBldD8nzVmgzvSkz2UT4VhGuYYLiEVEBM7Iit51PFqfLF49WZ5G7gc YjgWI/BKLa0SaZ77YVLhbx0Jvg10bV8wT31jY2O+E6DqUgrQ/LG6RpcXzelR2SwUjae7 Qv2ShTeYTBXB2VkegQOrCj7kG7uFvxCJjEDWEdEyJ8bgLQANX2zo6kvHMjm8Brx1Sfkr THrQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=hSNZbMrT; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id cf14-20020a0564020b8e00b0042374f2973bsi14002760edb.193.2022.05.04.02.32.57; Wed, 04 May 2022 02:33:20 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=hSNZbMrT; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1346256AbiEDIYb (ORCPT + 99 others); Wed, 4 May 2022 04:24:31 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:53524 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S240351AbiEDIY1 (ORCPT ); Wed, 4 May 2022 04:24:27 -0400 Received: from mail-vk1-xa31.google.com (mail-vk1-xa31.google.com [IPv6:2607:f8b0:4864:20::a31]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 15EF022B1C for ; Wed, 4 May 2022 01:20:52 -0700 (PDT) Received: by mail-vk1-xa31.google.com with SMTP id b81so273442vkf.1 for ; Wed, 04 May 2022 01:20:52 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=TZeM2sK6VsFpvnvIUwVCoNa4r1ezlmI7cS8ZgepmC84=; b=hSNZbMrTMFievu5D9MtMUJMwIDI8+/jQVZxuuo+3f9e8nR41TGIc9FzDpRb0hjzBGP j/wAcoRO8SZrWabKfZaBkxYlA+SBRBfWlcNA/+0IS22029aOTvGHT2fbzcWKT6JmCxLp bshEvOmNiHDkAvBrvx+BeBhcD5dIg0n4uHm1gkBk46Ecb28i63c/bTiohrKuudKH69n2 OlV8xGhL61ydC5BDlhXlY4a14uQQsX0E0LQk3kLnRPEifpJ/8e0AOmXR/8bfTOS58bJx 9feiiopTnMypFzKGpnqI2UQ7dy/RjbRnMqqhrD3FPhV5xWa0NKIOfymwWeYYDhGQrAFY QMBw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=TZeM2sK6VsFpvnvIUwVCoNa4r1ezlmI7cS8ZgepmC84=; b=36RH/RSSfAw4bQwCI+V8mhd9Pae+/F0vy4E/pI1w3UgoZ5AJ4aFFDIdpNjoxlQe1MB JEUA76Aj4iGk5QK3WLV3AhSjJaoSSM/ig5K/uOz9yt700LIUMOfVKIpHgsudr6bHi2r/ l4NWNA3k5HfT47ofvOOqBXix0K1ST8kteb9fZ6I2IsjNUcaOnkJ1OiBpwxbb+vm7lw5d YHggfx5gbC9Q+6hau786lUOpgFxcYuj5h7leDkjP7eOPd2pHlYmlvcpxqRV+LqQpRVYG XOmbcmvIkoOxd13v2gjrNk90Ou7DVHWQi0ZjzxCpHkmTvedpxT83BlzAWP4aeA9gFgOb b7Pg== X-Gm-Message-State: AOAM532Fio5MzWLCpKSDQ19JbQnbroDIEHVzoiSWpTYAkXI4lzV1EY4R cHzgFIKVwZ4Y1tpEl/NLxn+0PHR8b9xzMQ9aPNM= X-Received: by 2002:a1f:a146:0:b0:34e:1514:c124 with SMTP id k67-20020a1fa146000000b0034e1514c124mr5732956vke.33.1651652450400; Wed, 04 May 2022 01:20:50 -0700 (PDT) MIME-Version: 1.0 References: <20220503155913.GA1187610@paulmck-ThinkPad-P17-Gen-1> <20220503163905.GM1790663@paulmck-ThinkPad-P17-Gen-1> In-Reply-To: From: Michel Lespinasse Date: Wed, 4 May 2022 01:20:39 -0700 Message-ID: Subject: Re: Memory allocation on speculative fastpaths To: Matthew Wilcox Cc: "Paul E. McKenney" , Michal Hocko , Liam Howlett , hannes@cmpxchg.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, Davidlohr Bueso , David Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org (for context, this came up during a discussion of speculative page faults implementation details) On Tue, May 3, 2022 at 11:28 AM Matthew Wilcox wrote: > Johannes (I think it was?) made the point to me that if we have another > task very slowly freeing memory, a task in this path can take advantage > of that other task's hard work and never go into reclaim. So the > approach we should take is: > > p4d_alloc(GFP_NOWAIT | __GFP_NOMEMALLOC | __GFP_NOWARN); > pud_alloc(GFP_NOWAIT | __GFP_NOMEMALLOC | __GFP_NOWARN); > pmd_alloc(GFP_NOWAIT | __GFP_NOMEMALLOC | __GFP_NOWARN); > > if (failure) { > rcu_read_unlock(); > do_reclaim(); > return FAULT_FLAG_RETRY; > } I don't think this works. The problem with allocating page tables is not just that it may break an rcu-locked code section; you also need the code inserting the new page tables into the mm's page table tree to synchronize with any munmap() that may be concurrently running. RCU isn't sufficient here, and we would need a proper lock when wiring new page tables (current code relies on mmap lock for this). > ... but all this is now moot since the approach we agreed to yesterday > is: > > rcu_read_lock(); > vma = vma_lookup(); > if (down_read_trylock(&vma->sem)) { > rcu_read_unlock(); > } else { > rcu_read_unlock(); > mmap_read_lock(mm); > vma = vma_lookup(); > down_read(&vma->sem); > } > > ... and we then execute the page table allocation under the protection of > the vma->sem. > > At least, that's what I think we agreed to yesterday. I don't remember discussing any of this yesterday. As I remember it, the discussion was about having one large RCU section vs several small ones linked by sequence count checks to verify the validity of the vma at the start of each RCU section.