Received: by 2002:a05:6a10:a0d1:0:0:0:0 with SMTP id j17csp2021183pxa; Mon, 24 Aug 2020 02:48:46 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxwp2dwuyM0mgFtGIVlIP/IAOtgseZ+jsfLKxwzbZW9w4SkhwgYvM5ZmWyYq0qcgvr1232m X-Received: by 2002:a17:906:68b:: with SMTP id u11mr5000454ejb.143.1598262526244; Mon, 24 Aug 2020 02:48:46 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1598262526; cv=none; d=google.com; s=arc-20160816; b=YY/YRNISPMPMnjQXVgTq8ee2WLfIekBUCwx2Mrwy6g9DWXuH/fiHGjoIQjpI90ElnA c4+SyVHrdVrA+xx9JxlsmE1KIQ1KZpq8HPtK0IbRyyWrbN4V2+lICVlbXriFrhAV6EFD BZLtVBv5CnsEwNktFZk0NZa6dfFYccXQzpSTM/WyvWYYiCpCv33j5ZnQAlsi94QC3AS5 LHre9GgnyRXojF0iXniqOtRm9+ftH+LcYmUEaeUQQlqfQyXLHBuVTh4HpNL2wXAdQ/I/ kxpTa4kOEp7LALOea3h1ZAMlSldKf2Uzj5ArILzparza1pKDnEarYMf8cVjxp80WjAew xJVw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=9im+JpTn+ZwcncIBDrBPGiSEuYkvQT2oRjyNCJQxoPs=; b=gI36I3NCHQuMUM61TdqeCG/PL8tLVxJ4j9+RhuZUGV+qGb/cLI3Yk+bEOrPy/9E7V5 vJnuru1/Zju3WbqR8ZOvROWkwXHASdketDThpJrP0/VDhc+twzAVU7m1e15BzlT7dsbX Msln+vfvbdN2ATDy3h8SaMryp1hIcmj1u0c+zUZjHzl24c0AYh+VhcsQK/HcfzODkz+Z YkWaBQ/3QxerXwVpXifXnHgeZWfoytTVwgK+5CWZOjwfYim6t4K4yVYGw/wUu03GyZmJ YKC6OV+UXRG+gYzxurCueVwOBnj79D0bSHRhLH5HGEtTJTnOfiv5ZmaPaGzbNEqqiFQX xdXA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=virtuozzo.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id c22si6334790edr.146.2020.08.24.02.48.23; Mon, 24 Aug 2020 02:48:46 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=virtuozzo.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728830AbgHXJrw (ORCPT + 99 others); Mon, 24 Aug 2020 05:47:52 -0400 Received: from relay.sw.ru ([185.231.240.75]:58626 "EHLO relay3.sw.ru" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1728409AbgHXIlK (ORCPT ); Mon, 24 Aug 2020 04:41:10 -0400 Received: from [192.168.15.190] by relay3.sw.ru with esmtp (Exim 4.94) (envelope-from ) id 1kA7xd-000xLn-NE; Mon, 24 Aug 2020 11:36:13 +0300 Subject: Re: [PATCH 1/4] mm: Trial do_wp_page() simplification To: Peter Xu , linux-mm@kvack.org, linux-kernel@vger.kernel.org Cc: "Maya B . Gokhale" , Linus Torvalds , Yang Shi , Marty Mcfadden , Kirill Shutemov , Oleg Nesterov , Jann Horn , Jan Kara , Andrea Arcangeli , Christoph Hellwig , Andrew Morton References: <20200821234958.7896-1-peterx@redhat.com> <20200821234958.7896-2-peterx@redhat.com> From: Kirill Tkhai Message-ID: <42bc9a68-ef9e-2542-0b21-392a7f47bd74@virtuozzo.com> Date: Mon, 24 Aug 2020 11:36:22 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.11.0 MIME-Version: 1.0 In-Reply-To: <20200821234958.7896-2-peterx@redhat.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 22.08.2020 02:49, Peter Xu wrote: > From: Linus Torvalds > > How about we just make sure we're the only possible valid user fo the > page before we bother to reuse it? > > Simplify, simplify, simplify. > > And get rid of the nasty serialization on the page lock at the same time. > > Signed-off-by: Linus Torvalds > [peterx: add subject prefix] > Signed-off-by: Peter Xu > --- > mm/memory.c | 59 +++++++++++++++-------------------------------------- > 1 file changed, 17 insertions(+), 42 deletions(-) > > diff --git a/mm/memory.c b/mm/memory.c > index 602f4283122f..cb9006189d22 100644 > --- a/mm/memory.c > +++ b/mm/memory.c > @@ -2927,50 +2927,25 @@ static vm_fault_t do_wp_page(struct vm_fault *vmf) > * not dirty accountable. > */ > if (PageAnon(vmf->page)) { > - int total_map_swapcount; > - if (PageKsm(vmf->page) && (PageSwapCache(vmf->page) || > - page_count(vmf->page) != 1)) > + struct page *page = vmf->page; > + > + /* PageKsm() doesn't necessarily raise the page refcount */ No, this is wrong. PageKSM() always raises refcount. There was another problem: KSM may raise refcount without lock_page(), and only then it takes the lock. See get_ksm_page(GET_KSM_PAGE_NOLOCK) for the details. So, reliable protection against parallel access requires to freeze page counter, which is made in reuse_ksm_page(). > + if (PageKsm(page) || page_count(page) != 1) > + goto copy; > + if (!trylock_page(page)) > + goto copy; > + if (PageKsm(page) || page_mapcount(page) != 1 || page_count(page) != 1) { > + unlock_page(page); > goto copy; > - if (!trylock_page(vmf->page)) { > - get_page(vmf->page); > - pte_unmap_unlock(vmf->pte, vmf->ptl); > - lock_page(vmf->page); > - vmf->pte = pte_offset_map_lock(vma->vm_mm, vmf->pmd, > - vmf->address, &vmf->ptl); > - if (!pte_same(*vmf->pte, vmf->orig_pte)) { > - update_mmu_tlb(vma, vmf->address, vmf->pte); > - unlock_page(vmf->page); > - pte_unmap_unlock(vmf->pte, vmf->ptl); > - put_page(vmf->page); > - return 0; > - } > - put_page(vmf->page); > - } > - if (PageKsm(vmf->page)) { > - bool reused = reuse_ksm_page(vmf->page, vmf->vma, > - vmf->address); > - unlock_page(vmf->page); > - if (!reused) > - goto copy; > - wp_page_reuse(vmf); > - return VM_FAULT_WRITE; > - } > - if (reuse_swap_page(vmf->page, &total_map_swapcount)) { > - if (total_map_swapcount == 1) { > - /* > - * The page is all ours. Move it to > - * our anon_vma so the rmap code will > - * not search our parent or siblings. > - * Protected against the rmap code by > - * the page lock. > - */ > - page_move_anon_rmap(vmf->page, vma); > - } > - unlock_page(vmf->page); > - wp_page_reuse(vmf); > - return VM_FAULT_WRITE; > } > - unlock_page(vmf->page); > + /* > + * Ok, we've got the only map reference, and the only > + * page count reference, and the page is locked, > + * it's dark out, and we're wearing sunglasses. Hit it. > + */ > + wp_page_reuse(vmf); > + unlock_page(page); > + return VM_FAULT_WRITE; > } else if (unlikely((vma->vm_flags & (VM_WRITE|VM_SHARED)) == > (VM_WRITE|VM_SHARED))) { > return wp_page_shared(vmf); >