Date: Mon, 10 Feb 2020 21:05:46 +0300
From: "Kirill A. Shutemov"
To: Matthew Wilcox
Cc: Qian Cai, akpm@linux-foundation.org, elver@google.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH -next] mm/filemap: fix a data race in filemap_fault()
Message-ID: <20200210180546.vt7yhdjav5oinij7@box>
References: <1581354029-20154-1-git-send-email-cai@lca.pw> <20200210172511.GL8731@bombadil.infradead.org>
In-Reply-To: <20200210172511.GL8731@bombadil.infradead.org>

On Mon, Feb 10, 2020 at 09:25:11AM -0800, Matthew Wilcox wrote:
> On Mon, Feb 10, 2020 at 12:00:29PM -0500, Qian Cai wrote:
> > @@ -2622,7 +2622,7 @@ void filemap_map_pages(struct vm_fault *vmf,
> >  		if (page->index >= max_idx)
> >  			goto unlock;
> >  
> > -		if (file->f_ra.mmap_miss > 0)
> > +		if (data_race(file->f_ra.mmap_miss > 0))
> >  			file->f_ra.mmap_miss--;
> 
> How is this safe?  Two threads can each see 1, and then both decrement the
> in-memory copy, causing it to end up at -1.

Right, it is bogus.

Below is my completely untested attempt to fix this. It still allows
races, but they will only lead to missed accounting, not underflow.

diff --git a/mm/filemap.c b/mm/filemap.c
index 1784478270e1..1919d37c646a 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -2365,6 +2365,7 @@ static struct file *do_sync_mmap_readahead(struct vm_fault *vmf)
 	struct address_space *mapping = file->f_mapping;
 	struct file *fpin = NULL;
 	pgoff_t offset = vmf->pgoff;
+	unsigned mmap_miss;
 
 	/* If we don't want any read-ahead, don't bother */
 	if (vmf->vma->vm_flags & VM_RAND_READ)
@@ -2380,14 +2381,15 @@ static struct file *do_sync_mmap_readahead(struct vm_fault *vmf)
 	}
 
 	/* Avoid banging the cache line if not needed */
-	if (ra->mmap_miss < MMAP_LOTSAMISS * 10)
-		ra->mmap_miss++;
+	mmap_miss = READ_ONCE(ra->mmap_miss);
+	if (mmap_miss < MMAP_LOTSAMISS * 10)
+		WRITE_ONCE(ra->mmap_miss, ++mmap_miss);
 
 	/*
 	 * Do we miss much more than hit in this file? If so,
 	 * stop bothering with read-ahead. It will only hurt.
 	 */
-	if (ra->mmap_miss > MMAP_LOTSAMISS)
+	if (mmap_miss > MMAP_LOTSAMISS)
 		return fpin;
 
 	/*
@@ -2413,13 +2415,15 @@ static struct file *do_async_mmap_readahead(struct vm_fault *vmf,
 	struct file_ra_state *ra = &file->f_ra;
 	struct address_space *mapping = file->f_mapping;
 	struct file *fpin = NULL;
+	unsigned int mmap_miss;
 	pgoff_t offset = vmf->pgoff;
 
 	/* If we don't want any read-ahead, don't bother */
 	if (vmf->vma->vm_flags & VM_RAND_READ)
 		return fpin;
-	if (ra->mmap_miss > 0)
-		ra->mmap_miss--;
+	mmap_miss = READ_ONCE(ra->mmap_miss);
+	if (mmap_miss)
+		WRITE_ONCE(ra->mmap_miss, --mmap_miss);
 	if (PageReadahead(page)) {
 		fpin = maybe_unlock_mmap_for_io(vmf, fpin);
 		page_cache_async_readahead(mapping, ra, file,
@@ -2586,7 +2590,9 @@ void filemap_map_pages(struct vm_fault *vmf,
 	unsigned long max_idx;
 	XA_STATE(xas, &mapping->i_pages, start_pgoff);
 	struct page *page;
+	unsigned long mmap_miss;
 
+	mmap_miss = READ_ONCE(file->f_ra.mmap_miss);
 	rcu_read_lock();
 	xas_for_each(&xas, page, end_pgoff) {
 		if (xas_retry(&xas, page))
@@ -2622,8 +2628,8 @@ void filemap_map_pages(struct vm_fault *vmf,
 		if (page->index >= max_idx)
 			goto unlock;
 
-		if (file->f_ra.mmap_miss > 0)
-			file->f_ra.mmap_miss--;
+		if (mmap_miss > 0)
+			mmap_miss--;
 
 		vmf->address += (xas.xa_index - last_pgoff) << PAGE_SHIFT;
 		if (vmf->pte)
@@ -2643,6 +2649,7 @@ void filemap_map_pages(struct vm_fault *vmf,
 			break;
 	}
 	rcu_read_unlock();
+	WRITE_ONCE(file->f_ra.mmap_miss, mmap_miss);
 }
 EXPORT_SYMBOL(filemap_map_pages);
 
-- 
 Kirill A. Shutemov
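
For illustration only, here is a userspace sketch of the difference between
the two patterns discussed above. It replays one possible interleaving of two
threads by hand in a single thread, so it is deterministic. It assumes the
counter is an unsigned int, and read_once()/write_once() are hypothetical
stand-ins for the kernel's READ_ONCE()/WRITE_ONCE(), not the real macros.

```c
#include <assert.h>
#include <limits.h>

/* Hypothetical stand-ins for READ_ONCE()/WRITE_ONCE() on an unsigned int. */
static unsigned int read_once(const unsigned int *p)
{
	return *(const volatile unsigned int *)p;
}

static void write_once(unsigned int *p, unsigned int v)
{
	*(volatile unsigned int *)p = v;
}

/*
 * Old pattern, replayed for two racing threads: both pass the "> 0"
 * check before either decrement lands, then each decrements the
 * shared counter in memory.
 */
static unsigned int racy_two_decrements(unsigned int start)
{
	unsigned int shared = start;
	int a_ok = shared > 0;	/* thread A checks */
	int b_ok = shared > 0;	/* thread B checks */

	if (a_ok)
		shared--;	/* A decrements the in-memory copy */
	if (b_ok)
		shared--;	/* B decrements it again */
	return shared;
}

/*
 * Snapshot pattern: each thread reads the counter once, adjusts its
 * private copy, and stores that copy back.
 */
static unsigned int snapshot_two_decrements(unsigned int start)
{
	unsigned int shared = start;
	unsigned int a = read_once(&shared);	/* thread A snapshot */
	unsigned int b = read_once(&shared);	/* thread B snapshot */

	if (a)
		write_once(&shared, --a);
	if (b)
		write_once(&shared, --b);

	/* The stored value can be stale, but it never wraps below zero. */
	assert(shared <= start);
	return shared;
}
```

Starting from 1, the old pattern wraps the unsigned counter around (the "-1"
case), while the snapshot pattern ends at 0. Starting from a larger value, the
snapshot pattern can lose one of the two decrements, which is the missed
accounting mentioned above.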