Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751377AbaAMD0N (ORCPT ); Sun, 12 Jan 2014 22:26:13 -0500 Received: from mail.linuxfoundation.org ([140.211.169.12]:45136 "EHLO mail.linuxfoundation.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751110AbaAMD0J (ORCPT ); Sun, 12 Jan 2014 22:26:09 -0500 Date: Sun, 12 Jan 2014 19:27:44 -0800 From: Andrew Morton To: Weijie Yang Cc: Weijie Yang , linux-kernel , Linux-MM , Hugh Dickins , Minchan Kim , Shaohua Li , Bob Liu , stable@vger.kernel.org, Krzysztof Kozlowski Subject: Re: [PATCH] mm/swap: fix race on swap_info reuse between swapoff and swapon Message-Id: <20140112192744.9bca5c6d.akpm@linux-foundation.org> In-Reply-To: References: <000001cf0cfd$6d251640$476f42c0$%yang@samsung.com> <20140110171108.32b2be171cd5e54bf22fb2a4@linux-foundation.org> X-Mailer: Sylpheed 2.7.1 (GTK+ 2.18.9; x86_64-redhat-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 13 Jan 2014 11:08:58 +0800 Weijie Yang wrote: > >> --- a/mm/swapfile.c > >> +++ b/mm/swapfile.c > >> @@ -1922,7 +1922,6 @@ SYSCALL_DEFINE1(swapoff, const char __user *, specialfile) > >> p->swap_map = NULL; > >> cluster_info = p->cluster_info; > >> p->cluster_info = NULL; > >> - p->flags = 0; > >> frontswap_map = frontswap_map_get(p); > >> spin_unlock(&p->lock); > >> spin_unlock(&swap_lock); > >> @@ -1948,6 +1947,16 @@ SYSCALL_DEFINE1(swapoff, const char __user *, specialfile) > >> mutex_unlock(&inode->i_mutex); > >> } > >> filp_close(swap_file, NULL); > >> + > >> + /* > >> + * clear SWP_USED flag after all resources freed > >> + * so that swapon can reuse this swap_info in alloc_swap_info() safely > >> + * it is ok to not hold p->lock after we cleared its SWP_WRITEOK > >> + */ > >> + spin_lock(&swap_lock); > >> + p->flags = 0; > >> + spin_unlock(&swap_lock); > >> + > >> err = 0; > >> atomic_inc(&proc_poll_event); > >> wake_up_interruptible(&proc_poll_wait); > > > > I didn't look too closely, but this patch might also address the race > > which Krzysztof addressed with > > http://ozlabs.org/~akpm/mmots/broken-out/swap-fix-setting-page_size-blocksize-during-swapoff-swapon-race.patch. > > Can we please check that out? > > > > I do prefer fixing all these swapon-vs-swapoff races with some large, > > simple, wide-scope exclusion scheme. Perhaps SWP_USED is that scheme. > > > > An alternative would be to add another mutex and just make sys_swapon() > > and sys_swapoff() 100% exclusive. But that is plastering yet another > > lock over this mess to hide the horrors which lurk within :( > > > > Hi, Andrew. Thanks for your suggestion. > > I checked Krzysztof's patch, it use the global swapon_mutex to protect > race condition among > swapon, swapoff and swap_start(). It is a kind of correct method, but > a heavy method. But do you agree that your http://ozlabs.org/~akpm/mmots/broken-out/mm-swap-fix-race-on-swap_info-reuse-between-swapoff-and-swapon.patch makes Krzysztof's http://ozlabs.org/~akpm/mmots/broken-out/swap-fix-setting-page_size-blocksize-during-swapoff-swapon-race.patch obsolete? I've been sitting on Krzysztof's swap-fix-setting-page_size-blocksize-during-swapoff-swapon-race.patch for several months - Hugh had issues with it so I put it on hold and nothing further happened. > I will try to resend a patchset to make lock usage in swapfile.c clear > and fine grit OK, thanks. In the meanwhile I'm planning on dropping Krzysztof's patch and merging your patch into 3.14-rc1, which is why I'd like confirmation that your patch addresses the issues which Krzysztof identified? -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/