Message-ID: <55B6E5AB.4070301@hitachi.com>
Date: Tue, 28 Jul 2015 11:15:07 +0900
From: Hidehiro Kawai
To: Michal Hocko
CC: Jonathan Corbet, Peter Zijlstra, Ingo Molnar, "Eric W. Biederman",
    "H. Peter Anvin", Andrew Morton, Thomas Gleixner, Vivek Goyal,
    linux-doc@vger.kernel.org, x86@kernel.org, kexec@lists.infradead.org,
    linux-kernel@vger.kernel.org, Masami Hiramatsu
Subject: Re: [V2 PATCH 2/3] kexec: Fix race between panic() and crash_kexec() called directly

Hi,

(2015/07/27 23:55), Michal Hocko wrote:
> On Mon 27-07-15 10:58:50, Hidehiro Kawai wrote:
> [...]
>> @@ -1472,6 +1472,18 @@ void __weak crash_unmap_reserved_pages(void)
>>  
>>  void crash_kexec(struct pt_regs *regs)
>>  {
>> +	int old_cpu, this_cpu;
>> +
>> +	/*
>> +	 * `old_cpu == -1' means we are the first comer and crash_kexec()
>> +	 * was called without entering panic().
>> +	 * `old_cpu == this_cpu' means crash_kexec() was called from panic().
>> +	 */
>> +	this_cpu = raw_smp_processor_id();
>> +	old_cpu = atomic_cmpxchg(&panicking_cpu, -1, this_cpu);
>> +	if (old_cpu != -1 && old_cpu != this_cpu)
>> +		return;
>> +
>>  	/* Take the kexec_mutex here to prevent sys_kexec_load
>>  	 * running on one cpu from replacing the crash kernel
>>  	 * we are using after a panic on a different cpu.
>> @@ -1491,6 +1503,14 @@ void crash_kexec(struct pt_regs *regs)
>>  		}
>>  		mutex_unlock(&kexec_mutex);
>>  	}
>> +
>> +	/*
>> +	 * If we came here from panic(), we have to keep panicking_cpu
>> +	 * to prevent other cpus from entering panic(). Otherwise,
>> +	 * resetting it so that other cpus can enter panic()/crash_kexec().
>> +	 */
>> +	if (old_cpu == this_cpu)
>> +		atomic_set(&panicking_cpu, -1);
>
> This do the opposite what the comment says, wouldn't it? You should
> check old_cpu == -1.

Sorry, you are right. I ran the same tests as for the previous
patch set, but I missed the test case for this new logic.

> Also atomic_set doesn't imply memory barriers which
> might be a problem.

OK, I'll use atomic_xchg().

Regards,
-- 
Hidehiro Kawai
Hitachi, Ltd. Research & Development Group
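
For reference, the two fixes agreed on above (check old_cpu == -1 before
resetting, and reset with an exchange rather than a plain store) can be
modelled in user space roughly as follows. This is only a sketch, not the
actual follow-up patch: it uses C11 <stdatomic.h> in place of the kernel
atomic_t API, and crash_kexec_model(), NO_CPU and the this_cpu parameter
are hypothetical names introduced for illustration. The kexec_mutex and
machine_kexec() section is elided.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

#define NO_CPU (-1)	/* hypothetical name: no CPU owns the crash path */

/* Stand-in for the kernel's panicking_cpu variable. */
static atomic_int panicking_cpu = NO_CPU;

/*
 * User-space model of the corrected crash_kexec() gate.
 * this_cpu stands in for raw_smp_processor_id().
 * Returns true if the caller got past the gate, false if it backed off.
 */
static bool crash_kexec_model(int this_cpu)
{
	int old_cpu = NO_CPU;

	assert(this_cpu >= 0);

	/*
	 * Mirrors atomic_cmpxchg(&panicking_cpu, -1, this_cpu): on failure,
	 * old_cpu is overwritten with the current owner; on success it
	 * stays NO_CPU.
	 */
	atomic_compare_exchange_strong(&panicking_cpu, &old_cpu, this_cpu);
	if (old_cpu != NO_CPU && old_cpu != this_cpu)
		return false;	/* another CPU is already panicking */

	/* ... the crash kernel would be booted here if one is loaded ... */

	/*
	 * Only a direct caller (old_cpu == NO_CPU) releases ownership.
	 * A caller coming from panic() keeps panicking_cpu set. The
	 * exchange is sequentially consistent, standing in for the
	 * fully ordered atomic_xchg().
	 */
	if (old_cpu == NO_CPU)
		atomic_exchange(&panicking_cpu, NO_CPU);
	return true;
}
```

In this model, a direct call on an idle system passes the gate and leaves
panicking_cpu at -1 again, while a call made when another CPU already owns
panicking_cpu returns immediately without touching it.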