Received: by 2002:a05:6a10:a0d1:0:0:0:0 with SMTP id j17csp274531pxa; Wed, 12 Aug 2020 01:23:01 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyDzr4zpi/9MLjT01gNpxzofqqexq2W+/+deXZKP0yjR6JD/HS6fFJVGA6juUL6Dy/6Ljlq X-Received: by 2002:a05:6402:2038:: with SMTP id ay24mr30364729edb.200.1597220581655; Wed, 12 Aug 2020 01:23:01 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1597220581; cv=none; d=google.com; s=arc-20160816; b=bKQhrrqhF0q0YRqwAtLKAiwRKjecZ+XqKsK9ZvBKCf+EjSN+savr6r7OkWP7IYkcg4 XJSfqKptRTpsg3dDbdryQaHAH1P4Roj7v33o5v/5uAQD60ApYRG9URDcBmn6gQltsnGl l+nCe1+oRY5ZllTfit7ZgW5Eto6xsiFyZYhoBx/UDbgxe6R63K4r5l++iMrtbGCE3yTN BpH74IeLolHRkeR/KO7kqxkC5v2nRdtKcz9+57PToNWMQHc9y/PZwPralpWRpQ9F8R4U 54EnSn3glMV7mRGkf8FnoAqwHVuHiJP/GSqx4OYKyfLR/icLmR3IHIvYwrEc2k+7UL+E i4sQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :dkim-signature; bh=JgBMrxFhpXh3xpWdoZXYK4da8LqJW1ikTR27jgesG/k=; b=P97dhDOJgVp4U2yPWt6KG8kfG56+AQw+vJxldEEg24MjPEMagjFFtl62q4P5IW8ehI yBnDMpH8lqggDDBJ54QVs815kSif78SGaOVBowtYgLowzhUquuW+8ZN0tZOcqPWSazVP M+J+BRj0GjAZG+pGQFd4jMSMBWDMELlt2MHa7PdUhgm6K4/Gjg5UWylZ8HTNNpdRwugm 0OZkyeerWoSmTBlODo27GBHPioJI/raW6o6lAXiSu4W8/Mt9u6Wy1IGQ3CY4FRMo3RPq Jjhze889BzXMp/GpbHBD+p3LPkC0iWbrEKg0eJnic7957oMB0WljPuMlwpkXO6rvB6EQ yLtQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@infradead.org header.s=casper.20170209 header.b=kTn2pnqI; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id h14si985033edl.132.2020.08.12.01.22.37; Wed, 12 Aug 2020 01:23:01 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=fail header.i=@infradead.org header.s=casper.20170209 header.b=kTn2pnqI; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726601AbgHLIWB (ORCPT + 99 others); Wed, 12 Aug 2020 04:22:01 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:46472 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726517AbgHLIWB (ORCPT ); Wed, 12 Aug 2020 04:22:01 -0400 Received: from casper.infradead.org (casper.infradead.org [IPv6:2001:8b0:10b:1236::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 9BB40C06174A for ; Wed, 12 Aug 2020 01:22:00 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=infradead.org; s=casper.20170209; h=In-Reply-To:Content-Type:MIME-Version: References:Message-ID:Subject:Cc:To:From:Date:Sender:Reply-To: Content-Transfer-Encoding:Content-ID:Content-Description; bh=JgBMrxFhpXh3xpWdoZXYK4da8LqJW1ikTR27jgesG/k=; b=kTn2pnqInJbU6VFHhlEzEJiFza iNeM2LqgEYKHtAhuLBQ2kSWOSGQz+7gJvBNPsixA/bLKhQMxkIyfRqQkdJFEKGO0BBCPcnytjTI1l 21EQ6PoQdbshQxZB+vcLZicIrvMoGvJ1I/T4lHXqfqLjM32Xrj5BW+bkmvlc2ZzTkCvB4kiZTNm8D Aglia5uM4vHo2WZsz/txGeq/wZkOpZ/5VLg89D1RT9K2V+7fvR6HpuQjl4tnYqiHlqhnK5SyU7jDc wZZh5h5ubs1pb8xS7yAuFEQFVUBKb73IoTtIB4ovsCsnuSR5stZZJF335zM/wrE9Ub9AI8uUxltOX 2AN1pmRg==; Received: from j217100.upc-j.chello.nl ([24.132.217.100] helo=noisy.programming.kicks-ass.net) by casper.infradead.org with esmtpsa (Exim 4.92.3 #3 (Red Hat Linux)) id 1k5lxz-0001nH-73; Wed, 12 Aug 2020 08:18:55 +0000 Received: from hirez.programming.kicks-ass.net (hirez.programming.kicks-ass.net [192.168.1.225]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by noisy.programming.kicks-ass.net (Postfix) with ESMTPS id 7D08D300238; Wed, 12 Aug 2020 10:18:32 +0200 (CEST) Received: by hirez.programming.kicks-ass.net (Postfix, from userid 1000) id 6679022281190; Wed, 12 Aug 2020 10:18:32 +0200 (CEST) Date: Wed, 12 Aug 2020 10:18:32 +0200 From: peterz@infradead.org To: Marco Elver Cc: =?iso-8859-1?Q?J=FCrgen_Gro=DF?= , Borislav Petkov , Dave Hansen , fenghua.yu@intel.com, "H. Peter Anvin" , LKML , Ingo Molnar , syzkaller-bugs , Thomas Gleixner , "Luck, Tony" , the arch/x86 maintainers , yu-cheng.yu@intel.com, sdeep@vmware.com, virtualization@lists.linux-foundation.org, kasan-dev , syzbot , "Paul E. McKenney" , Wei Liu , Steven Rostedt Subject: Re: [PATCH] x86/paravirt: Add missing noinstr to arch_local*() helpers Message-ID: <20200812081832.GK2674@hirez.programming.kicks-ass.net> References: <20200807151903.GA1263469@elver.google.com> <20200811074127.GR3982@worktop.programming.kicks-ass.net> <20200811081205.GV3982@worktop.programming.kicks-ass.net> <07f61573-fef1-e07c-03f2-a415c88dec6f@suse.com> <20200811092054.GB2674@hirez.programming.kicks-ass.net> <20200811094651.GH35926@hirez.programming.kicks-ass.net> <20200811201755.GI35926@hirez.programming.kicks-ass.net> <20200812080650.GA3894595@elver.google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20200812080650.GA3894595@elver.google.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Aug 12, 2020 at 10:06:50AM +0200, Marco Elver wrote: > On Tue, Aug 11, 2020 at 10:17PM +0200, peterz@infradead.org wrote: > > On Tue, Aug 11, 2020 at 11:46:51AM +0200, peterz@infradead.org wrote: > > > > > So let me once again see if I can't find a better solution for this all. > > > Clearly it needs one :/ > > > > So the below boots without triggering the debug code from Marco -- it > > should allow nesting local_irq_save/restore under raw_local_irq_*(). > > > > I tried unconditional counting, but there's some _reallly_ wonky / > > asymmetric code that wrecks that and I've not been able to come up with > > anything useful. > > > > This one starts counting when local_irq_save() finds it didn't disable > > IRQs while lockdep though it did. At that point, local_irq_restore() > > will decrement and enable things again when it reaches 0. > > > > This assumes local_irq_save()/local_irq_restore() are nested sane, which > > is mostly true. > > > > This leaves #PF, which I fixed in these other patches, but I realized it > > needs fixing for all architectures :-( No bright ideas there yet. > > > > --- > > arch/x86/entry/thunk_32.S | 5 ---- > > include/linux/irqflags.h | 45 +++++++++++++++++++------------- > > init/main.c | 16 ++++++++++++ > > kernel/locking/lockdep.c | 58 +++++++++++++++++++++++++++++++++++++++++ > > kernel/trace/trace_preemptirq.c | 33 +++++++++++++++++++++++ > > 5 files changed, 134 insertions(+), 23 deletions(-) > > Testing this again with syzkaller produced some new reports: > > BUG: stack guard page was hit in error_entry > BUG: stack guard page was hit in exc_int3 > PANIC: double fault in error_entry > PANIC: double fault in exc_int3 > > Most of them have corrupted reports, but this one might be useful: > > BUG: stack guard page was hit at 000000001fab0982 (stack is 00000000063f33dc..00000000bf04b0d8) > BUG: stack guard page was hit at 00000000ca97ac69 (stack is 00000000af3e6c84..000000001597e1bf) > kernel stack overflow (double-fault): 0000 [#1] PREEMPT SMP > CPU: 1 PID: 4709 Comm: kworker/1:1H Not tainted 5.8.0+ #5 > Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.13.0-1 04/01/2014 > Workqueue: events_highpri snd_vmidi_output_work > RIP: 0010:exc_int3+0x5/0xf0 arch/x86/kernel/traps.c:636 > Code: c9 85 4d 89 e8 31 c0 e8 a9 7d 68 fd e9 90 fe ff ff e8 0f 35 00 00 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 55 53 48 89 fb 76 0e 00 00 85 c0 74 03 5b 5d c3 f6 83 88 00 00 00 03 74 7e 48 > RSP: 0018:ffffc90008114000 EFLAGS: 00010083 > RAX: 0000000084e00e17 RBX: ffffc90008114018 RCX: ffffffff84e00e17 > RDX: 0000000000000000 RSI: ffffffff84e00a39 RDI: ffffc90008114018 > RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000 > R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 > R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 > FS: 0000000000000000(0000) GS:ffff88807dc80000(0000) knlGS:0000000000000000 > CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > CR2: ffffc90008113ff8 CR3: 000000002dae4006 CR4: 0000000000770ee0 > DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 > PKRU: 00000000 > Call Trace: > asm_exc_int3+0x31/0x40 arch/x86/include/asm/idtentry.h:537 > RIP: 0010:arch_static_branch include/trace/events/preemptirq.h:40 [inline] > RIP: 0010:static_key_false include/linux/jump_label.h:200 [inline] > RIP: 0010:trace_irq_enable_rcuidle+0xd/0x120 include/trace/events/preemptirq.h:40 > Code: 24 08 48 89 df e8 43 8d ef ff 48 89 df 5b e9 4a 2e 99 03 66 2e 0f 1f 84 00 00 00 00 00 55 41 56 53 48 89 fb e8 84 1a fd ff cc <1f> 44 00 00 5b 41 5e 5d c3 65 8b 05 ab 74 c3 7e 89 c0 31 f6 48 0f > RSP: 0018:ffffc900081140f8 EFLAGS: 00000093 > RAX: ffffffff813d9e8c RBX: ffffffff81314dd3 RCX: ffff888076ce6000 > RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffffff81314dd3 > RBP: 0000000000000000 R08: ffffffff813da3d4 R09: 0000000000000001 > R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 > R13: 0000000000000082 R14: 0000000000000000 R15: ffff888076ce6000 > trace_hardirqs_restore+0x59/0x80 kernel/trace/trace_preemptirq.c:106 > rcu_irq_enter_irqson+0x43/0x70 kernel/rcu/tree.c:1074 > trace_irq_enable_rcuidle+0x87/0x120 include/trace/events/preemptirq.h:40 > trace_hardirqs_restore+0x59/0x80 kernel/trace/trace_preemptirq.c:106 > rcu_irq_enter_irqson+0x43/0x70 kernel/rcu/tree.c:1074 > trace_irq_enable_rcuidle+0x87/0x120 include/trace/events/preemptirq.h:40 > trace_hardirqs_restore+0x59/0x80 kernel/trace/trace_preemptirq.c:106 > rcu_irq_enter_irqson+0x43/0x70 kernel/rcu/tree.c:1074 > trace_irq_enable_rcuidle+0x87/0x120 include/trace/events/preemptirq.h:40 > trace_hardirqs_restore+0x59/0x80 kernel/trace/trace_preemptirq.c:106 > rcu_irq_enter_irqson+0x43/0x70 kernel/rcu/tree.c:1074 > > <... repeated many many times ...> > > trace_irq_enable_rcuidle+0x87/0x120 include/trace/events/preemptirq.h:40 > trace_hardirqs_restore+0x59/0x80 kernel/trace/trace_preemptirq.c:106 > rcu_irq_enter_irqson+0x43/0x70 kernel/rcu/tree.c:1074 > Lost 500 message(s)! > BUG: stack guard page was hit at 00000000cab483ba (stack is 00000000b1442365..00000000c26f9ad3) > BUG: stack guard page was hit at 00000000318ff8d8 (stack is 00000000fd87d656..0000000058100136) > ---[ end trace 4157e0bb4a65941a ]--- Wheee... recursion! Let me try and see if I can make something of that.