Received: by 2002:ac0:a582:0:0:0:0:0 with SMTP id m2-v6csp3244828imm; Fri, 19 Oct 2018 07:32:37 -0700 (PDT) X-Google-Smtp-Source: ACcGV614dIyi2CESVbRDsqPgqCphUGfTA6WKwFYmgJy1BDF3aJtA8pSglM0y3vHPLL0efI+Uf7B5 X-Received: by 2002:a17:902:8a8c:: with SMTP id p12-v6mr34278877plo.133.1539959557379; Fri, 19 Oct 2018 07:32:37 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1539959557; cv=none; d=google.com; s=arc-20160816; b=uEfWULxniwLU6TddjVY4o/UIK7g1cC+VJoCnXh7Wm7Hda3VuXwbwu6s0I/GT3b8tB0 Bqnw16jc+CTlJzrhD5YZn882i+P0e1BdaKLp7pK+WOHR02cylBYy7F/ZCoNLuA55Hn98 Rw1xXvGK6FKux/uOdqrthcwek/Di0wighdHdEDh4KNeHbtzA7OttCY74DBb1gXWuzx6n f01hVLnsrp1XSIQaEn1Sw3BIZCIvv4BPKtwCQ7FWuROcVuKtvttJqhxGVEglWfux43io r73cXP/0NVMfe1xgZW0jml6h4s37jDqdUlC4/UvlXDOVW2rpnG4L9r/CEB2srAULtltP SXTw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:to:references:message-id :content-transfer-encoding:cc:date:in-reply-to:from:subject :mime-version:dkim-signature; bh=reKol3Kt1XrJOxvg6Nseeg8Juwj5XurAilRn4PiWeHU=; b=1JTlhzoGuPTnEuvJStH9B1rjuUxlBoeBIJ56n+rLrT1m6kfuUlbEQzuhTcTMm1+tOR z7TxHQJCd3BhZsSHuh/BOem1d85bCHgsKWqgojRJeaaX8/ngYDqXP07JX01Jw2zgselV CN7kgfgC2c3a16TUatuocaj/mdkQ1/MaYFPYqCNxOsvUVOtsECiPP0QZTIw128ms/i+3 pr0MEQ4KbzEzP25X5k2TfvQ3M9GTbWrcreOIaHL5k6mAMXfKmQlWwqkD8GvpDBq7+2vG tFGd1fSPBs88JbBuaHoYwk4RQ6pcUiaAv2/mpXFaNI75vef1rVbCBZZ03777aQBT74jo mELQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@amacapital-net.20150623.gappssmtp.com header.s=20150623 header.b=GqUdLUQx; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id j16-v6si24390609pgm.501.2018.10.19.07.32.20; Fri, 19 Oct 2018 07:32:37 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@amacapital-net.20150623.gappssmtp.com header.s=20150623 header.b=GqUdLUQx; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727757AbeJSWgL (ORCPT + 99 others); Fri, 19 Oct 2018 18:36:11 -0400 Received: from mail-pg1-f194.google.com ([209.85.215.194]:39040 "EHLO mail-pg1-f194.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727574AbeJSWgK (ORCPT ); Fri, 19 Oct 2018 18:36:10 -0400 Received: by mail-pg1-f194.google.com with SMTP id r9-v6so15848569pgv.6 for ; Fri, 19 Oct 2018 07:29:48 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amacapital-net.20150623.gappssmtp.com; s=20150623; h=mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=reKol3Kt1XrJOxvg6Nseeg8Juwj5XurAilRn4PiWeHU=; b=GqUdLUQxxvOEDnh0yMkS6Ww+/JldYLx0t4//MQPaln/hOnP7oNxwfzjNM56O/x0Yg7 UCebtr51g2WFohQTSLbOiL/W0QunmjNJ0Sa8n3ngGsHK3WGdPqZJr0kksuoLHDdsFeeJ LTPI9ZPYBn5AMDb2yrk/mFMXG0Z49ZXDnhIOu/Vpdf96c2YPsJilWygpGn1p3gKOq566 oLCWsOYt5vNmevzFO3Hd3D27uZVUDnH6OcJoV6an7th7q6N5T9+w34EVdDJYAV8ixtHW SFRsBMqQwWDy+DwkCZYR/mVZ4TsMSlHNyffLhtdmiyk7fKM69R0ZpfIc7R9hS/fhZHRZ +E4Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=reKol3Kt1XrJOxvg6Nseeg8Juwj5XurAilRn4PiWeHU=; b=H4/Hi8+b1PhkiYHbYd32g58vIfSqFvSZ3PlftjswxRZ46m1baBv+sQO8dSGw0FbKNi sfQrZx9QfCTr61ENg0B1SJL7qsqlVkf9DyuthtwAj33irkLmKsD1GTEhK8Flt/gL6/0k PVMiGecdSj0Ih1E9hWrrsH9e5VE1yNsBuanKyfCibBmEWeHITYHtp5nU9hzqQeyVxNet K2lLcYTHF4Nv7PlufkKkraEZZt1jXeiuZOauJJOEJlmi63azxv7buZUx1VSKk5mxduUl ha9nOIKLEks0QuCMObQbvIsTpj84H90vLnOhzvYcxlUeuKz1r4IGuQScSzRywEAZ9mdv 9OdA== X-Gm-Message-State: ABuFfogpXFnRcZ0Eclmnzkhag8xtCuaF0r/CePRWXjuSIcHuwu0l2npD US6UZfF+U1D4X7NM/O2Nj5HcjkLKrBE= X-Received: by 2002:a62:f715:: with SMTP id h21-v6mr33959748pfi.169.1539959388012; Fri, 19 Oct 2018 07:29:48 -0700 (PDT) Received: from ?IPv6:2601:646:c200:7429:746e:dce6:68de:332d? ([2601:646:c200:7429:746e:dce6:68de:332d]) by smtp.gmail.com with ESMTPSA id t22-v6sm33515563pfk.141.2018.10.19.07.29.46 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Fri, 19 Oct 2018 07:29:46 -0700 (PDT) Content-Type: text/plain; charset=utf-8 Mime-Version: 1.0 (1.0) Subject: Re: [RFC PATCH 1/5] x86: introduce preemption disable prefix From: Andy Lutomirski X-Mailer: iPhone Mail (16A366) In-Reply-To: <20181019083325.GC3121@hirez.programming.kicks-ass.net> Date: Fri, 19 Oct 2018 07:29:45 -0700 Cc: Nadav Amit , Ingo Molnar , Andrew Lutomirski , "H. Peter Anvin" , Thomas Gleixner , LKML , X86 ML , Borislav Petkov , "Woodhouse, David" Content-Transfer-Encoding: quoted-printable Message-Id: References: <20181018005420.82993-1-namit@vmware.com> <20181018005420.82993-2-namit@vmware.com> <07255D2B-0243-4254-B62A-37050C44207E@vmware.com> <925F22EA-F8CB-4194-B96B-378409ED7918@vmware.com> <2626124E-7344-42F3-AD07-0BB34D62A9EE@amacapital.net> <6F1FD9DA-5E86-42A2-8EAF-05F5D70FE2EF@vmware.com> <20181019083325.GC3121@hirez.programming.kicks-ass.net> To: Peter Zijlstra Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org > On Oct 19, 2018, at 1:33 AM, Peter Zijlstra wrote: >=20 >> On Fri, Oct 19, 2018 at 01:08:23AM +0000, Nadav Amit wrote: >> Consider for example do_int3(), and see my inlined comments: >>=20 >> dotraplinkage void notrace do_int3(struct pt_regs *regs, long error_code)= >> { >> ... >> ist_enter(regs); // =3D> preempt_disable() >> cond_local_irq_enable(regs); // =3D> assume it enables IRQs >>=20 >> ... >> // resched irq can be delivered here. It will not caused rescheduling >> // since preemption is disabled >>=20 >> cond_local_irq_disable(regs); // =3D> assume it disables IRQs >> ist_exit(regs); // =3D> preempt_enable_no_resched() >> } >>=20 >> At this point resched will not happen for unbounded length of time (unles= s >> there is another point when exiting the trap handler that checks if >> preemption should take place). >>=20 >> Another example is __BPF_PROG_RUN_ARRAY(), which also uses >> preempt_enable_no_resched(). >>=20 >> Am I missing something? >=20 > Would not the interrupt return then check for TIF_NEED_RESCHED and call > schedule() ? The paranoid exit path doesn=E2=80=99t check TIF_NEED_RESCHED because it=E2=80= =99s fundamentally atomic =E2=80=94 it=E2=80=99s running on a percpu stack a= nd it can=E2=80=99t schedule. In theory we could do some evil stack switchin= g, but we don=E2=80=99t. How does NMI handle this? If an NMI that hit interruptible kernel code over= flows a perf counter, how does the wake up work? (do_int3() is special because it=E2=80=99s not actually IST. But it can hit= in odd places due to kprobes, and I=E2=80=99m nervous about recursing incor= rectly into RCU and context tracking code if we were to use exception_enter(= ).) >=20 > I think (and this certainly wants a comment) is that the ist_exit() > thing hard relies on the interrupt-return path doing the reschedule.