Received: by 10.223.176.5 with SMTP id f5csp1898437wra; Wed, 31 Jan 2018 13:18:57 -0800 (PST) X-Google-Smtp-Source: AH8x224jjJ1OOH5ogVLUGmSAC/r6oxDY29lYbhHGZEJiFVXF0JwrVZT1ivoMnaphAmH9UukjxR03 X-Received: by 2002:a17:902:1683:: with SMTP id h3-v6mr28847156plh.433.1517433537648; Wed, 31 Jan 2018 13:18:57 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1517433537; cv=none; d=google.com; s=arc-20160816; b=w58ugIlDXn8NUzfKILd3lPqfJDY60FaFFWH8Pzl4JnoK3hlgTo9hsNSZtohXkrXdF5 maQwRi/we0YIqDEoX9yWs2OX1NlH1aYkGJyriDUzuWG3ORe78xOWZkmOUmKwrKhCBnJG WJvnsUjqbKJRJzh5Z+bvMblXV1c6Wgxv2zYn31OUStEjhc4TEeO7z8UMMLzKhxCercAW b5Tg2GbP4xog7h3Ja2kOnVbC+zjcM0148y1UOhJ0BSVQ2mUTenfhqjSJBtvYjvlKOupW MhAKm07QIPKLIe731F8wAnBPjpO02Y6bj+biNacIv1O0ceFd+ZvX7MMTJfKYf/f1sQ9w PQUg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:to:references:message-id :content-transfer-encoding:cc:date:in-reply-to:from:subject :mime-version:dkim-signature:arc-authentication-results; bh=EPf9OvlKFHKRJWtrJrZsMKlXaJ7H+XdMVVDo0MWKszA=; b=IuC6LSSFiZ5jmZJN2x9HU9RInqEryOLBS6p5sURHh4b4mBOYHDEaqPT/mvWAoIP7DB mFaOFGHKAbggt+BOPcBq8BHbeAcIpxo0FF9KxDxpPY/GNDRfIJkCBNqQtfsvE+ChQ5Y1 SWP4xbe8i9F10JAfZNL68+WwzWsMFsTpYfQAO2gvtAtIZfNA85gjB92D9x01h1uyeP83 8XVqulj9zvSqEYhQaVGPy+IyAzBLKRZuRoYVJfZjwB3aCEq1jhxlz349bVapHsln0/TE hOBDkRxM2V7V/24Hd2gdhdYCRoRfgSPmDKCw5jvJfXiUwQtyNqiugxBFgmRowi9MmCM/ ww0g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=rwFSlD3M; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 31-v6si2710727plz.201.2018.01.31.13.18.42; Wed, 31 Jan 2018 13:18:57 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=rwFSlD3M; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753212AbeAaVRl (ORCPT + 99 others); Wed, 31 Jan 2018 16:17:41 -0500 Received: from mail-pf0-f180.google.com ([209.85.192.180]:36020 "EHLO mail-pf0-f180.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753131AbeAaVRk (ORCPT ); Wed, 31 Jan 2018 16:17:40 -0500 Received: by mail-pf0-f180.google.com with SMTP id 23so13969912pfp.3 for ; Wed, 31 Jan 2018 13:17:39 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=EPf9OvlKFHKRJWtrJrZsMKlXaJ7H+XdMVVDo0MWKszA=; b=rwFSlD3Muld2i8HYi/bJjd6QL1ChKTX/1Ts86UY+tCentnVJhdtoaWZscx+ckjSATT UUgh4VnD43TwQfezsWfIIWti4f9DRmbUsi1nKhxbLHkbX536v1f8weOa96T3EX11iQft GR06mTRYsTbWYaRL0K50Viu2ZKyAVvlNF86n2MyT2M65wg6mFUE2RmO4t9z8KoRIpa/0 frGouzTWxCZAyFqySAlRv7i3eXye1BwzlgfZbd6FEDWMao2Y1L9DOHsQRFsN8TTCAwFH WHx5gTbGCX+NayFVVyhmmX554/x22wE/0NsvBK/+/E2e0TiB31TY0NrtfLVNgtXAGgVo qp6w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=EPf9OvlKFHKRJWtrJrZsMKlXaJ7H+XdMVVDo0MWKszA=; b=UBnYErRvpLpTvbgK0eVjZ2TbylNQ4IyysLUNY4O9RPhTldnICsjnBr0Mkc7hn6LrmP 5wHGWEogQ1pu1nAIOBou3I5Da78NHLj1lHCFFfmfzQjN6NvhIrVFniJ7YoRha7qhPojc NS+1I559WSHSd6Qqzj3Y2GMQh56DvZ8BTdXPFqE1Z5osAe24IwWFRFAgBRo0KsoK8EG9 B+7omItP/KwRZVJr1JLS0lKUQV4P+4lTVJUJ2OC03YancoXG9/6MWTtv2FtbiUI26vGJ hhqBC3DQdvV8UHPfRSse2JjWVwyZaRUbu3UM876lt8jRWwDAiZh+2yuC96iri5TLfn9A 5ouw== X-Gm-Message-State: AKwxytfmOiDpwHLBYF3F8w2zHkInvXR6ACzG3KUpyrFihyZGYtegjZAz JjGAA9r8SMav74baLgIxuN0= X-Received: by 10.98.80.20 with SMTP id e20mr34548859pfb.148.1517433459162; Wed, 31 Jan 2018 13:17:39 -0800 (PST) Received: from [10.2.101.129] ([208.91.2.2]) by smtp.gmail.com with ESMTPSA id j13sm40299779pfk.112.2018.01.31.13.17.37 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 31 Jan 2018 13:17:38 -0800 (PST) Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Mac OS X Mail 10.3 \(3273\)) Subject: Re: [PATCH] x86: Align TLB invalidation info From: Nadav Amit In-Reply-To: Date: Wed, 31 Jan 2018 13:17:36 -0800 Cc: Dave Hansen , the arch/x86 maintainers , Thomas Gleixner , Ingo Molnar , "H. Peter Anvin" , LKML , Peter Zijlstra Content-Transfer-Encoding: quoted-printable Message-Id: References: <20180131201118.1694-1-namit@vmware.com> <8bb352bc-4e1f-4e87-80e3-a8e65d618d2a@linux.intel.com> <0E65629C-0D7D-4602-A43D-B18E62F330A6@gmail.com> To: Andy Lutomirski X-Mailer: Apple Mail (2.3273) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Andy Lutomirski wrote: > On Wed, Jan 31, 2018 at 1:09 PM, Nadav Amit = wrote: >> Dave Hansen wrote: >>=20 >>> On 01/31/2018 12:11 PM, Nadav Amit wrote: >>>> The TLB invalidation info is allocated on the stack, which might = cause >>>> it to be unaligned. Since this information may be transferred to >>>> different cores for TLB shootdown, this might result in an = additional >>>> cache-line bouncing between the cores. >>>>=20 >>>> GCC provides a way to deal with it by using >>>> __builtin_alloca_with_align(). Use it to avoid the bouncing cache = lines. >>>=20 >>> It doesn't really *bounce*, though, does it? I don't see any writes = on >>> the remote side. The remote use seems entirely read-only. >>>=20 >>> You also don't have to exhaustively test this, but I'd love to see = at >>> least a sanity check with a microbenchmark (or something) that, yes, >>> this does help *something*. Maybe it makes the remote >>> flush_tlb_func_common() run faster because it's pulling in fewer = lines, >>> or maybe you can even detect fewer misses in there. >>=20 >> I agree that with the whole Meltdown/Spectre entry-cost it might not = even be >> measurable, at least on small ( < 2 sockets) machines. But I do not = think it >> worth profiling. Basically, AFAIK, all the data structures that are = used for >> inter-processor communication by the kernel are aligned, and this is = an >> exception. >=20 > This is only going to be measurable at all on NUMA, I suspect. Yes, I meant <=3D 2 ...=20