Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp3238419imu; Sat, 24 Nov 2018 00:49:06 -0800 (PST) X-Google-Smtp-Source: AFSGD/VCt2CjLNiEqhFwOD0HFditnSlwNQlsnq15yi7SiJQDg/fVyYoCFX849noyJ475rmaFNdsQ X-Received: by 2002:a17:902:5066:: with SMTP id f35mr15165872plh.78.1543049346273; Sat, 24 Nov 2018 00:49:06 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1543049346; cv=none; d=google.com; s=arc-20160816; b=yjB11UJQXI5xAj51rfB7JzqTI1XVgMWqKd+mOwUPhbMJRMk2S+nSg9jxW9mxzklrMz 83X9pbLfIUFzUVTK7gXzB3yIkIgF5o8333VskQaRqLaPpbqZiMBG0uFznkIUBg1TLV3m tcV6jVMpluEfpeCvRr3OJFTPPG5eRbXNqysF6IqHumNDlqyVc6ZLClNUDPRRy+YbD74/ VK544SjzqjZOnBPm+asMDCMIfGmgEWaKryzIoPWGivrPYbWB3LGYW2NFnOiv6dFc6mhR OFMG3ujcW0fEssCbMAVwP0O6uagIKEASWGTPytlOXVmXiLZnXwcIEEaTcK0al+5HVRH9 qAHg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:to:references:message-id :content-transfer-encoding:cc:date:in-reply-to:from:subject :mime-version:dkim-signature; bh=jnd2/Z9l7JPVuov0yRMK35sC+paAzZ2dIu4sX4K3+1I=; b=S/9i0E9hncAMruKGXNZPodU9mtBGh0gx31Hwzt7IRs9lmqsiWjS7tHEQsJa7aEHUCT C2bP6XQKBbo4muZgKPBjzyy8VmDIICZ6zGOR1TP/VMGN78PDgnv+q9cc9RdKCV5KAEam y2L/b0qXPqmgfAasaVYt+PHzGq20AA9fWun0A4d/sABHd/eWm23K2Hub8z8awgiMGPsu 3q3U1qCOzIvmjVq2bmYP0Erp1EAlyQFL7rtD+84znOhflUS0+6xTiBSpEYeYEUHebDkx e0JxnFh/wYReMBimfjc0za6MYtGSFSp3jg7t6pLitNenxh9o/NAXHEA0q6ZpXfj8sfJE +aPA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@amacapital-net.20150623.gappssmtp.com header.s=20150623 header.b=lI6qGoL9; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d12si46268121pga.506.2018.11.24.00.48.52; Sat, 24 Nov 2018 00:49:06 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@amacapital-net.20150623.gappssmtp.com header.s=20150623 header.b=lI6qGoL9; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2441307AbeKXF45 (ORCPT + 99 others); Sat, 24 Nov 2018 00:56:57 -0500 Received: from mail-it1-f194.google.com ([209.85.166.194]:51101 "EHLO mail-it1-f194.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2437213AbeKXF44 (ORCPT ); Sat, 24 Nov 2018 00:56:56 -0500 Received: by mail-it1-f194.google.com with SMTP id z7so2155445iti.0 for ; Fri, 23 Nov 2018 11:11:22 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amacapital-net.20150623.gappssmtp.com; s=20150623; h=mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=jnd2/Z9l7JPVuov0yRMK35sC+paAzZ2dIu4sX4K3+1I=; b=lI6qGoL9+kwQflprcXIwqWtBBuH7vrEI0FLtwK0wV+PNUM+98qfr7LVkkqvMr6ZLyY AHe2tseFdhz2OF7PH8MDe1SXDo2L/WPEJ40iJ5/5ULlxTvzRqnfgcshDxZALSEkcBERT MEsktkEuqfGgi6LZe6Bn9apofg4WUKxvxjP7yIswHfNi5LRF8Z1YQCX69lziEQI7xdOP b6FevqkNFUxlaJwSrU/oBuj7fHFgKIAr8kc/iCVmng0XiFf0mOfU6CL8D5p5ifYyza+j UBWXBsq+nSU8ddFBOZyCJio9JImfDJ/Ew2CWPH/1RlqSHw5964f19JDOvaQhRaCVpK+t YudA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=jnd2/Z9l7JPVuov0yRMK35sC+paAzZ2dIu4sX4K3+1I=; b=DJIHSiHmbLB7wZ9N4fHVwypaTomEezIw1/Qo7lInOJ+DCJoIbC3LNKeVJ0Wc7k2A+o 14ks5BAEpFu7cNkCMv93VXfH8CsGS7/h+lUXWXo6AoSL6bBHX/vn5HKlQe3BFH+1CWeJ ZP15LPElG/6T1NJ7cUWtDBnpxmI6XoZvPSCUBJKiGaEoBektk1uUg8q4b247WHRcY1wF D36WGMoTWpVtYcDkijjJJ8p5naAiWpozqH2az7EGZjn0qmDBTciV1Qkae/30JgvpyVtW jlMJLR51f2WEO742hktv1PtxAoIox8sbLXLjB3PRHNsCls2ai9+WdaUus03t3YRn3yHB BBpA== X-Gm-Message-State: AGRZ1gLVPNCiW3RqSbC405fbDFAgqmF/IlvSVcWojc8x7j9xEQZKbSBV +BX+K6HA3uxDCH5MC/6b5Rav0w== X-Received: by 2002:a24:4648:: with SMTP id j69mr14453715itb.56.1543000282097; Fri, 23 Nov 2018 11:11:22 -0800 (PST) Received: from ?IPv6:2600:100e:b01e:7457:896a:cc79:a4ed:4140? ([2600:100e:b01e:7457:896a:cc79:a4ed:4140]) by smtp.gmail.com with ESMTPSA id d12sm10672700ioh.44.2018.11.23.11.11.20 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Fri, 23 Nov 2018 11:11:21 -0800 (PST) Content-Type: text/plain; charset=utf-8 Mime-Version: 1.0 (1.0) Subject: Re: [PATCH] x86: only use ERMS for user copies for larger sizes From: Andy Lutomirski X-Mailer: iPhone Mail (16A404) In-Reply-To: Date: Fri, 23 Nov 2018 12:11:19 -0700 Cc: David.Laight@aculab.com, Andrew Lutomirski , dvlasenk@redhat.com, Jens Axboe , Ingo Molnar , Thomas Gleixner , Ingo Molnar , bp@alien8.de, Peter Anvin , the arch/x86 maintainers , Andrew Morton , Peter Zijlstra , brgerst@gmail.com, Linux List Kernel Mailing , pabeni@redhat.com Content-Transfer-Encoding: quoted-printable Message-Id: <0D9E579C-F980-4BB7-A282-42206F6636F3@amacapital.net> References: <02bfc577-32a5-66be-64bf-d476b7d447d2@kernel.dk> <20181121063609.GA109082@gmail.com> <48e27a3a-2bb2-ff41-3512-8aeb3fd59e57@kernel.dk> <26eff539-7de7-784c-0c88-f1d30753299d@redhat.com> <7ea44458b90b4d41a08ba9012818d273@AcuMS.aculab.com> <64fd67993af04579b5262c270a7a4694@AcuMS.aculab.com> To: Linus Torvalds Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org > On Nov 23, 2018, at 11:44 AM, Linus Torvalds wrote: >=20 >> On Fri, Nov 23, 2018 at 10:39 AM Andy Lutomirski wr= ote: >>=20 >> What is memcpy_to_io even supposed to do? I=E2=80=99m guessing it=E2=80=99= s defined as something like =E2=80=9Ccopy this data to IO space using at mos= t long-sized writes, all aligned, and writing each byte exactly once, in ord= er.=E2=80=9D That sounds... dubiously useful. >=20 > We've got hundreds of users of it, so it's fairly common.. >=20 I=E2=80=99m wondering if the =E2=80=9Cat most long-sizes=E2=80=9D restrictio= n matters, especially given that we=E2=80=99re apparently accessing some of t= he same bytes more than once. I would believe that trying to encourage 16-by= te writes (with AVX, ugh) or 64-byte writes (with MOVDIR64B) would be safe a= nd could meaningfully speed up some workloads. >> I could see a function that writes to aligned memory in specified-sized c= hunks. >=20 > We have that. It's called "__iowrite{32,64}_copy()". It has very few users= . >=20 > Linus