Received: by 2002:ad5:4acb:0:0:0:0:0 with SMTP id n11csp4310880imw; Tue, 12 Jul 2022 05:58:37 -0700 (PDT) X-Google-Smtp-Source: AGRyM1tZ3TuIk6dhnkS37u2rbZPNj0HBt3hSXT7RXQyHM9nAt5njP3Kd8ONbVr9iDPMZKntThor7 X-Received: by 2002:a17:907:2bf4:b0:72b:3336:ca9e with SMTP id gv52-20020a1709072bf400b0072b3336ca9emr19061020ejc.341.1657630717248; Tue, 12 Jul 2022 05:58:37 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1657630717; cv=none; d=google.com; s=arc-20160816; b=PzeJoUHwRgqS/zV71gAOY7lP7KC0QnWM5wXFtOIQwlpENfXuo4qvkngBgbA7uSMIZJ UV8u3tHErfC0mSt303wX2y5eupi20I30K6uUfSX3+QbR1S9uCfXGeigb7eWL1SukxljQ lOnhAs3fFugyvCWinYWkc2Fjvjbp1ljXjIOinxdozx27+dsXPPswPlALRDEwsIndXRdE wIFcsHsLiTdpvfi/Fnx5OhLMoNSPXP4QqYE53gybhPmNwrvFkLVqXbq9GZ2Ck+O4XMK4 Y3QCnXPLByew1K3H5bBr7QnKqRqhcvw8VrJgR22Rjk1qMO9ZlpwWDRxhIqrQx2oKhFHp TMSA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=TzUu9/4eWAwyfJe5s1t0yONoD27sPS3zCjo2PMaad0c=; b=baTdsjqw8P8io14ZO4s+AIIdI3hSc803Qn78kMmKVRd8mQ/7XzyKWiAjBbqIwtSO+0 zy/ghXInuj8w+mkqckPGVRBvB4YH4GAmcx781LXnHxQvA8vN4t+ztcJ6xQ1JgOSd175P XnMvTGBdIZ+tOaeXGn5LCSNv2lL9Py6MNCRGo5Wo2wK5m+Xg7MqS+Im48Uvk8ju3aw0U RFvbyWICr7lh8KzQ+QTtRqKXRNDJWhm71dGl8p4soaYm6Up91Pra+6NYK3rsjV8Ta9Bd dsVwF+q4pzCrom9cIAOZM1l0Y8pWQ7j+nZhrstsZ5UqmbjoWTYNW8+G36tfq6sdI5UBJ pZGA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=LyQMDbQI; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id s8-20020aa7c548000000b0043a7a473db2si13003001edr.484.2022.07.12.05.58.11; Tue, 12 Jul 2022 05:58:37 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=LyQMDbQI; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232790AbiGLMcq (ORCPT + 99 others); Tue, 12 Jul 2022 08:32:46 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:48700 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233061AbiGLMcj (ORCPT ); Tue, 12 Jul 2022 08:32:39 -0400 Received: from mail-wr1-x42d.google.com (mail-wr1-x42d.google.com [IPv6:2a00:1450:4864:20::42d]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E56B8A2E74; Tue, 12 Jul 2022 05:32:32 -0700 (PDT) Received: by mail-wr1-x42d.google.com with SMTP id z12so10961395wrq.7; Tue, 12 Jul 2022 05:32:32 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=TzUu9/4eWAwyfJe5s1t0yONoD27sPS3zCjo2PMaad0c=; b=LyQMDbQIrV+g+jI+he7uBZBC8jr7Ejlx42tIA5H19ff7Cs3u1PgxgOYerIu0INB/Fe 8SP+5323SDOc+X4kWcatVLXLL5X3WSrCuZMPPQKgmXCUh+9dfKoX9oWySx01EHqlL+cN vFUklcbidni9y+6s8b8ioIyqgmal8WGQwIA1SKgfpIsBO3dW4E+1M2Na2Yx6ol6A7Xph Lx97rRsHVJSqby6EvsW8UdfjrAKKkgO9YIpnzlrpXJ+uH79BtO5kvNdPZfxat7++ci75 fkglTu3cSKftH0aVJngnI+3ARNFQYO164aDY7r0ZOe5+h+o5fjiUg5nMFvzebX420vGh n6Fg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=TzUu9/4eWAwyfJe5s1t0yONoD27sPS3zCjo2PMaad0c=; b=W5y5RuCf85ud2vCDHRLMm6NJ3KjuchPzCjXtLK0WX9arvHGdibDERMd/5qvg2ZMjaZ VfhsNjzTOghu/5JuEdhXUTR9Vol3zpegO360uWScw/eI0eVNwxGNqADxvamqo/YNWDq5 eEbIrHCep6K+4ejKdC5Xm+y58J1gQ5PZOrYBEbR5YKZ/52mOcvlh926pVQPz4xdOhHHY sDnEd0+5aVhzNMZt+l0ytDeIPW+v7qTJZ5VmH2dnWVTkurqp3BH/uMPVRlBAeO3rQH6x 2eneqMf8Z0diHLSg/xlkMRTRG37DpwYUZ/waWtx1OGdr+1edHcC7rVVrtqI3bA+fWm3B 9Hsw== X-Gm-Message-State: AJIora+ZFleWi+CiQXavHkUawDObi4QDjiUE/sBQpU6gAhjCfLixhpNV r5DvPjcMx2blU4tA8ZznRg== X-Received: by 2002:a05:6000:1688:b0:21d:7b36:926a with SMTP id y8-20020a056000168800b0021d7b36926amr21039555wrd.303.1657629151257; Tue, 12 Jul 2022 05:32:31 -0700 (PDT) Received: from localhost.localdomain ([46.53.253.195]) by smtp.gmail.com with ESMTPSA id q5-20020adf9dc5000000b0021d928d2388sm8186310wre.85.2022.07.12.05.32.29 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 12 Jul 2022 05:32:30 -0700 (PDT) Date: Tue, 12 Jul 2022 15:32:27 +0300 From: Alexey Dobriyan To: Borislav Petkov Cc: linux-kernel@vger.kernel.org, Linus Torvalds , Mark Hemment , Andrew Morton , the arch/x86 maintainers , Peter Zijlstra , patrice.chotard@foss.st.com, Mikulas Patocka , Lukas Czerner , Christoph Hellwig , "Darrick J. Wong" , Chuck Lever , Hugh Dickins , patches@lists.linux.dev, Linux-MM , mm-commits@vger.kernel.org, Mel Gorman Subject: Re: [PATCH -final] x86/clear_user: Make it faster Message-ID: References: MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Jul 11, 2022 at 12:33:20PM +0200, Borislav Petkov wrote: > On Wed, Jul 06, 2022 at 12:24:12PM +0300, Alexey Dobriyan wrote: > > On Tue, Jul 05, 2022 at 07:01:06PM +0200, Borislav Petkov wrote: > > > > > + asm volatile( > > > + "1:\n\t" > > > + ALTERNATIVE_3("rep stosb", > > > + "call clear_user_erms", ALT_NOT(X86_FEATURE_FSRM), > > > + "call clear_user_rep_good", ALT_NOT(X86_FEATURE_ERMS), > > > + "call clear_user_original", ALT_NOT(X86_FEATURE_REP_GOOD)) > > > + "2:\n" > > > + _ASM_EXTABLE_UA(1b, 2b) > > > + : "+&c" (size), "+&D" (addr), ASM_CALL_CONSTRAINT > > > + : "a" (0) > > > + /* rep_good clobbers %rdx */ > > > + : "rdx"); > > > > "+c" and "+D" should be enough for 1 instruction assembly? > > I'm looking at > > e0a96129db57 ("x86: use early clobbers in usercopy*.c") > > which introduced the early clobbers and I'm thinking we want them > because "this operand is an earlyclobber operand, which is written > before the instruction is finished using the input operands" and we have > exception handling. > > But maybe you need to be more verbose as to what you mean exactly... This is the original code: -#define __do_strncpy_from_user(dst,src,count,res) \ -do { \ - long __d0, __d1, __d2; \ - might_fault(); \ - __asm__ __volatile__( \ - " testq %1,%1\n" \ - " jz 2f\n" \ - "0: lodsb\n" \ - " stosb\n" \ - " testb %%al,%%al\n" \ - " jz 1f\n" \ - " decq %1\n" \ - " jnz 0b\n" \ - "1: subq %1,%0\n" \ - "2:\n" \ - ".section .fixup,\"ax\"\n" \ - "3: movq %5,%0\n" \ - " jmp 2b\n" \ - ".previous\n" \ - _ASM_EXTABLE(0b,3b) \ - : "=&r"(res), "=&c"(count), "=&a" (__d0), "=&S" (__d1), \ - "=&D" (__d2) \ - : "i"(-EFAULT), "0"(count), "1"(count), "3"(src), "4"(dst) \ - : "memory"); \ -} while (0) I meant to say that earlyclobber is necessary only because the asm body is more than 1 instruction so there is possibility of writing to some outputs before all inputs are consumed. If asm body is 1 insn there is no such possibility at all. Now "rep stosb" is 1 instruction and two alterantive functions masquarade as single instruction.