Received: by 2002:a05:6a11:4021:0:0:0:0 with SMTP id ky33csp467119pxb; Tue, 14 Sep 2021 01:25:50 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxFjjpYcF2C8XqTmVJld4M4xiMVxqepP2t4dSVRDhU1pfcw5rhE9gbjL3tfxEuMZIMQjj4q X-Received: by 2002:a92:d90b:: with SMTP id s11mr8478040iln.206.1631607950691; Tue, 14 Sep 2021 01:25:50 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1631607950; cv=none; d=google.com; s=arc-20160816; b=nf3qeg+JrA1PjMMJc+YspDTcj63r2ty3MQPw9YDdqSAqbDg2eG/8bfVt37QpcZ616U xEb9Nf4aohCfKa8MIpBShgKgyhsuewEsuBFaeRV0iXm/FB8XeNrNR/4Zhwd2ceZyELGp UguLNed3h74g2JCmQ/OfPuU+zGRqpXt8wvcSUY8yC+dXK8uBt/4s1B0LZOPCNv6jfFGB gbIYwv0Jq5rQmYK0XyeDs6/NVMwST3Q/R9KVIatgv1pOmkW14sJJLRlZAUJ3YTGiMkTk 5W3CiI2aFQzxPeIBMjNkY9CnT6N7Xt2njQUIjfcjkfAjlBmwRuBcVQdP8eZXo2CzpwXU u0bg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :mime-version:accept-language:in-reply-to:references:message-id:date :thread-index:thread-topic:subject:cc:to:from; bh=QwG9Xs8aUttrMWivsliMmCBTshBCysK0hWyz+3gL2dQ=; b=voQcApIGglbuKMK8hEGZsEO1wMQoM753u/kqpKse6HZ2rQa5JWxA6VzZUTy5zd3lTg YboSqfHLUVmRIsoBkRDpmzB/aHZZen/O15tWlpuNPxG/4BkJnL2uUnrI0DIoYmDL9c6u LJUzKOI9cg7kk5wVbfiq7uVk/Ar4QX62+zqp1+nntcWYYG6ZAFzDAmKJnGvqHrVsEos1 uYrYGPYs9lvREbSnPSDFVLgoqnBDBAk04TRp4cY+wrzykmkelwlo84FPgdHJwoqk5nQ4 HgXRyY0VY+TvRLji6gB5zTKNAt1lIfvp7jwCpA2FfACZ3TtcY3QtQOeaGZV/Ms74wA8k +Dpg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=aculab.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id o18si10731787jam.60.2021.09.14.01.25.39; Tue, 14 Sep 2021 01:25:50 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=aculab.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229985AbhINIZB convert rfc822-to-8bit (ORCPT + 99 others); Tue, 14 Sep 2021 04:25:01 -0400 Received: from eu-smtp-delivery-151.mimecast.com ([185.58.85.151]:55755 "EHLO eu-smtp-delivery-151.mimecast.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229458AbhINIZA (ORCPT ); Tue, 14 Sep 2021 04:25:00 -0400 Received: from AcuMS.aculab.com (156.67.243.121 [156.67.243.121]) (Using TLS) by relay.mimecast.com with ESMTP id uk-mta-2-spjJskWLOtKyxaxI53WEXQ-1; Tue, 14 Sep 2021 09:23:41 +0100 X-MC-Unique: spjJskWLOtKyxaxI53WEXQ-1 Received: from AcuMS.Aculab.com (fd9f:af1c:a25b:0:994c:f5c2:35d6:9b65) by AcuMS.aculab.com (fd9f:af1c:a25b:0:994c:f5c2:35d6:9b65) with Microsoft SMTP Server (TLS) id 15.0.1497.23; Tue, 14 Sep 2021 09:23:41 +0100 Received: from AcuMS.Aculab.com ([fe80::994c:f5c2:35d6:9b65]) by AcuMS.aculab.com ([fe80::994c:f5c2:35d6:9b65%12]) with mapi id 15.00.1497.023; Tue, 14 Sep 2021 09:23:40 +0100 From: David Laight To: 'Willy Tarreau' CC: Douglas Gilbert , LKML Subject: RE: how many memset(,0,) calls in kernel ? Thread-Topic: how many memset(,0,) calls in kernel ? Thread-Index: AQHXp5KEgy5ggGKXu0iAcdzvqjgoc6uiIXMA///yLYCAAR6/oA== Date: Tue, 14 Sep 2021 08:23:40 +0000 Message-ID: <15cd0a8e72b3460db939060db25dd59a@AcuMS.aculab.com> References: <1c4a94df-fc2f-1bb2-8bce-2d71f9f1f5df@interlog.com> <20210912045608.GB16216@1wt.eu> <88976a40175c491fb5e3349f6686ad67@AcuMS.aculab.com> <20210913160945.GA2456@1wt.eu> In-Reply-To: <20210913160945.GA2456@1wt.eu> Accept-Language: en-GB, en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-ms-exchange-transport-fromentityheader: Hosted x-originating-ip: [10.202.205.107] MIME-Version: 1.0 Authentication-Results: relay.mimecast.com; auth=pass smtp.auth=C51A453 smtp.mailfrom=david.laight@aculab.com X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: aculab.com Content-Language: en-US Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8BIT Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Willy Tarreau > Sent: 13 September 2021 17:10 > > On Mon, Sep 13, 2021 at 04:03:09PM +0000, David Laight wrote: > > > 36: b9 06 00 00 00 mov $0x6,%ecx > > > 3b: 4c 89 e7 mov %r12,%rdi > > > 3e: f3 ab rep stos %eax,%es:(%rdi) > > > > > > The last line does exactly "memset(%rdi, %eax, %ecx)". Just two bytes > > > for some code that modern processors are even able to optimize. > > > > Hmmm I'd bet that 6 stores will be faster on ~everything. > > 'modern' processors do better than some older ones [1], but 6 > > writes isn't enough to get into the really fast paths. > > So you'll still take a few cycles of setup. > > The exact point is, here it's up to the compiler to decide thanks to > its builtin what it considers best for the target CPU. It already > knows the fixed size and the code is emitted accordingly. It may > very well be a call to the memset() function when the size is large > and a power of two because it knows alternate variants are available > for example. > > The compiler might even decide to shrink that area if other bytes > are written just after the memset(), leaving only holes touched by > memset(). You might think the compiler will make sane choices for the target CPU. But it often makes a complete pig's breakfast of it. I'm pretty sure 6 'rep stos' is slower than 6 write an absolutely everything - with the possible exception of an 8088. By far the worst ones are when the compiler decides to pessimise a loop by using the simd (eg avx512) instructions to do 4 (or 8) loop iterations in one pass. It might be fine if the loop count is in the 100s - but not when it is 3. One compiler I've used nicely converted any byte copy loop into a 'rep movsb' instruction. That was contemporary with P4 netburst - where it was terribly slow. David - Registered Address Lakeside, Bramley Road, Mount Farm, Milton Keynes, MK1 1PT, UK Registration No: 1397386 (Wales)