Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp1873783pxu; Sun, 13 Dec 2020 06:12:55 -0800 (PST) X-Google-Smtp-Source: ABdhPJwe1L11P9WZZb6fTJ/zS4d0lBZmVUeptPghkJZDoRuvacKMNl4jpTz0+KfsdC5W7wDcaaTY X-Received: by 2002:a17:907:41e3:: with SMTP id nb3mr19247076ejb.378.1607868775628; Sun, 13 Dec 2020 06:12:55 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1607868775; cv=none; d=google.com; s=arc-20160816; b=ZF45uesz3c4rFxhjoqHkm/PongvprFnrkS8dZWax924VQ4Qr/yLDb2K9rF4xEpsCsy MeUgde19VSpM/7n7Hi9Q5gHs78TPFALUQKIIdr2/pYZXSZSp79KADBeoHwtYwiFBOo/7 Zuzz23gb6NbXqMRzrVXtdRsWaIVwvj+b7MPoeRw27KjCba3SODVncqBV5NuMlXXFaAWB mwJRmlpbpDYfrZC3ToOe3R3LxnBXgFA5CKFCaBSmYuQ4ZeqWM3yC8A2nolnXHo0RPm3F pgVGfuzLMtbBY+WTZUcGhBx/R7A1WwvyLfFroxnOBpzuLBUtV6OtCRNVJcEBaxfEkulj eWMg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject; bh=Lk6+0bJQEoEz7m01v3E/poetQWgjVlN/y2l0bXBHqQM=; b=aI2Zr7sxUdrc2szGAxyqSSkPuH0iujjI6Xz1jjzVVQ+aoZwuM+sgtm5QZ84HegzV/U xSM221YQ2r2wkMEK8fLVf4T5/eJKd3jzyStqZXiMIIOgYACbXr+ba9nXGRUgFLnBFtYe J0LIE6HassjEaQjHmZinA0kluDvXrRCbSpDDf2hCeS4B6SaMc0NQgkihc2DNujiicLrf 1JBgDL43ag7YLzP8H+28pEFrc+UhN4XTzC3xFxTsOe3ZQH4vwKcVyneyMIHmLmrk7+91 pGQ0wM1vz5YAfo+AlQ2NyOwU572iudqWXpkD/LBwiZj1k4X4wV7FaUPNHpba/eNE8aL/ 46nw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-crypto-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-crypto-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id l61si8192860ede.106.2020.12.13.06.12.21; Sun, 13 Dec 2020 06:12:55 -0800 (PST) Received-SPF: pass (google.com: domain of linux-crypto-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-crypto-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-crypto-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2405429AbgLLEdp (ORCPT + 99 others); Fri, 11 Dec 2020 23:33:45 -0500 Received: from ZXSHCAS2.zhaoxin.com ([203.148.12.82]:14120 "EHLO ZXSHCAS2.zhaoxin.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2389029AbgLLEdS (ORCPT ); Fri, 11 Dec 2020 23:33:18 -0500 Received: from zxbjmbx1.zhaoxin.com (10.29.252.163) by ZXSHCAS2.zhaoxin.com (10.28.252.162) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.1979.3; Sat, 12 Dec 2020 12:32:32 +0800 Received: from [10.32.56.37] (10.32.56.37) by zxbjmbx1.zhaoxin.com (10.29.252.163) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.1979.3; Sat, 12 Dec 2020 12:32:31 +0800 Subject: Re: [PATCH] crypto: x86/crc32c-intel - Don't match some Zhaoxin CPUs To: Peter Zijlstra CC: , , , , , , , , , , , , , , , References: <1607686144-2604-1-git-send-email-TonyWWang-oc@zhaoxin.com> <20201211130058.GZ2414@hirez.programming.kicks-ass.net> From: Tony W Wang-oc Message-ID: <757bfaea-04ce-fa92-f990-bcda3d7580d7@zhaoxin.com> Date: Sat, 12 Dec 2020 12:32:23 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 MIME-Version: 1.0 In-Reply-To: <20201211130058.GZ2414@hirez.programming.kicks-ass.net> Content-Type: text/plain; charset="utf-8" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.32.56.37] X-ClientProxiedBy: ZXSHCAS2.zhaoxin.com (10.28.252.162) To zxbjmbx1.zhaoxin.com (10.29.252.163) Precedence: bulk List-ID: X-Mailing-List: linux-crypto@vger.kernel.org On 11/12/2020 21:00, Peter Zijlstra wrote: > On Fri, Dec 11, 2020 at 07:29:04PM +0800, Tony W Wang-oc wrote: >> The driver crc32c-intel match CPUs supporting X86_FEATURE_XMM4_2. >> On platforms with Zhaoxin CPUs supporting this X86 feature, When >> crc32c-intel and crc32c-generic are both registered, system will >> use crc32c-intel because its .cra_priority is greater than >> crc32c-generic. This case expect to use crc32c-generic driver for >> some Zhaoxin CPUs to get performance gain, So remove these Zhaoxin >> CPUs support from crc32c-intel. >> >> Signed-off-by: Tony W Wang-oc >> --- >> arch/x86/crypto/crc32c-intel_glue.c | 8 ++++++++ >> 1 file changed, 8 insertions(+) >> >> diff --git a/arch/x86/crypto/crc32c-intel_glue.c b/arch/x86/crypto/crc32c-intel_glue.c >> index feccb52..6dafdae 100644 >> --- a/arch/x86/crypto/crc32c-intel_glue.c >> +++ b/arch/x86/crypto/crc32c-intel_glue.c >> @@ -222,8 +222,16 @@ MODULE_DEVICE_TABLE(x86cpu, crc32c_cpu_id); >> >> static int __init crc32c_intel_mod_init(void) >> { >> + struct cpuinfo_x86 *c = &boot_cpu_data; >> + >> if (!x86_match_cpu(crc32c_cpu_id)) >> return -ENODEV; >> + >> + if (c->x86_vendor == X86_VENDOR_ZHAOXIN || c->x86_vendor == X86_VENDOR_CENTAUR) { >> + if (c->x86 == 0x6 || (c->x86 == 0x7 && c->x86_model <= 0x3b)) >> + return -ENODEV; >> + } > > Egads, why can't you use that x86_match_cpu() above, and also this > really wants a comment on why you're excluding these chips. When doing lmbench3 Create and Delete file test on partitions with ext4 enabling metadata checksum, found using crc32c-generic driver could get about 20% performance gain than using the driver crc32c-intel on these chips. Also, since > (IIRC) ZHAOXIN is basically AND, shouldn't they also be listed? > > That is; write it like: > > m = x86_match_cpu(crc32_cpu_id); > if (!m || !m->data) > return -ENODEV; > > That way you can have positive and negative matches in the array > (obviously the existing FEATURE test would need data=1 and be last). > . > Lot thanks for you suggestion, will list these chips in crc32c_cpu_id and use x86_match_cpu: static const struct x86_cpu_id crc32c_cpu_id[] = { + X86_MATCH_VENDOR_FAM_FEATURE(ZHAOXIN, 0x6, X86_FEATURE_XMM4_2, 1), + X86_MATCH_VENDOR_FAM_MODEL_FEATURE(ZHAOXIN, 0x7, 0x1b, X86_FEATURE_XMM4_2, 1), + X86_MATCH_VENDOR_FAM_MODEL_FEATURE(ZHAOXIN, 0x7, 0x3b, X86_FEATURE_XMM4_2, 1), + X86_MATCH_VENDOR_FAM_FEATURE(CENTAUR, 0x6, X86_FEATURE_XMM4_2, 1), + X86_MATCH_VENDOR_FAM_MODEL_FEATURE(CENTAUR, 0x7, 0x1b, X86_FEATURE_XMM4_2, 1), + X86_MATCH_VENDOR_FAM_MODEL_FEATURE(CENTAUR, 0x7, 0x3b, X86_FEATURE_XMM4_2, 1), X86_MATCH_FEATURE(X86_FEATURE_XMM4_2, NULL), {} }; @@ -228,8 +234,10 @@ MODULE_DEVICE_TABLE(x86cpu, crc32c_cpu_id); static int __init crc32c_intel_mod_init(void) { - if (!x86_match_cpu(crc32c_cpu_id)) + const struct x86_cpu_id *m = x86_match_cpu(crc32c_cpu_id); + if (!m || m->driver_data) return -ENODEV; sincerely TonyWWangoc