Received: by 10.213.65.68 with SMTP id h4csp492710imn; Tue, 20 Mar 2018 08:09:37 -0700 (PDT) X-Google-Smtp-Source: AG47ELuIgy+wPPzK3eV6eQlwfpRb/nsOR2yOIanF9oFpSvwuzeYGKyDhjjRDIQfbqI4TCzzA1amN X-Received: by 10.98.174.23 with SMTP id q23mr10548824pff.103.1521558577439; Tue, 20 Mar 2018 08:09:37 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1521558577; cv=none; d=google.com; s=arc-20160816; b=l9DZMs1Xot2Zh27k31ng6IBKLCy8cSkGHXBU8nvwWgViOlp1oKubDaE15norEJexlE YLFLYxwBW5Tvcx+ELJBgZ5FOjIYT3p0/C3QECmnQ83WmXrAfd4qCRSUJ5G6PvMPE8qpV /Uxx8xegI+2YkbMAN8jobTWEpfo6aFqT3lXdgPV/f4TU5zxQhmBe+dZrvMNpVLtAfK+f K3i7jnv6NnTZyYm0vNF4wkiGYpf3xJTYdIMp3iG6WfsIDp11KxonmhMXxFBp63WC7wKw gX24KZHOI3O0tAABWqA2navqIDKnoX/mWjboqtNwySoAL6L4y/XDiTyfM1xZTvigq3ks /4lg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dmarc-filter :dkim-signature:dkim-signature:arc-authentication-results; bh=f87E6lVtZglD2yRiZJprmscuRuNLQ0oNuYBY0EtOE7M=; b=TvwF/X8Cy4TtKEGwDWYazcZEJme7DB+VqdvSfGEBVfMTTQhY0J7BWqP9aN6pEvuKWe HDiboIpxsJOADjvACz+MtedsPr7ArTKUbpeMvVUq0j3qJeoWTFOC4Ulh1oSmCtI4uwuB YlDiwGJhZTzY0efLKvVuNs/IBf1sx1nw2QVN7n993nQh5/NDl48SniZxq8ONjzgg+j8k 7gBzTzTTcUdSaqxNWP3+e+j0LRgynOyVrKzjFaqLfwveUFZgCXSFeY6Tx5Cy7UBna+UK ufW4MFZiH09cDh8KnhE/9gDmNLprji4ZeEYkJZGLDV+PXkIgOoR0dBaWHpga0gasiv2m khxg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@codeaurora.org header.s=default header.b=Ea3dnou3; dkim=pass header.i=@codeaurora.org header.s=default header.b=SRNIVfFR; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 3-v6si1752865plu.465.2018.03.20.08.09.22; Tue, 20 Mar 2018 08:09:37 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@codeaurora.org header.s=default header.b=Ea3dnou3; dkim=pass header.i=@codeaurora.org header.s=default header.b=SRNIVfFR; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751546AbeCTPIW (ORCPT + 99 others); Tue, 20 Mar 2018 11:08:22 -0400 Received: from smtp.codeaurora.org ([198.145.29.96]:33920 "EHLO smtp.codeaurora.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750991AbeCTPIT (ORCPT ); Tue, 20 Mar 2018 11:08:19 -0400 Received: by smtp.codeaurora.org (Postfix, from userid 1000) id ED09960C54; Tue, 20 Mar 2018 15:08:18 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=codeaurora.org; s=default; t=1521558499; bh=TxeFboFCFrFvCYfW+P5NjUu8zSVR5MOqoLMfL1aYVKU=; h=Subject:To:Cc:References:From:Date:In-Reply-To:From; b=Ea3dnou3p05cKnEB/EZhSDAVlHZCApQxJrmhU/f7k5EjpR3dNMR0drYPauqqxQ8pJ LgnXmu3zpDqPIfH9ksEMi3IZIUEHSTn5+undofSoD6sJRfkZzO/DEfilE6EVl0+Hkj 3AxmCBZ97WhLNHWdcD44kNje8LGuz+S5jnHdBdes= X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on pdx-caf-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-2.8 required=2.0 tests=ALL_TRUSTED,BAYES_00, DKIM_SIGNED,T_DKIM_INVALID autolearn=no autolearn_force=no version=3.4.0 Received: from [10.177.68.48] (pat_11.qualcomm.com [192.35.156.11]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) (Authenticated sender: okaya@smtp.codeaurora.org) by smtp.codeaurora.org (Postfix) with ESMTPSA id 675026021A; Tue, 20 Mar 2018 15:08:17 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=codeaurora.org; s=default; t=1521558498; bh=TxeFboFCFrFvCYfW+P5NjUu8zSVR5MOqoLMfL1aYVKU=; h=Subject:To:Cc:References:From:Date:In-Reply-To:From; b=SRNIVfFRVet41jheJExmjkXrdPeNv8r6TzCE+dH2Y5xHpW+KLOykVe1HCfLq8Vj3q 7GYwEYTa9pgACFTwgUDayHjIQO0nJPxpFc9fbxyg9HCbA54Qy76gUir/hVtysqYmef qp8BJ/xFZDehQkHv3JYV8jTP98/bLhMORgehTGtQ= DMARC-Filter: OpenDMARC Filter v1.3.2 smtp.codeaurora.org 675026021A Authentication-Results: pdx-caf-mail.web.codeaurora.org; dmarc=none (p=none dis=none) header.from=codeaurora.org Authentication-Results: pdx-caf-mail.web.codeaurora.org; spf=none smtp.mailfrom=okaya@codeaurora.org Subject: Re: [PATCH v4 1/6] RDMA/bnxt_re: Eliminate duplicate barriers on weakly-ordered archs To: Jason Gunthorpe Cc: linux-rdma@vger.kernel.org, timur@codeaurora.org, sulrich@codeaurora.org, linux-arm-msm@vger.kernel.org, linux-arm-kernel@lists.infradead.org, Selvin Xavier , Devesh Sharma , Somnath Kotur , Sriharsha Basavapatna , Doug Ledford , linux-kernel@vger.kernel.org References: <1521514068-8856-1-git-send-email-okaya@codeaurora.org> <1521514068-8856-2-git-send-email-okaya@codeaurora.org> <20180320144801.GE19744@ziepe.ca> From: Sinan Kaya Message-ID: <3e6cfa25-6b9f-24ff-499a-6df2740b658b@codeaurora.org> Date: Tue, 20 Mar 2018 10:08:16 -0500 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.6.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 3/20/2018 10:00 AM, Sinan Kaya wrote: > On 3/20/2018 9:48 AM, Jason Gunthorpe wrote: >> On Mon, Mar 19, 2018 at 10:47:43PM -0400, Sinan Kaya wrote: >>> Code includes wmb() followed by writel(). writel() already has a barrier on >>> some architectures like arm64. >>> >>> This ends up CPU observing two barriers back to back before executing the >>> register write. >>> >>> Since code already has an explicit barrier call, changing writel() to >>> writel_relaxed(). >>> >>> Signed-off-by: Sinan Kaya >>> drivers/infiniband/hw/bnxt_re/qplib_rcfw.c | 8 ++++---- >>> 1 file changed, 4 insertions(+), 4 deletions(-) >>> >>> diff --git a/drivers/infiniband/hw/bnxt_re/qplib_rcfw.c b/drivers/infiniband/hw/bnxt_re/qplib_rcfw.c >>> index 8329ec6..4a6b981 100644 >>> +++ b/drivers/infiniband/hw/bnxt_re/qplib_rcfw.c >>> @@ -181,10 +181,10 @@ static int __send_message(struct bnxt_qplib_rcfw *rcfw, struct cmdq_base *req, >>> >>> /* ring CMDQ DB */ >>> wmb(); >>> - writel(cmdq_prod, rcfw->cmdq_bar_reg_iomem + >>> - rcfw->cmdq_bar_reg_prod_off); >>> - writel(RCFW_CMDQ_TRIG_VAL, rcfw->cmdq_bar_reg_iomem + >>> - rcfw->cmdq_bar_reg_trig_off); >>> + writel_relaxed(cmdq_prod, rcfw->cmdq_bar_reg_iomem + >>> + rcfw->cmdq_bar_reg_prod_off); >>> + writel_relaxed(RCFW_CMDQ_TRIG_VAL, rcfw->cmdq_bar_reg_iomem + >>> + rcfw->cmdq_bar_reg_trig_off); >> >> Woah, this may not be safe.. >> >> The definition of writel_relaxed() is that it is fully unordered, so >> the above two writes may change order now. Broadcom guys would have to >> ack if that it is OK or not for their hardware. >> >> In general this is not an OK approach for a mechanical >> conversion.. Only the first writel can be convereted. >> >> You need to check all your patches to make sure there are no >> subsequent writel's in the places touched. > > I paid special attention to this one and went to check the barriers > document. According to the document, writes (whether it is relaxed or not) > are always observed by the HW inorder with respect to each other. > It just doesn't guarantee anything above writel_relaxed() to be observed. > Since we already have a wmb(), this is taken care of. > > If somebody wants things to be observed after register write, there should > have been a wmb() or mmiowb() afterwards. Never mind, it will break some architectures. I'll only change the first one. (1) On some systems, I/O stores are not strongly ordered across all CPUs, and so for _all_ general drivers locks should be used and mmiowb() must be issued prior to unlocking the critical section. (2) If the accessor functions are used to refer to an I/O memory window with relaxed memory access properties, then _mandatory_ memory barriers are required to enforce ordering. > > >> >> Jason >> > > -- Sinan Kaya Qualcomm Datacenter Technologies, Inc. as an affiliate of Qualcomm Technologies, Inc. Qualcomm Technologies, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project.