Received: by 10.223.176.46 with SMTP id f43csp1133141wra; Fri, 19 Jan 2018 07:20:56 -0800 (PST) X-Google-Smtp-Source: ACJfBovxTSPsP1NqVIIDl1RJLVG4BOfJrnKUZ25PhMahq5A+A/vottPH0sH524kl4dzJK26K1FH3 X-Received: by 10.101.78.207 with SMTP id w15mr18516071pgq.349.1516375256767; Fri, 19 Jan 2018 07:20:56 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1516375256; cv=none; d=google.com; s=arc-20160816; b=UWSD5Dgd0W7TfZ2w0g2qNspAt91sfUy1BJouRPpXAu3I5oofHzlAROJ8MvBtewYn7P etZynE81+MqWDkyCEN3vuCc3sgET6gl1t/u+SBkFk6Da6NxNgCA8qU4K8iJXbHBH8R0+ eTLeH2SVGP5x+c62W35jGgd38drlSaWCRz0QOM41FBqA1YqHW5+Wn24Wh6C6lxGR9ef2 ur4ZbAMx8nplL1IAvV49Jmhd6dHBzrhhq5MqOEiO1ocuZ3bqkjLwcJ6IXuAZFXwsvlK/ 32bmqTO7bziQ8BcRgF562YklUSpVdKgTGqF+Stb8U5qMqiU0xUm8HhinGDgvzML3lt7k gG8A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dkim-signature :arc-authentication-results; bh=wGFumJZ0w2pA5G6da0IIfvVo3b4wNs2T759Fbu/TaQ4=; b=rBdq3l3fPLSxUrgshxLMteNStXuxtX9pRegCGnOKiCOMu5vU/KAeO1rLQq/rOdSg6S eNbxIqd5PyXYmT/eeLuIqB3Vj2Q2OWf8OT88WGKZpiH47AymGSFuwt5JELvLuzpUVWrc N8/04MrBtFk82IcbVz6dsRs8koN3Z04REFy4F3Yd9aXBBU5AzIuijn+KAN9gWVteq3fj p7jUCuwe/mN2hJGLLIpSci8DFNi9ucCQacoeerMrOuw4mzBUVNzmHWAvU3h9f9xj+0QE xunbS++lpTX969g6b/Mpq/R8JQfFovzyuY11KU8+khD/azao0hKa35MEOznaTYZA7sSB Vyzw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2017-10-26 header.b=t/DLo0Bf; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e18si829158pfi.130.2018.01.19.07.20.42; Fri, 19 Jan 2018 07:20:56 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2017-10-26 header.b=t/DLo0Bf; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932356AbeASPT7 (ORCPT + 99 others); Fri, 19 Jan 2018 10:19:59 -0500 Received: from userp2130.oracle.com ([156.151.31.86]:46242 "EHLO userp2130.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932191AbeASPTs (ORCPT ); Fri, 19 Jan 2018 10:19:48 -0500 Received: from pps.filterd (userp2130.oracle.com [127.0.0.1]) by userp2130.oracle.com (8.16.0.22/8.16.0.22) with SMTP id w0JFGnxk102153; Fri, 19 Jan 2018 15:19:43 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=oracle.com; h=subject : to : cc : references : from : message-id : date : mime-version : in-reply-to : content-type : content-transfer-encoding; s=corp-2017-10-26; bh=wGFumJZ0w2pA5G6da0IIfvVo3b4wNs2T759Fbu/TaQ4=; b=t/DLo0BffCImE8FOIXjIWBU4lwhmzk1nUgGOfof0leQxr5HgX3rUnQXBaicVhxZMQaij OqJcTjsddjnUC+0kfpkUybFKJw15RwrgScagNLZVfOJ63imAPYP3rPHvADzLvMpveq3y RYpCfUuobb0sv5lhTcQOQIok44xBF6iZmCnH4MAeCvuviLERXr/f+xpAiWNwP5lvj5Ti PP4D46DWaSOQhGjkHYqmIpHdvyDa02CIRY9OjWLSu4F5Z+n+CcRtjpR/iUtmbyPZJhxZ B5TFjCCA8VeXSSu+XEB3xbCqdVhckPxE4imP3qQqoPvIAZtFUquWr/cuiYOTR6iwmFNq dg== Received: from aserv0021.oracle.com (aserv0021.oracle.com [141.146.126.233]) by userp2130.oracle.com with ESMTP id 2fkk3d032b-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Fri, 19 Jan 2018 15:19:43 +0000 Received: from aserv0122.oracle.com (aserv0122.oracle.com [141.146.126.236]) by aserv0021.oracle.com (8.14.4/8.14.4) with ESMTP id w0JFGI1B008118 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=FAIL); Fri, 19 Jan 2018 15:16:18 GMT Received: from abhmp0018.oracle.com (abhmp0018.oracle.com [141.146.116.24]) by aserv0122.oracle.com (8.14.4/8.14.4) with ESMTP id w0JFGHOC003486; Fri, 19 Jan 2018 15:16:18 GMT Received: from [10.191.5.26] (/10.191.5.26) by default (Oracle Beehive Gateway v4.0) with ESMTP ; Fri, 19 Jan 2018 07:16:17 -0800 Subject: Re: [PATCH] net/mlx4_en: ensure rx_desc updating reaches HW before prod db updating To: Tariq Toukan , Eric Dumazet , Jason Gunthorpe Cc: junxiao.bi@oracle.com, netdev@vger.kernel.org, linux-rdma@vger.kernel.org, linux-kernel@vger.kernel.org, Saeed Mahameed References: <1515728542-3060-1-git-send-email-jianchao.w.wang@oracle.com> <20180112163247.GB15974@ziepe.ca> <1515775567.131759.42.camel@gmail.com> From: "jianchao.wang" Message-ID: <53b1ac4d-a294-eb98-149e-65d7954243da@oracle.com> Date: Fri, 19 Jan 2018 23:16:09 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.5.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit X-Proofpoint-Virus-Version: vendor=nai engine=5900 definitions=8778 signatures=668654 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 suspectscore=0 malwarescore=0 phishscore=0 bulkscore=0 spamscore=0 mlxscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1711220000 definitions=main-1801190199 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Tariq Very sad that the crash was reproduced again after applied the patch. --- a/drivers/net/ethernet/mellanox/mlx4/en_rx.c +++ b/drivers/net/ethernet/mellanox/mlx4/en_rx.c @@ -252,6 +252,7 @@ static inline bool mlx4_en_is_ring_empty(struct mlx4_en_rx_ring *ring) static inline void mlx4_en_update_rx_prod_db(struct mlx4_en_rx_ring *ring) { + dma_wmb(); *ring->wqres.db.db = cpu_to_be32(ring->prod & 0xffff); } I analyzed the kdump, it should be a memory corruption. Thanks Jianchao On 01/15/2018 01:50 PM, jianchao.wang wrote: > Hi Tariq > > Thanks for your kindly response. > > On 01/14/2018 05:47 PM, Tariq Toukan wrote: >> Thanks Jianchao for your patch. >> >> And Thank you guys for your reviews, much appreciated. >> I was off-work on Friday and Saturday. >> >> On 14/01/2018 4:40 AM, jianchao.wang wrote: >>> Dear all >>> >>> Thanks for the kindly response and reviewing. That's really appreciated. >>> >>> On 01/13/2018 12:46 AM, Eric Dumazet wrote: >>>>> Does this need to be dma_wmb(), and should it be in >>>>> mlx4_en_update_rx_prod_db ? >>>>> >>>> +1 on dma_wmb() >>>> >>>> On what architecture bug was observed ? >>> This issue was observed on x86-64. >>> And I will send a new patch, in which replace wmb() with dma_wmb(), to customer >>> to confirm. >> >> +1 on dma_wmb, let us know once customer confirms. >> Please place it within mlx4_en_update_rx_prod_db as suggested. > Yes, I have recommended it to customer. > Once I get the result, I will share it here. >> All other calls to mlx4_en_update_rx_prod_db are in control/slow path so I prefer being on the safe side, and care less about bulking the barrier. >> >> Thanks, >> Tariq >> >