Received: by 2002:a05:6a10:1a4d:0:0:0:0 with SMTP id nk13csp5928346pxb; Mon, 14 Feb 2022 10:55:41 -0800 (PST) X-Google-Smtp-Source: ABdhPJzsKEPeSvDmyiaBN0Dd9WOS3aRUenlGEVUrc9FDaVioJGaXX2qLO5unWnidGehkWHiZcGxm X-Received: by 2002:a17:90b:1e51:: with SMTP id pi17mr33799pjb.56.1644864941703; Mon, 14 Feb 2022 10:55:41 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1644864941; cv=none; d=google.com; s=arc-20160816; b=EQc7ukJPscNkasCdI9xhqAvlSBhAV1Ztq332oKESnGSwWgqMsAfFEraW+LwCG9hCE6 cj25ty79FtdzXxXaKrE3Q+bbUpl3yI3vPT7QxdbeEdH0cOnqGjOdzq6pqtM2tXlt+Hes o9m9yFICIHQibkSeuv+rtj0Q6KzAn7bYpj3QbPNAU/QvzBjDqg6oAxHZWhjEgFoo33qC GlpatLO0ZDb1+2vZTvXUZL1eJKGM3Vs2FGZhU6rlWtZ51vzFkOoBOw0Bi3PFXi4fjAmj V2AzYj9O5jZTa1VudSXbglD5W4wa3xdpLUNY4bB/iZ1uzmc+nH3WnHgI29th7l8AweTs 0v7Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:mail-followup-to:message-id:subject:cc:to :from:date:dkim-signature; bh=GSyyfWxsFPdKqXx8kxMbENaK2TvefOwVMS2PapDx8pg=; b=epC7i/D718C8fc5+A6JK4t/DJp4Nt/1d2j74wwp85iEIhSh6oTTn69pRxJ03iAMkBb eNp7R/m6ZE+ZoUKf95PEK/h+O+7Df0W0M7CYVoNTud+cwWeB9c/pRol3J3WwO3N6Bl7I YN/ToAQ18flOQyTU1kgXMywFWkVXTnKil/Xz0SU3qal+Aydla4En1tPUdHau0P2KL0gc XbT183yqanjFYdq7DemkwZoeW7sSwLhsMSknC34nV7xd1Yva2vZJXS0D55U4KlqzGxRw oYJa81IYTmef5TApEL9J/XSf5bJ09jVwgfUQ9i+4M+184Z4DpeauZRtoPJhrTtQZWQPs JZrg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=UFob9vU5; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id z8si7170036plg.614.2022.02.14.10.55.40 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 14 Feb 2022 10:55:41 -0800 (PST) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=UFob9vU5; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 63193A9A61; Mon, 14 Feb 2022 10:54:41 -0800 (PST) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S239064AbiBNR7P (ORCPT + 99 others); Mon, 14 Feb 2022 12:59:15 -0500 Received: from mxb-00190b01.gslb.pphosted.com ([23.128.96.19]:50808 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229878AbiBNR7O (ORCPT ); Mon, 14 Feb 2022 12:59:14 -0500 Received: from mga17.intel.com (mga17.intel.com [192.55.52.151]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6F87F443F7 for ; Mon, 14 Feb 2022 09:59:06 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1644861546; x=1676397546; h=date:from:to:cc:subject:message-id:references: mime-version:in-reply-to; bh=XU044oh5pka0fxROuzgo8HhltCN2bfo2cOcsKP44lsc=; b=UFob9vU5aqHOzv70A8oMeFPtMCJLnc5JtWNm7TB/1o3BY0qhqz/ADW4F zThb+8Wtg3O2kng/3C4VgKYMDitaSVylpSvTneRUo9adpS1n8lLIg+PE+ W60dPw3uLgtYeP5+ssp/BoTdPFQSKv9KUfj3c+RWTUqpLEH9xrwN0o3Nw OWVE17shDRRFheTWVmYFzfz5iHkCMnIYl3MRVQhDAL5d+3ucHXAL+v3qK YKmFeRE6R3gArkMA9UpZqh7VtCT3g4cXjnPVrBKWxqL1bgQqp0a21J3aB zXr5EZq+olDOt3Orx7mrTDkVN6d3fr+6xIyojjFwN/tEvLfPrKaQ+AeGD g==; X-IronPort-AV: E=McAfee;i="6200,9189,10258"; a="230787317" X-IronPort-AV: E=Sophos;i="5.88,368,1635231600"; d="scan'208";a="230787317" Received: from orsmga005.jf.intel.com ([10.7.209.41]) by fmsmga107.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 14 Feb 2022 09:59:06 -0800 X-IronPort-AV: E=Sophos;i="5.88,368,1635231600"; d="scan'208";a="703210078" Received: from iweiny-desk2.sc.intel.com (HELO localhost) ([10.3.52.147]) by orsmga005-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 14 Feb 2022 09:59:05 -0800 Date: Mon, 14 Feb 2022 09:59:05 -0800 From: Ira Weiny To: Tong Zhang Cc: Dan Williams , Vishal Verma , Dave Jiang , nvdimm@lists.linux.dev, linux-kernel@vger.kernel.org Subject: Re: [PATCH] dax: make sure inodes are flushed before destroy cache Message-ID: <20220214175905.GV785175@iweiny-DESK2.sc.intel.com> Mail-Followup-To: Tong Zhang , Dan Williams , Vishal Verma , Dave Jiang , nvdimm@lists.linux.dev, linux-kernel@vger.kernel.org References: <20220212071111.148575-1-ztong0001@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20220212071111.148575-1-ztong0001@gmail.com> User-Agent: Mutt/1.11.1 (2018-12-01) X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Feb 11, 2022 at 11:11:11PM -0800, Tong Zhang wrote: > A bug can be triggered by following command > > $ modprobe nd_pmem && modprobe -r nd_pmem > > [ 10.060014] BUG dax_cache (Not tainted): Objects remaining in dax_cache on __kmem_cache_shutdown() > [ 10.060938] Slab 0x0000000085b729ac objects=9 used=1 fp=0x000000004f5ae469 flags=0x200000000010200(slab|head|node) > [ 10.062433] Call Trace: > [ 10.062673] dump_stack_lvl+0x34/0x44 > [ 10.062865] slab_err+0x90/0xd0 > [ 10.063619] __kmem_cache_shutdown+0x13b/0x2f0 > [ 10.063848] kmem_cache_destroy+0x4a/0x110 > [ 10.064058] __x64_sys_delete_module+0x265/0x300 > > This is caused by dax_fs_exit() not flushing inodes before destroy cache. > To fix this issue, call rcu_barrier() before destroy cache. I don't doubt that this fixes the bug. However, I can't help but think this is hiding a bug, or perhaps a missing step, in the kmem_cache layer? As far as I can see dax does not call call_rcu() and only uses srcu not rcu? I was tempted to suggest srcu_barrier() but dax does not call call_srcu() either. So I'm not clear about what is really going on and why this fixes it. I know that dax is not using srcu is a standard way so perhaps this helps in a way I don't quite grok? If so perhaps a comment here would be in order? Ira > > Signed-off-by: Tong Zhang > --- > drivers/dax/super.c | 1 + > 1 file changed, 1 insertion(+) > > diff --git a/drivers/dax/super.c b/drivers/dax/super.c > index e3029389d809..6bd565fe2e63 100644 > --- a/drivers/dax/super.c > +++ b/drivers/dax/super.c > @@ -476,6 +476,7 @@ static int dax_fs_init(void) > static void dax_fs_exit(void) > { > kern_unmount(dax_mnt); > + rcu_barrier(); > kmem_cache_destroy(dax_cache); > } > > -- > 2.25.1 > >