Received: by 2002:ac0:a594:0:0:0:0:0 with SMTP id m20-v6csp5081745imm; Mon, 14 May 2018 19:16:25 -0700 (PDT) X-Google-Smtp-Source: AB8JxZq9402/CnG1/D3bY34aIl6LyB5dzM1ctrjHByP7OBROfTGzFqg9vMhbMSnAbAgl5wjhRkNI X-Received: by 2002:a62:3a1c:: with SMTP id h28-v6mr12916427pfa.209.1526350585680; Mon, 14 May 2018 19:16:25 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1526350585; cv=none; d=google.com; s=arc-20160816; b=caVfk/B5ebqitFoQ0zSNifCZbQ74R+CYzP74SkCWKtIt6BCSXYFPGfVUSJAXQTkBhX k1ojaDTy+j5VLVyIYEolmw9hahrbCZD8554Hnqfvaei+BckuOM30keo8Zy6OtLqNZ+re CvPvCg+4BoQxwePYyoXlw1N463qf5S65R+v2aINQJ+xWn4CgL3LnZoJ54+Ks68IQJ6jM wTHHFsqBELAxSLR/dVScMa/sLJl34lkjKAuxs9nerTjEN+snse7ebLe4eU7poTWbRXOV 7S+F2+6yiKrzsHnkkEXO8QH5BcAX3pRoFwODu2zEO00cURFlAJvn0rHOo/cmWDxFhkuB y2jA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:date:subject:cc:to:from :arc-authentication-results; bh=XC4yaeQkL8aZfqwho7WxoD6T+UzD+qcBsMySQ28GGwc=; b=tQml/j3oHDS4VOxPfuS3trL3csNc5fQTmJQWGf9Dd+pn/3VGtOH1UjqXnEjDwYhPhE GuWfCdlaN5BhGVJRlaiAI+ooSEZtxZP0VeNFHjJbc+OgernZyv58Rk0XX5LsZXLon8b7 kFzWDBIo6hBkJ/LadTLeI1gmSzLM033jQEosd570oDhMWZLOGtpoG6LQ0bK2aOc2J2PJ OCtA/Yw1vIO8rL23HU2fV8pVLAOf8HAcIjAV/v+P5T+Ec9vyGGUNUaoON4fTBQVcr2ub NfBnB6tEsIZriJ84uEJWqNILcsQqJnkX0wCA65PocPlxWTZL/Mp4FKhwjd8og4gGzP0S 0ubg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e10-v6si8568045pgo.397.2018.05.14.19.16.11; Mon, 14 May 2018 19:16:25 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752162AbeEOCPy (ORCPT + 99 others); Mon, 14 May 2018 22:15:54 -0400 Received: from smtp4.ccs.ornl.gov ([160.91.203.40]:39398 "EHLO smtp4.ccs.ornl.gov" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752009AbeEOCPx (ORCPT ); Mon, 14 May 2018 22:15:53 -0400 Received: from star.ccs.ornl.gov (star.ccs.ornl.gov [160.91.202.134]) by smtp4.ccs.ornl.gov (Postfix) with ESMTP id B403E1005166; Mon, 14 May 2018 22:15:52 -0400 (EDT) Received: by star.ccs.ornl.gov (Postfix, from userid 2004) id A92C3B8; Mon, 14 May 2018 22:15:52 -0400 (EDT) From: James Simmons To: Greg Kroah-Hartman , devel@driverdev.osuosl.org, Andreas Dilger , Oleg Drokin , NeilBrown Cc: Linux Kernel Mailing List , Lustre Development List , Lai Siyao , James Simmons Subject: [PATCH v2] staging: lustre: obdclass: change object lookup to no wait mode Date: Mon, 14 May 2018 22:15:48 -0400 Message-Id: <1526350548-4402-1-git-send-email-jsimmons@infradead.org> X-Mailer: git-send-email 1.8.3.1 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Lai Siyao Currently we set LU_OBJECT_HEARD_BANSHEE on object when we want to remove object from cache, but this may lead to deadlock, because when other process lookup such object, it needs to wait for this object until release (done at last refcount put), while that process maybe already hold an LDLM lock. Now that current code can handle dying object correctly, we can just return such object in lookup, thus the above deadlock can be avoided. Signed-off-by: Lai Siyao Intel-bug-id: https://jira.hpdd.intel.com/browse/LU-9049 Reviewed-on: https://review.whamcloud.com/26965 Reviewed-by: Alex Zhuravlev Tested-by: Cliff White Reviewed-by: Fan Yong Reviewed-by: Oleg Drokin Signed-off-by: James Simmons --- Changelog: v1) Initial patch that didn't apply to staging-testing branch v2) Rebased after Neil's patches landed. Remove unlikely() test as requested by Dan Carpenter drivers/staging/lustre/lustre/obdclass/lu_object.c | 39 +++++----------------- 1 file changed, 9 insertions(+), 30 deletions(-) diff --git a/drivers/staging/lustre/lustre/obdclass/lu_object.c b/drivers/staging/lustre/lustre/obdclass/lu_object.c index f14e350..e0abd4f 100644 --- a/drivers/staging/lustre/lustre/obdclass/lu_object.c +++ b/drivers/staging/lustre/lustre/obdclass/lu_object.c @@ -593,15 +593,10 @@ static struct lu_object *htable_lookup(struct lu_site *s, const struct lu_fid *f, __u64 *version) { - struct cfs_hash *hs = s->ls_obj_hash; struct lu_site_bkt_data *bkt; struct lu_object_header *h; struct hlist_node *hnode; - __u64 ver; - wait_queue_entry_t waiter; - -retry: - ver = cfs_hash_bd_version_get(bd); + u64 ver = cfs_hash_bd_version_get(bd); if (*version == ver) return ERR_PTR(-ENOENT); @@ -618,31 +613,13 @@ static struct lu_object *htable_lookup(struct lu_site *s, } h = container_of(hnode, struct lu_object_header, loh_hash); - if (likely(!lu_object_is_dying(h))) { - cfs_hash_get(s->ls_obj_hash, hnode); - lprocfs_counter_incr(s->ls_stats, LU_SS_CACHE_HIT); - if (!list_empty(&h->loh_lru)) { - list_del_init(&h->loh_lru); - percpu_counter_dec(&s->ls_lru_len_counter); - } - return lu_object_top(h); + cfs_hash_get(s->ls_obj_hash, hnode); + lprocfs_counter_incr(s->ls_stats, LU_SS_CACHE_HIT); + if (!list_empty(&h->loh_lru)) { + list_del_init(&h->loh_lru); + percpu_counter_dec(&s->ls_lru_len_counter); } - - /* - * Lookup found an object being destroyed this object cannot be - * returned (to assure that references to dying objects are eventually - * drained), and moreover, lookup has to wait until object is freed. - */ - - init_waitqueue_entry(&waiter, current); - add_wait_queue(&bkt->lsb_marche_funebre, &waiter); - set_current_state(TASK_UNINTERRUPTIBLE); - lprocfs_counter_incr(s->ls_stats, LU_SS_CACHE_DEATH_RACE); - cfs_hash_bd_unlock(hs, bd, 1); - schedule(); - remove_wait_queue(&bkt->lsb_marche_funebre, &waiter); - cfs_hash_bd_lock(hs, bd, 1); - goto retry; + return lu_object_top(h); } /** @@ -683,6 +660,8 @@ static void lu_object_limit(const struct lu_env *env, struct lu_device *dev) } /** + * Core logic of lu_object_find*() functions. + * * Much like lu_object_find(), but top level device of object is specifically * \a dev rather than top level device of the site. This interface allows * objects of different "stacking" to be created within the same site. -- 1.8.3.1