Received: by 10.223.148.5 with SMTP id 5csp6363233wrq; Wed, 17 Jan 2018 12:42:32 -0800 (PST) X-Google-Smtp-Source: ACJfBovCvEUAbLFYLY4CTu9Ozmpyf9RCLNQhvte+pJIQHsPGeN6hyrad/TNvrgV7uBLsfLMPcsVg X-Received: by 10.84.236.68 with SMTP id h4mr16590116pln.373.1516221752404; Wed, 17 Jan 2018 12:42:32 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1516221752; cv=none; d=google.com; s=arc-20160816; b=BNam3ovvXvpvqbrUkwoUipx2dwrIJwDWMD7BrftplZh02viGJCjwA/UwUMzFtCAgzm +RxswvGp2r9GAvnrVnQzM7RFRJTeaFJYualzyoAi9ASORfipAsJlKN49/WonBvaxubyd g+KguXiQFuqoMmflJpL3WVDKBMD8P3fpZ3xF1jq87Wl1+Hr5/LkzGhfUmdIColJRZBQr KHpu48f/GbM3tv1nNPr/exMylDUl2oeZBGgnEbJLn7TFnmjRjmRncPfMNqaagmNn+1kb gpk9MzyJhEv2jz6bJC8gpVEy+UbVlNjoliGJQSEP+6G752XyTvtJq40bJD0Ud5ArF/// aJVw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:references:in-reply-to:message-id:date :subject:cc:to:from:dkim-signature:arc-authentication-results; bh=UB9rtjq79F+pSnhpZzMuKVYqDm8IxbUCeV3KsiaqQkM=; b=ibBEi1F9J9+X5lkhOzgAXGaIX33nJdCSfdVdfLYHSrX/GSLZlhEupq0ix4ndwEnYHd Bt4XPy8VqeH26GL7usHaTCdzSadx11wzq5V98q4s7pkC9BhkccBIGsIwQGLTh6QyVLCr 63GNys2EIMFh70guzK4U+L2XykRl3wmw69GX/Y90V7XRCckJNHmr3hl34WCXtdtGQ+by ZsFKsmuA29BnEcxB6MvCwDhBDMwr8i/ae3HFywH9PmWQiOYlkAS2gx0zAc+/n1pIoWcU TSNrJUIoGUrjPNmspsQXTVYfCpDYTOoj3z6kScAuTOMnAktwpPdva7UNntO9dpDXts86 PKmg== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@infradead.org header.s=bombadil.20170209 header.b=So+CNmze; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id o62si4441919pga.342.2018.01.17.12.42.18; Wed, 17 Jan 2018 12:42:32 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=fail header.i=@infradead.org header.s=bombadil.20170209 header.b=So+CNmze; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754601AbeAQUkv (ORCPT + 99 others); Wed, 17 Jan 2018 15:40:51 -0500 Received: from bombadil.infradead.org ([65.50.211.133]:54649 "EHLO bombadil.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753772AbeAQUWx (ORCPT ); Wed, 17 Jan 2018 15:22:53 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=infradead.org; s=bombadil.20170209; h=References:In-Reply-To:Message-Id: Date:Subject:Cc:To:From:Sender:Reply-To:MIME-Version:Content-Type: Content-Transfer-Encoding:Content-ID:Content-Description:Resent-Date: Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Id: List-Help:List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=UB9rtjq79F+pSnhpZzMuKVYqDm8IxbUCeV3KsiaqQkM=; b=So+CNmzeBhDh/WL00l7dnMU9w ep/AKipifiGBDjQ1yQHFVzbjaCGyHl5v/Qjee9dRGIqANpV0pADciM+bICo45F8v2RKJvqsvyIz70 pyQPcsHblKXUUeMpdDYMlLGuvtcMXttLNKveewFcOiJQeyvkOeYfFGiau3daaJllpYPsW7Q2oPRvF nqHN+szlzigGSZHJ+JXLIRmkIajsPLof/ZjFKddfsTGRTV8CSp6cQeHLVMLlOPsl1HrCn3I6x8w3d 2oEheuhDGyQFnld5oDZ9Aolu6ORnwtlhUpsGtR7piSeSu0hr8fc4/I99nVGGrwoOvxQJLEZE+CEEv Egh5V1Trg==; Received: from willy by bombadil.infradead.org with local (Exim 4.89 #1 (Red Hat Linux)) id 1ebuEV-00066V-U0; Wed, 17 Jan 2018 20:22:51 +0000 From: Matthew Wilcox To: linux-kernel@vger.kernel.org Cc: Matthew Wilcox , linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, linux-f2fs-devel@lists.sourceforge.net, linux-nilfs@vger.kernel.org, linux-btrfs@vger.kernel.org, linux-xfs@vger.kernel.org, linux-usb@vger.kernel.org, Bjorn Andersson , Stefano Stabellini , iommu@lists.linux-foundation.org, linux-remoteproc@vger.kernel.org, linux-s390@vger.kernel.org, intel-gfx@lists.freedesktop.org, cgroups@vger.kernel.org, linux-sh@vger.kernel.org, David Howells Subject: [PATCH v6 64/99] dax: Convert grab_mapping_entry to XArray Date: Wed, 17 Jan 2018 12:21:28 -0800 Message-Id: <20180117202203.19756-65-willy@infradead.org> X-Mailer: git-send-email 2.14.3 In-Reply-To: <20180117202203.19756-1-willy@infradead.org> References: <20180117202203.19756-1-willy@infradead.org> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Matthew Wilcox Signed-off-by: Matthew Wilcox --- fs/dax.c | 98 +++++++++++++++++----------------------------------------------- 1 file changed, 26 insertions(+), 72 deletions(-) diff --git a/fs/dax.c b/fs/dax.c index 494e8fb7a98f..3eb0cf176d69 100644 --- a/fs/dax.c +++ b/fs/dax.c @@ -44,6 +44,7 @@ /* The 'colour' (ie low bits) within a PMD of a page offset. */ #define PG_PMD_COLOUR ((PMD_SIZE >> PAGE_SHIFT) - 1) +#define PMD_ORDER (PMD_SHIFT - PAGE_SHIFT) static wait_queue_head_t wait_table[DAX_WAIT_TABLE_ENTRIES]; @@ -89,10 +90,10 @@ static void *dax_radix_locked_entry(sector_t sector, unsigned long flags) DAX_ENTRY_LOCK); } -static unsigned int dax_radix_order(void *entry) +static unsigned int dax_entry_order(void *entry) { if (xa_to_value(entry) & DAX_PMD) - return PMD_SHIFT - PAGE_SHIFT; + return PMD_ORDER; return 0; } @@ -299,10 +300,11 @@ static void *grab_mapping_entry(struct address_space *mapping, pgoff_t index, { XA_STATE(xas, &mapping->pages, index); bool pmd_downgrade = false; /* splitting 2MiB entry into 4k entries? */ - void *entry, **slot; + void *entry; + xas_set_order(&xas, index, size_flag ? PMD_ORDER : 0); restart: - xa_lock_irq(&mapping->pages); + xas_lock_irq(&xas); entry = get_unlocked_mapping_entry(&xas); if (WARN_ON_ONCE(entry && !xa_is_value(entry))) { @@ -326,84 +328,36 @@ static void *grab_mapping_entry(struct address_space *mapping, pgoff_t index, } } - /* No entry for given index? Make sure radix tree is big enough. */ - if (!entry || pmd_downgrade) { - int err; - - if (pmd_downgrade) { - /* - * Make sure 'entry' remains valid while we drop - * xa_lock. - */ - entry = lock_slot(&xas); - } - - xa_unlock_irq(&mapping->pages); + if (pmd_downgrade) { + entry = lock_slot(&xas); /* * Besides huge zero pages the only other thing that gets * downgraded are empty entries which don't need to be * unmapped. */ - if (pmd_downgrade && dax_is_zero_entry(entry)) + if (dax_is_zero_entry(entry)) { + xas_pause(&xas); + xas_unlock_irq(&xas); unmap_mapping_range(mapping, (index << PAGE_SHIFT) & PMD_MASK, PMD_SIZE, 0); - - err = radix_tree_preload( - mapping_gfp_mask(mapping) & ~__GFP_HIGHMEM); - if (err) { - if (pmd_downgrade) - put_locked_mapping_entry(mapping, index); - return ERR_PTR(err); + xas_lock_irq(&xas); } - xa_lock_irq(&mapping->pages); - - if (!entry) { - /* - * We needed to drop the pages lock while calling - * radix_tree_preload() and we didn't have an entry to - * lock. See if another thread inserted an entry at - * our index during this time. - */ - entry = __radix_tree_lookup(&mapping->pages, index, - NULL, &slot); - if (entry) { - radix_tree_preload_end(); - xa_unlock_irq(&mapping->pages); - goto restart; - } - } - - if (pmd_downgrade) { - radix_tree_delete(&mapping->pages, index); - mapping->nrexceptional--; - dax_wake_entry(&xas, entry, true); - } - + xas_store(&xas, NULL); + mapping->nrexceptional--; + dax_wake_entry(&xas, entry, true); + } + if (!entry || pmd_downgrade) { entry = dax_radix_locked_entry(0, size_flag | DAX_EMPTY); - - err = __radix_tree_insert(&mapping->pages, index, - dax_radix_order(entry), entry); - radix_tree_preload_end(); - if (err) { - xa_unlock_irq(&mapping->pages); - /* - * Our insertion of a DAX entry failed, most likely - * because we were inserting a PMD entry and it - * collided with a PTE sized entry at a different - * index in the PMD range. We haven't inserted - * anything into the radix tree and have no waiters to - * wake. - */ - return ERR_PTR(err); - } - /* Good, we have inserted empty locked entry into the tree. */ - mapping->nrexceptional++; - xa_unlock_irq(&mapping->pages); - return entry; + xas_store(&xas, entry); + if (!xas_error(&xas)) + mapping->nrexceptional++; + } else { + entry = lock_slot(&xas); } - entry = lock_slot(&xas); out_unlock: - xa_unlock_irq(&mapping->pages); + xas_unlock_irq(&xas); + if (xas_nomem(&xas, GFP_NOIO)) + goto restart; return entry; } @@ -682,7 +636,7 @@ static int dax_writeback_one(struct block_device *bdev, * worry about partial PMD writebacks. */ sector = dax_radix_sector(entry); - size = PAGE_SIZE << dax_radix_order(entry); + size = PAGE_SIZE << dax_entry_order(entry); id = dax_read_lock(); ret = bdev_dax_pgoff(bdev, sector, size, &pgoff); -- 2.15.1