Received: by 2002:ac0:e34a:0:0:0:0:0 with SMTP id g10csp671770imn; Tue, 26 Jul 2022 06:29:42 -0700 (PDT) X-Google-Smtp-Source: AGRyM1sugWgKZJt+TtGi3fjUbq6cREBICZkoEipTonFB2v+jiB2Qx1LQGScDNzaz1b7q6Rb7aGZH X-Received: by 2002:a17:907:948e:b0:72f:b536:57 with SMTP id dm14-20020a170907948e00b0072fb5360057mr13323599ejc.491.1658842182146; Tue, 26 Jul 2022 06:29:42 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1658842182; cv=none; d=google.com; s=arc-20160816; b=n2qREmz+cPiITNpuCDWIKxe4H/N2GEn7SZFVoEXbTMsce6ahqdzyOu+hd48W8Ru/ho ZksggE/v8aguAhcFEJnUoN8jm5IJfCviGEAjtQ25ELCmgxc62spz6DZG9lCbCgNpKpTc Z6Nrr8NvMrvHw1ZZj0lOHvQE7534o4jbBXBCqcM8Nhg73V4ZyIHxGUXaGaDoTHQFbHSQ fByxGODfkWFsnUHxBrxUlhd8UYRJyk2QSgkpQpaXYRcojPHzu/Ayrbfej1wl4y9CSk+t xU0KuIz7Xoa8R/1WHyRVZehv7/kHOPbak4zMm60uYwCLi3/mIS9ywDe7koeLAIpJSMHj yqJA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to :organization:from:references:cc:to:content-language:subject :user-agent:mime-version:date:message-id:dkim-signature; bh=vrzWkxSAPBHXLRbnyCznISHc9dnhhjZ2sshc1fHvJVY=; b=rcFCqwPiapyuMl4WJ0p+z6VUniiokZtwMcx8mNiWCMYY+GZ/KWrk2lOCGebgckIVA7 uxcVzCgltlCpRLu2sAdliYCliuwqHbApz5IaW8vzrVv3ZfTEj6WW1jgQgzAdUayT89E3 KZjjHyjMK/BRktdemXbQo6P150jgmD6HATAvfqp1u3lULvJzkvxF7QBu3YKSTTX/2pCe gUjDd4G+kiaFk1VfepWNkbnJCpBtCeFrgvyBf5t0DaNhi5R8/JBsmAp4QC18jdUGxxCK z1NwmM4wmHNjGpZWqupLMOkBVR4kIu5oDoTksxGT/JNaXHHohWBn4ApwOqwlV/CxZb9Z LK/A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=BspjvBcv; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id hg4-20020a1709072cc400b00706b9787b43si17189370ejc.319.2022.07.26.06.29.16; Tue, 26 Jul 2022 06:29:42 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=BspjvBcv; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233676AbiGZNMQ (ORCPT + 99 others); Tue, 26 Jul 2022 09:12:16 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:36178 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232549AbiGZNMO (ORCPT ); Tue, 26 Jul 2022 09:12:14 -0400 Received: from mga01.intel.com (mga01.intel.com [192.55.52.88]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id D30AFDFE6 for ; Tue, 26 Jul 2022 06:12:13 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1658841133; x=1690377133; h=message-id:date:mime-version:subject:to:cc:references: from:in-reply-to:content-transfer-encoding; bh=AzPLCeQXDT+9FTdbOU/i6VdONfz+YZYIkJIYf1EfhFc=; b=BspjvBcvpGvaOv22yyDpZuMuiIxEXRKmySSM8f+PQZObzXsI1CH/xurI r+d9q08xwfaKp/gJMuoEV/OT8oeIwte966qHMJazjivZXq60E9PnjKKdR OFagDbpKtA7UF7O/GaK0j7dLSh74ctuz1ffwpsmj7sPkzKfUPPQWYfqL/ i2ByrFOnZhYM2BNr8juzWT1Xg+JiFBJjU8oMx3heOwsRAB0SBRyVLBJEZ te4PJmj5O5hIeIuyQqMDtHHjWhTdNWJztP48KaCBDjQIgn6b74oulMlfM u0zgB1G76KgxCfr31fmFZ6/q78OW90AP0ahKO5SKlkrbzqXxYnj8vYXAs g==; X-IronPort-AV: E=McAfee;i="6400,9594,10419"; a="313715494" X-IronPort-AV: E=Sophos;i="5.93,193,1654585200"; d="scan'208";a="313715494" Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by fmsmga101.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 26 Jul 2022 06:12:13 -0700 X-IronPort-AV: E=Sophos;i="5.93,193,1654585200"; d="scan'208";a="927325787" Received: from atilson-mobl2.ger.corp.intel.com (HELO [10.213.238.124]) ([10.213.238.124]) by fmsmga005-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 26 Jul 2022 06:12:10 -0700 Message-ID: <4f670d9f-c8e6-6c1d-34a8-3f3ef6f35beb@linux.intel.com> Date: Tue, 26 Jul 2022 14:12:09 +0100 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.11.0 Subject: Re: [PATCH v4] drm/i915: stop using swiotlb Content-Language: en-US To: Robert Beckett , Jani Nikula , Joonas Lahtinen , Rodrigo Vivi , David Airlie , Daniel Vetter Cc: kernel@collabora.com, Christoph Hellwig , Thomas Hellstrom , Matthew Auld , Tvrtko Ursulin , intel-gfx@lists.freedesktop.org, dri-devel@lists.freedesktop.org, linux-kernel@vger.kernel.org References: <20220725141833.1970029-1-bob.beckett@collabora.com> From: Tvrtko Ursulin Organization: Intel Corporation UK Plc In-Reply-To: <20220725141833.1970029-1-bob.beckett@collabora.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-5.6 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_EF,HK_RANDOM_ENVFROM,HK_RANDOM_FROM, NICE_REPLY_A,RCVD_IN_DNSWL_HI,RCVD_IN_MSPIKE_H3,RCVD_IN_MSPIKE_WL, SPF_HELO_NONE,SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 25/07/2022 15:18, Robert Beckett wrote: > Calling swiotlb functions directly is nowadays considered harmful. See > https://lore.kernel.org/intel-gfx/20220711082614.GA29487@lst.de/ > > Replace swiotlb_max_segment() calls with dma_max_mapping_size(). > In i915_gem_object_get_pages_internal() no longer consider max_segment > only if CONFIG_SWIOTLB is enabled. There can be other (iommu related) > causes of specific max segment sizes. > > Cc: Christoph Hellwig > Cc: Tvrtko Ursulin > Cc: Thomas Hellstrom > Cc: Matthew Auld > > v2: - restore UINT_MAX clamp in i915_sg_segment_size() > - drop PAGE_SIZE check as it will always be >= PAGE_SIZE > v3: - actually clamp to UINT_MAX in i915_sg_segment_size() > v4: - round down max segment size to PAGE_SIZE > > Reviewed-by: Christoph Hellwig > Reviewed-by: Tvrtko Ursulin > Signed-off-by: Robert Beckett > --- > drivers/gpu/drm/i915/gem/i915_gem_internal.c | 19 ++++--------------- > drivers/gpu/drm/i915/gem/i915_gem_shmem.c | 2 +- > drivers/gpu/drm/i915/gem/i915_gem_ttm.c | 4 ++-- > drivers/gpu/drm/i915/gem/i915_gem_userptr.c | 2 +- > drivers/gpu/drm/i915/i915_scatterlist.h | 17 ++++------------- > 5 files changed, 12 insertions(+), 32 deletions(-) > > diff --git a/drivers/gpu/drm/i915/gem/i915_gem_internal.c b/drivers/gpu/drm/i915/gem/i915_gem_internal.c > index c698f95af15f..24f37658f1bb 100644 > --- a/drivers/gpu/drm/i915/gem/i915_gem_internal.c > +++ b/drivers/gpu/drm/i915/gem/i915_gem_internal.c > @@ -6,7 +6,6 @@ > > #include > #include > -#include > > #include "i915_drv.h" > #include "i915_gem.h" > @@ -38,22 +37,12 @@ static int i915_gem_object_get_pages_internal(struct drm_i915_gem_object *obj) > struct scatterlist *sg; > unsigned int sg_page_sizes; > unsigned int npages; > - int max_order; > + int max_order = MAX_ORDER; > + unsigned int max_segment; > gfp_t gfp; > > - max_order = MAX_ORDER; > -#ifdef CONFIG_SWIOTLB > - if (is_swiotlb_active(obj->base.dev->dev)) { > - unsigned int max_segment; > - > - max_segment = swiotlb_max_segment(); > - if (max_segment) { > - max_segment = max_t(unsigned int, max_segment, > - PAGE_SIZE) >> PAGE_SHIFT; > - max_order = min(max_order, ilog2(max_segment)); > - } > - } > -#endif > + max_segment = i915_sg_segment_size(i915->drm.dev) >> PAGE_SHIFT; > + max_order = min(max_order, ilog2(max_segment)); > > gfp = GFP_KERNEL | __GFP_HIGHMEM | __GFP_RECLAIMABLE; > if (IS_I965GM(i915) || IS_I965G(i915)) { > diff --git a/drivers/gpu/drm/i915/gem/i915_gem_shmem.c b/drivers/gpu/drm/i915/gem/i915_gem_shmem.c > index 4eed3dd90ba8..34b9c76cd8e6 100644 > --- a/drivers/gpu/drm/i915/gem/i915_gem_shmem.c > +++ b/drivers/gpu/drm/i915/gem/i915_gem_shmem.c > @@ -194,7 +194,7 @@ static int shmem_get_pages(struct drm_i915_gem_object *obj) > struct intel_memory_region *mem = obj->mm.region; > struct address_space *mapping = obj->base.filp->f_mapping; > const unsigned long page_count = obj->base.size / PAGE_SIZE; > - unsigned int max_segment = i915_sg_segment_size(); > + unsigned int max_segment = i915_sg_segment_size(i915->drm.dev); > struct sg_table *st; > struct sgt_iter sgt_iter; > struct page *page; > diff --git a/drivers/gpu/drm/i915/gem/i915_gem_ttm.c b/drivers/gpu/drm/i915/gem/i915_gem_ttm.c > index 5a5cf332d8a5..7a828c9c0f6d 100644 > --- a/drivers/gpu/drm/i915/gem/i915_gem_ttm.c > +++ b/drivers/gpu/drm/i915/gem/i915_gem_ttm.c > @@ -189,7 +189,7 @@ static int i915_ttm_tt_shmem_populate(struct ttm_device *bdev, > struct drm_i915_private *i915 = container_of(bdev, typeof(*i915), bdev); > struct intel_memory_region *mr = i915->mm.regions[INTEL_MEMORY_SYSTEM]; > struct i915_ttm_tt *i915_tt = container_of(ttm, typeof(*i915_tt), ttm); > - const unsigned int max_segment = i915_sg_segment_size(); > + const unsigned int max_segment = i915_sg_segment_size(i915->drm.dev); > const size_t size = (size_t)ttm->num_pages << PAGE_SHIFT; > struct file *filp = i915_tt->filp; > struct sgt_iter sgt_iter; > @@ -568,7 +568,7 @@ static struct i915_refct_sgt *i915_ttm_tt_get_st(struct ttm_tt *ttm) > ret = sg_alloc_table_from_pages_segment(st, > ttm->pages, ttm->num_pages, > 0, (unsigned long)ttm->num_pages << PAGE_SHIFT, > - i915_sg_segment_size(), GFP_KERNEL); > + i915_sg_segment_size(i915_tt->dev), GFP_KERNEL); > if (ret) { > st->sgl = NULL; > return ERR_PTR(ret); > diff --git a/drivers/gpu/drm/i915/gem/i915_gem_userptr.c b/drivers/gpu/drm/i915/gem/i915_gem_userptr.c > index 094f06b4ce33..dfc35905dba2 100644 > --- a/drivers/gpu/drm/i915/gem/i915_gem_userptr.c > +++ b/drivers/gpu/drm/i915/gem/i915_gem_userptr.c > @@ -129,7 +129,7 @@ static void i915_gem_object_userptr_drop_ref(struct drm_i915_gem_object *obj) > static int i915_gem_userptr_get_pages(struct drm_i915_gem_object *obj) > { > const unsigned long num_pages = obj->base.size >> PAGE_SHIFT; > - unsigned int max_segment = i915_sg_segment_size(); > + unsigned int max_segment = i915_sg_segment_size(obj->base.dev->dev); > struct sg_table *st; > unsigned int sg_page_sizes; > struct page **pvec; > diff --git a/drivers/gpu/drm/i915/i915_scatterlist.h b/drivers/gpu/drm/i915/i915_scatterlist.h > index 9ddb3e743a3e..1377a97a20cf 100644 > --- a/drivers/gpu/drm/i915/i915_scatterlist.h > +++ b/drivers/gpu/drm/i915/i915_scatterlist.h > @@ -9,7 +9,7 @@ > > #include > #include > -#include > +#include > > #include "i915_gem.h" > > @@ -127,19 +127,10 @@ static inline unsigned int i915_sg_dma_sizes(struct scatterlist *sg) > return page_sizes; > } > > -static inline unsigned int i915_sg_segment_size(void) > +static inline unsigned int i915_sg_segment_size(struct device *dev) > { > - unsigned int size = swiotlb_max_segment(); > - > - if (size == 0) > - size = UINT_MAX; > - > - size = rounddown(size, PAGE_SIZE); > - /* swiotlb_max_segment_size can return 1 byte when it means one page. */ > - if (size < PAGE_SIZE) > - size = PAGE_SIZE; > - > - return size; > + size_t max = min_t(size_t, UINT_MAX, dma_max_mapping_size(dev)); Hope your patience is not growing thin but blank line here as per checkpatch. I could fudge when applying though. Otherwise it finally passed CI, yay! Now do we dig deeper to a) understand how/where it failed and b) check what are the possible return values from dma_max_mapping_size(), or call it done? Regards, Tvrtko > + return round_down(max, PAGE_SIZE); > } > > bool i915_sg_trim(struct sg_table *orig_st);