Received: by 2002:a05:6a10:af89:0:0:0:0 with SMTP id iu9csp5546798pxb; Wed, 26 Jan 2022 14:41:56 -0800 (PST) X-Google-Smtp-Source: ABdhPJzcQlJpjpTcvn6hwbUM33YoRH9b2udN8+6DOlCHO529h6FPOmOMZMsuev6ShQewGuB2i7WO X-Received: by 2002:a05:6402:34c5:: with SMTP id w5mr1124183edc.226.1643236916058; Wed, 26 Jan 2022 14:41:56 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1643236916; cv=none; d=google.com; s=arc-20160816; b=gJUCHxCn9y4kIYXp9I7L5RSadMGXqTeZk/DeyMEryO/oNVb1whQVHwtdDEVOlOhe01 9kPyu+uZO+j4WI5uK5JvrRlrMc28VX9wb82ZljWndUzlPB34TbZHcwul+HnyjMimrJj0 H0mWQjkNkYet/ETwPQg3DDBVtp6K2qMAy1jrV8Wlf5SzBAMSn4eQSWu93iXFAZL9mbQx q29c+T4r1W2Z2r8xhMqITSjWowIf8Yiod/nDcn3I1DNZTKIzikiQjDNT/cNcnnYAGRa8 w0MClTscnM0IdaKpf2rYmaSzux9OBT6IfvGoCu7/JhIcYIOsQT+0aOgJT4W1Dsg3PzHL qQLA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from:dkim-signature; bh=qHlhdS8aDRACLLmfosDzZlVPHLUXdH2upSIE5s80FsI=; b=VKgKjv7va/YoAoBwqywp8Ky1J5BvFQH+hpuK7H6fbwBVZAMEQiIweOclc12ga6BXxH i8OyJhKx2J8Kc9ec3IpepisuxYqjKnm3SMmMkzfhlthk2PD3fxddQdoPPzGXUqLGsB9k t31DpC0eg+RqxipUrkdN+NeIXAO7cM1HwYz3Y7qCpVBIzqOWY7r7QN0KsNSlElcCQzhY xfGL0njfdyLqOc94/wiwo2ZxwUzrNXdU/b+JjM+PAJeMda6HEvaKGnwx3QKssI32f6+S 7/hIZka5XjYuoA8aqC58WQoB8HQDBWYb/yqsoviqQiJDgjRasLeuh4jVolsNw/AOz7G4 dlVA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=FeGoxmCI; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id gb11si386332ejc.684.2022.01.26.14.41.30; Wed, 26 Jan 2022 14:41:56 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=FeGoxmCI; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231231AbiAZUgk (ORCPT + 99 others); Wed, 26 Jan 2022 15:36:40 -0500 Received: from mga01.intel.com ([192.55.52.88]:9982 "EHLO mga01.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230489AbiAZUgd (ORCPT ); Wed, 26 Jan 2022 15:36:33 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1643229393; x=1674765393; h=from:to:cc:subject:date:message-id:mime-version: content-transfer-encoding; bh=m6a7ksHqrFBRFuzuDvYBCJL8gq6YspyfPiiCWbUNdnw=; b=FeGoxmCIwdbQsyjBovX/jVsnADXq5WgfQzRF9f3Lkg9f/FWbh7xDTBZQ HcXbWqQd3GqzhV3hff32BZ2nWblMsYcvHjppj4Krdxtrk8ou97h8d5Azi o18MAMEp92qmjQ/9dKUogBvavkMN9pK6d75l431oC83sRgI4dWC4cTFhx 6SpKVto/chapLcEYFEX2zRdsJaqkg09XZHj8A96ZDKkGTMZzZ1HW9t5ze 0Jalp/E1OpC8jAT6HUQMLnoeKrzEpSIuE3aM2wP0Sa7sdbrv46A6dzR+m c/HTf68LCUUgmu/bYzd/CAaGAvJC9FerREWnXTAA460aLegYZqhPClXd1 g==; X-IronPort-AV: E=McAfee;i="6200,9189,10239"; a="271098460" X-IronPort-AV: E=Sophos;i="5.88,319,1635231600"; d="scan'208";a="271098460" Received: from fmsmga008.fm.intel.com ([10.253.24.58]) by fmsmga101.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 26 Jan 2022 12:36:22 -0800 X-IronPort-AV: E=Sophos;i="5.88,319,1635231600"; d="scan'208";a="581221487" Received: from lucas-s2600cw.jf.intel.com ([10.165.21.202]) by fmsmga008-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 26 Jan 2022 12:36:21 -0800 From: Lucas De Marchi To: intel-gfx@lists.freedesktop.org Cc: dri-devel@lists.freedesktop.org, Matt Roper , linux-media@vger.kernel.org, linaro-mm-sig@lists.linaro.org, linux-kernel@vger.kernel.org, =?UTF-8?q?Christian=20K=C3=B6nig?= , Daniel Vetter , Daniele Ceraolo Spurio , David Airlie , John Harrison , Joonas Lahtinen , Maarten Lankhorst , Matthew Auld , Matthew Brost , Sumit Semwal , =?UTF-8?q?Thomas=20Hellstr=C3=B6m?= , Tvrtko Ursulin Subject: [PATCH 00/19] drm/i915/guc: Refactor ADS access to use dma_buf_map Date: Wed, 26 Jan 2022 12:36:43 -0800 Message-Id: <20220126203702.1784589-1-lucas.demarchi@intel.com> X-Mailer: git-send-email 2.35.0 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org While porting i915 to arm64 we noticed some issues accessing lmem. Some writes were getting corrupted and the final state of the buffer didn't have exactly what we wrote. This became evident when enabling GuC submission: depending on the number of engines the ADS struct was being corrupted and GuC would reject it, refusin to initialize. From Documentation/core-api/bus-virt-phys-mapping.rst: This memory is called "PCI memory" or "shared memory" or "IO memory" or whatever, and there is only one way to access it: the readb/writeb and related functions. You should never take the address of such memory, because there is really nothing you can do with such an address: it's not conceptually in the same memory space as "real memory" at all, so you cannot just dereference a pointer. (Sadly, on x86 it **is** in the same memory space, so on x86 it actually works to just deference a pointer, but it's not portable). When reading or writing words directly to IO memory, in order to be portable the Linux kernel provides the abstraction detailed in section "Differences between I/O access functions" of Documentation/driver-api/device-io.rst. This limits our ability to simply overlay our structs on top a buffer and directly access it since that buffer may come from IO memory rather than system memory. Hence the approach taken in intel_guc_ads.c needs to be refactored. This is not the only place in i915 that neeed to be changed, but the one causing the most problems, with a real reproducer. This first set of patch focuses on fixing the gem object to pass the ADS After the addition of a few helpers in the dma_buf_map API, most of intel_guc_ads.c can be converted to use it. The exception is the regset initialization: we'd incur into a lot of extra indirection when reading/writting each register. So the regset is converted to use a temporary buffer allocated on probe, which is then copied to its final location when finishing the initialization or on gt reset. Testing on some discrete cards, after this change we can correctly pass the ADS struct to GuC and have it initialized correctly. thanks Lucas De Marchi Cc: linux-media@vger.kernel.org Cc: dri-devel@lists.freedesktop.org Cc: linaro-mm-sig@lists.linaro.org Cc: linux-kernel@vger.kernel.org Cc: Christian König Cc: Daniel Vetter Cc: Daniele Ceraolo Spurio Cc: David Airlie Cc: John Harrison Cc: Joonas Lahtinen Cc: Maarten Lankhorst Cc: Matt Roper Cc: Matthew Auld Cc: Matthew Brost Cc: Sumit Semwal Cc: Thomas Hellström Cc: Tvrtko Ursulin Lucas De Marchi (19): dma-buf-map: Add read/write helpers dma-buf-map: Add helper to initialize second map drm/i915/gt: Add helper for shmem copy to dma_buf_map drm/i915/guc: Keep dma_buf_map of ads_blob around drm/i915/guc: Add read/write helpers for ADS blob drm/i915/guc: Convert golden context init to dma_buf_map drm/i915/guc: Convert policies update to dma_buf_map drm/i915/guc: Convert engine record to dma_buf_map dma-buf-map: Add wrapper over memset drm/i915/guc: Convert guc_ads_private_data_reset to dma_buf_map drm/i915/guc: Convert golden context prep to dma_buf_map drm/i915/guc: Replace check for golden context size drm/i915/guc: Convert mapping table to dma_buf_map drm/i915/guc: Convert capture list to dma_buf_map drm/i915/guc: Prepare for error propagation drm/i915/guc: Use a single pass to calculate regset drm/i915/guc: Convert guc_mmio_reg_state_init to dma_buf_map drm/i915/guc: Convert __guc_ads_init to dma_buf_map drm/i915/guc: Remove plain ads_blob pointer drivers/gpu/drm/i915/gt/shmem_utils.c | 32 ++ drivers/gpu/drm/i915/gt/shmem_utils.h | 3 + drivers/gpu/drm/i915/gt/uc/intel_guc.h | 14 +- drivers/gpu/drm/i915/gt/uc/intel_guc_ads.c | 374 +++++++++++------- drivers/gpu/drm/i915/gt/uc/intel_guc_ads.h | 3 +- .../gpu/drm/i915/gt/uc/intel_guc_submission.c | 11 +- include/linux/dma-buf-map.h | 127 ++++++ 7 files changed, 405 insertions(+), 159 deletions(-) -- 2.35.0