Received: by 2002:a05:6358:1087:b0:cb:c9d3:cd90 with SMTP id j7csp8519804rwi; Tue, 25 Oct 2022 07:39:02 -0700 (PDT) X-Google-Smtp-Source: AMsMyM6+uIbp8dLyvupft9MykfCdc6zCfeI/oaFg5uFL0zGfdfJlyk8zBK2rF5OjyLxnnBC1SxOC X-Received: by 2002:a65:6856:0:b0:46e:bcc2:8b36 with SMTP id q22-20020a656856000000b0046ebcc28b36mr19018254pgt.378.1666708742687; Tue, 25 Oct 2022 07:39:02 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1666708742; cv=none; d=google.com; s=arc-20160816; b=c/3e7ub2zUr2GN+IVL9o4ATOJ5saEHdxdr8Na2qqal25kEhlRsaWnYI6VkQu6CutuE w4aeQmccUIHNZOp9T5kZD1RFDbjKL2mOebmDGiwuM+LQJrzlDOKfn4ZGfz3kRA/AnpaP 577aOipb3Ux/YtRSH25Imcl3yukMIAFfxL7oYZ79vHLfwdsTIrnYDE2X6jtrfqom0KIJ psAA/fqsSGYoYzDlvStbeLGUWt7RY9VFHidVzdB5pHGgN8eaGbyRXsEFPckg4kc0VDAQ CI5qO9ywrmTky5CrwRrDYvBpprsV1AaK8Glw1MqWqpP3rf7JHZsKrsMWMu+N3hIsG0WZ GKvA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=9p8b3r4r5R88ziLyc8TQ9IiZL1OeYDsYpURNkbKqfCk=; b=rPjbli5wfLdoOMjBc9D1Qqsb05GsuaMUjPtYIImy6hHjQjClFcEgju96CxQll+Re69 cSCaO07Li7SkLmceZz9lHAT0o6AKP5+KC1FQ0r981oIfNm/i0c8kzU5bUZyJRy4II9zj jANxpllc7NLH+koF4Ib3ZAlCUHomnEbirlFfgoPcay1kq9/7BN5Vl1t9HeczjxZxBGjd zXh3CKxFAVx3nkMwRud6eUI8jBpVs9z7joF3gXHlb9nCPBALNNvjY1PZs6786OnkoyPj XnnJX4gfd2srRe1UgvCFBxpJHgMBVIWeL1Q25i3WAT3aqeIJBnQRrg/d/m/Tnf5AcbZL 2MwQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=BwEjBo8P; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id i14-20020a63e90e000000b0044d72a10ab0si3182224pgh.342.2022.10.25.07.38.49; Tue, 25 Oct 2022 07:39:02 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=BwEjBo8P; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231709AbiJYOVy (ORCPT + 99 others); Tue, 25 Oct 2022 10:21:54 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:49400 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232511AbiJYOVt (ORCPT ); Tue, 25 Oct 2022 10:21:49 -0400 Received: from mail-ot1-x32c.google.com (mail-ot1-x32c.google.com [IPv6:2607:f8b0:4864:20::32c]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 313BE43E6A for ; Tue, 25 Oct 2022 07:21:47 -0700 (PDT) Received: by mail-ot1-x32c.google.com with SMTP id cb2-20020a056830618200b00661b6e5dcd8so7785451otb.8 for ; Tue, 25 Oct 2022 07:21:47 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=9p8b3r4r5R88ziLyc8TQ9IiZL1OeYDsYpURNkbKqfCk=; b=BwEjBo8P6bns8/5Dw+Ak9i3INaSL07KKFzCbBHV0ZDUpuIYq9lxoAdu8fMawt4JwML CAxyLuzSoTsoix10JAiVA9GgCWjNmaS/RXMLqHuvC4zmtCTsN0z3QnN9i7T8kEQeP6dL 3qSsTmRB3cR+iALMV7dB+5tv3Y5lAFaO3xA24cijN/uHkUo3v4jKaWZmZOnmyP3ubWk/ Co7EL2PG3bdzsy+rdBGHzrIpK2QXXplE6PlXRCnEQSS050GNQtJS0cwTE7ausKfzheAi OjLwOODPw5cJiv10pIUypDx5Dzxy+PElu2H4X/bsP6h3wyyx6t6ddZ94/MPDMJKoMdUk HwjQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=9p8b3r4r5R88ziLyc8TQ9IiZL1OeYDsYpURNkbKqfCk=; b=0VzQF2ts5mUvO5EFERmveanQXGskBv25evQr6hYWDvMZmj2wUsJr30Htv87ObgaIwL PRC9miOZ3cUgQfFPvsyuHsXuhRvT2P7Iz4sOtXJJ0AyBamoHbiTIMZHKYNV+kM0fJFV2 eJ7k6eh6vunEjANocP6WcCmtLHpteVwnsiOem5hbDvaMaYselwW4NmCLwT2NeFn/wz8m pvXhlIvDfYFiacbiaiDkPIcldV1cuC1DN9FyBm2zyPfzvxTPprhLnkBxI3asz5n34tXN NA64QOlP+dYTsb9ypCOSF4uQL8OzpB1XsU+iaCHOJ1+Lkin0UbBs5++5McWCI7YbeGfQ hmIg== X-Gm-Message-State: ACrzQf3uPB6CmZmL5eb+GHZEhje4bqG5pmuGN6UDTARYdjzGfcqpmG+5 gp5Jlz9+UMj9FzvfVMI1g6inp95u1r9sLX6Nxhg= X-Received: by 2002:a05:6830:2475:b0:661:b91c:f32a with SMTP id x53-20020a056830247500b00661b91cf32amr19355124otr.123.1666707706758; Tue, 25 Oct 2022 07:21:46 -0700 (PDT) MIME-Version: 1.0 References: <20221022214622.18042-1-ogabbay@kernel.org> In-Reply-To: From: Alex Deucher Date: Tue, 25 Oct 2022 10:21:34 -0400 Message-ID: Subject: Re: [RFC PATCH 0/3] new subsystem for compute accelerator devices To: Jason Gunthorpe Cc: Dave Airlie , Tvrtko Ursulin , Jiho Chu , Jeffrey Hugo , Thomas Zimmermann , Arnd Bergmann , John Hubbard , Oded Gabbay , linux-kernel@vger.kernel.org, dri-devel@lists.freedesktop.org, Christoph Hellwig , Jacek Lawrynowicz , Greg Kroah-Hartman , Alex Deucher , Yuji Ishikawa , Kevin Hilman , Maciej Kwapulinski , Jagan Teki Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Oct 25, 2022 at 7:15 AM Jason Gunthorpe wrote: > > On Tue, Oct 25, 2022 at 12:27:11PM +1000, Dave Airlie wrote: > > > The userspace for those is normally bespoke like ROCm, which uses > > amdkfd, and amdkfd doesn't operate like most device files from what I > > know, so I'm not sure we'd want it to operate as an accel device. > > I intensely dislike this direction that drivers will create their own > char devs buried inside their device driver with no support or > supervision. > > We've been here before with RDMA and it is just a complete mess. > > Whatever special non-drm stuff amdkfd need to do should be supported > through the new subsystem, in a proper maintainable way. We plan to eventually move ROCm over the drm interfaces once we get user mode queues working on non-compute queues which is already in progress. ROCm already uses the existing drm nodes and libdrm for a number of things today (buffer sharing, media and compute command submission in certain cases, etc.). I don't see much value in the accel nodes for AMD products at this time. Even when we transition, there are still a bunch of things that we'd need to think about, so the current kfd node may stick around until we figure out a plan for those areas. E.g., the kfd node provides platform level compute topology information; e.g., the NUMA details for connected GPUs and CPUs, non-GPU compute node information, cache level topologies, etc. Alex > > Jason