Received: by 2002:a05:6359:c8b:b0:c7:702f:21d4 with SMTP id go11csp4222604rwb; Fri, 30 Sep 2022 15:04:36 -0700 (PDT) X-Google-Smtp-Source: AMsMyM58xXk6biPQPvav6ypZSlKVUVO443R7xCo+JRCztEnJuRQfutWXjzD6zdrqZCVhsSl199ui X-Received: by 2002:a17:907:3c81:b0:77a:327a:815f with SMTP id gl1-20020a1709073c8100b0077a327a815fmr8041645ejc.422.1664575476606; Fri, 30 Sep 2022 15:04:36 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1664575476; cv=none; d=google.com; s=arc-20160816; b=WIodBCrbQOb2IcJKmmUBiUVWcUVuDirQinMTMR/zZmLEJjtDpnpVlsMajRQeDWWGa+ f6PxbSCICD6JJr+WprLlJkz15SZNhheRgYyV8cFIZZ/A8/TbXDQzILOE/g3RCnYHqGAz XbJNXYYECQEfLoN9IEWepXhb14dlrB94im0kuTbo6FiruMnag8ZZ1bQvy6EV3+kSx6yq 8wNZ+oQ5YST4Dl1txbAcGKv0MPanFd3TV370d7rqnzDs5JbnEXJAV+XaAiOIIUoQMgGk oFenpe/+FlcRcJ7X+iPbL+VSGl77OLPUWT6/8dOuvNQHRRyJxF81KurfmBp6J+n7fq+w 11Ng== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from:dkim-signature; bh=LHKjqiv/1IHiQva1vWxJ8QVlM67W4929ggbGJZR64u0=; b=tuudEIDzHShJ691sEAxrKeLJKx5XJz9B+lMioe6e3OcmTWVBmwjp/JS6IZtSSAmdfN aS7nFqjWCz4jPa8khIwvtZJfeh8RVLbEoQZpdDpYahnWMLDynXhYNnuJ45V0CFVo/Cni J1BjLEjBxMIzAH4VlURHwdShJJJJTXpfsZ58TJKYGF9njvXpOwKNom9Xt7KZOxatvHtx tvE6qc2P4MZlzTDfxbMf+vJrTXB1U4n+2qdaB0/MJbAj0o9JWg/6F+RlFiM/FCYDq245 +SkoC+VVN+pHqlLK3TWsggcwBMIZbV+IfCukQDy68PhP6LJBN/wwWIlNvL9jdGDF65au fC6w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=FEV0LEPS; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id h19-20020a05640250d300b0044785c547dbsi3331262edb.212.2022.09.30.15.04.10; Fri, 30 Sep 2022 15:04:36 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=FEV0LEPS; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232424AbiI3Vlv (ORCPT + 99 others); Fri, 30 Sep 2022 17:41:51 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:51814 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232195AbiI3Vln (ORCPT ); Fri, 30 Sep 2022 17:41:43 -0400 Received: from mail-pg1-x532.google.com (mail-pg1-x532.google.com [IPv6:2607:f8b0:4864:20::532]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5E39120857C for ; Fri, 30 Sep 2022 14:41:42 -0700 (PDT) Received: by mail-pg1-x532.google.com with SMTP id 78so5176632pgb.13 for ; Fri, 30 Sep 2022 14:41:42 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:from:to:cc:subject:date; bh=LHKjqiv/1IHiQva1vWxJ8QVlM67W4929ggbGJZR64u0=; b=FEV0LEPSMUlSbwnVYD7mmedr/iHQ0xyAf5qU00LNXYcOuaLKugLFBkuVC39Lt/YJd5 Eo42iKlC8ydkOvxLBF168YxPi5HGb+yashCijTKzrI2iQNniLKARB4eFNwYHZZHZX18G yIOGCNWh4sAbwwguZwrdJCzgwnzu6QziSqIeLxmW6/5FTsSeYSAss82v3DCuWStdd+Tv rMknqB5ja5yzNIsqc3OJ6kLIWTPIm9p4lXODhJwfMGamIkzqgCroHAFsch/J9vlUw5VY +uq8vmdEVhA4zx4KkmVei59WXosO5u5E9waoefGAXde8xh5AozVFsRxNB+0oSubHC2ET dFJg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:x-gm-message-state:from:to:cc:subject:date; bh=LHKjqiv/1IHiQva1vWxJ8QVlM67W4929ggbGJZR64u0=; b=7X2/xBjlhTtfMU+JbMBJnybzpbIhizr94YMN//EMH1x4VSgzbif2VPFdrB/eyP30RM vFLFtVVUh1JgI57R3PfsjSyD4VYsHrq8R7B0jmEhMIAPsrCX+ohBi/CP3r5ywLuQSSwH uij7yabS6PwAYVP/014KYs2xxaXFwXqY2uMdzTRokkar2FWofWlqB62m0IdaPIOTuxIN McfbZFc+DrUE74YwJqRCO5IembFpq3V3/m6cHbH++1W/Kn94wxUJpWpDm4barShf4hBT t70t09N/PS+60A/xpNBXGo1vj2QBpBt4iEN5MWHUFeA6Upg4Es6eOBlcqEGx45fLAfZi PxdA== X-Gm-Message-State: ACrzQf1qZCtTSq9q6mkN4RntOQOC1SExl9BGUcunnBsQwyRDaOEEQAVZ xp38MYVg94jfqs04zC1Zyw4= X-Received: by 2002:a05:6a02:186:b0:439:49a3:479b with SMTP id bj6-20020a056a02018600b0043949a3479bmr9250552pgb.171.1664574101595; Fri, 30 Sep 2022 14:41:41 -0700 (PDT) Received: from localhost.localdomain (ec2-13-113-80-70.ap-northeast-1.compute.amazonaws.com. [13.113.80.70]) by smtp.gmail.com with ESMTPSA id p124-20020a62d082000000b0053617cbe2d2sm2281711pfg.168.2022.09.30.14.41.37 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 30 Sep 2022 14:41:39 -0700 (PDT) From: Zhang Boyang To: Andrey Grodzovsky , =?UTF-8?q?Christian=20K=C3=B6nig?= , linux-kernel@vger.kernel.org, amd-gfx@lists.freedesktop.org Cc: "David C . Rankin" , Steven J Abner Subject: [RFC PATCH 0/1] drm/amdgpu: Fix NULL-deref in amdgpu_device_fini_sw() Date: Sat, 1 Oct 2022 05:41:09 +0800 Message-Id: <20220930214110.1074226-1-zhangboyang.id@gmail.com> X-Mailer: git-send-email 2.30.2 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, There are several reports of "Fatal error during GPU init" will cause NULL-deref in amdgpu_device_fini_sw(). Although the NULL-deref is result instead of reason, this NULL-deref will confuse user. https://lore.kernel.org/lkml/a8bce489-8ccc-aa95-3de6-f854e03ad557@suddenlinkmail.com/ https://lore.kernel.org/lkml/AT9WHR.3Z1T3VI9A2AQ3@att.net/ This is probably because "adev" is not fully initialized when amdgpu_device_init() failed. Thus subsequent amdgpu_device_fini_sw() will try to release "adev->reset_domain" and cause NULL-deref. This patch fixes this problem by guarding the code with an "if". However, I'm new to this module and I didn't fully understand the code, so please review my code carefully. Best Regards, Zhang Boyang