Received: by 10.192.165.148 with SMTP id m20csp4819680imm; Tue, 24 Apr 2018 08:54:35 -0700 (PDT) X-Google-Smtp-Source: AIpwx4/hnvnW5FTaGz3yKHPRlNCQjgCn3S0Hv73+DqkAX5LGZIrYqefnnS+ncUBYwUOtIqBzLZp6 X-Received: by 2002:a17:902:207:: with SMTP id 7-v6mr25454424plc.261.1524585275388; Tue, 24 Apr 2018 08:54:35 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1524585275; cv=none; d=google.com; s=arc-20160816; b=T5VsSbNqhyUviOR9kFjpTxmuRsEXwbqMPlO1+F7bLSfukIQkpry9zjNt4hWljUiLxx HInW73oZdOT2tpUDTl+oZ/7oCpTSYmhQ7x/pEeIgYjadVME6ajYLPiof+1Qx9/SpU7Ue UHA+M8Jd/9IK2O7D5TG4l8rFfHluCQ3CffG7H1DtAfZbitKVr5O1db4iVrAlGB1AvVA6 7mWaVJ0ONGrUPoC4hpyge2UL+XfG2JI1XeMRP0avNTDyfBBmmFfT+eARXj8/UiNx/JBS 23bkqHSIz7adfGZnUf5ppiFuqDsM8V28Ora+nRbU4JZOR+J8JGytX8kQnvkcdtru6rdA /dVA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:content-transfer-encoding :spamdiagnosticmetadata:spamdiagnosticoutput:content-language :accept-language:in-reply-to:references:message-id:date:thread-index :thread-topic:subject:cc:to:from:dkim-signature :arc-authentication-results; bh=5BG89VSzm6pGNEgyc26GvnQc0Sm+DoYwFKSGX3uKujs=; b=NbHw4qBCr5rtOjjNo+/IvT4W0ilTH3QJB6mY5oxg8n9ZJnhVqv9yaVeHjQDbRaIny0 dQ5dpvKh4RyJFsZlBTDLwjMjqGf4g7fMwaDtQY8/xWQLnXywbA15LnYDAfxZs9qNusuT 59ISnTu/bHWk6/AsQBTbMgefHkOIvqPlnU3nJpu9sC0RPuiZwJPeAKwcyBrcx4l2rKVz HrQfNrbxmGwrWw7mDedeBseP4x4YT+lgfQT5P/QhG+x90i16Nev+ztPdp2ymPi4La2k0 S8mZH7tFL/gCd9wKjS5xD3zZ9QXkaWtS6940YmecxRNxggQxbIZuxRl2J2DQE/R/b+XP 4uPg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@amdcloud.onmicrosoft.com header.s=selector1-amd-com header.b=vfutZkA7; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 32-v6si14146492pla.348.2018.04.24.08.54.20; Tue, 24 Apr 2018 08:54:35 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@amdcloud.onmicrosoft.com header.s=selector1-amd-com header.b=vfutZkA7; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752318AbeDXPwv (ORCPT + 99 others); Tue, 24 Apr 2018 11:52:51 -0400 Received: from mail-cys01nam02on0065.outbound.protection.outlook.com ([104.47.37.65]:20928 "EHLO NAM02-CY1-obe.outbound.protection.outlook.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1751179AbeDXPws (ORCPT ); Tue, 24 Apr 2018 11:52:48 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amdcloud.onmicrosoft.com; s=selector1-amd-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version; bh=5BG89VSzm6pGNEgyc26GvnQc0Sm+DoYwFKSGX3uKujs=; b=vfutZkA7p4angpzHDxZunUEyDn8er1sB622UUdM4koBXWgO5s/UpOlMsRcUlOHVKigJb066SerzmQwvcAoBEJkafF+I4/iw13AAEiAqi+m1/s53kRzBjcA2YX+FvGZGqlSgc1/BQuTHE2HyHdqR52gPI7jHSh6v8bZ3/erwO6+Q= Received: from DM5PR12MB2440.namprd12.prod.outlook.com (52.132.141.33) by DM5PR12MB2424.namprd12.prod.outlook.com (52.132.141.29) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA384_P256) id 15.20.696.13; Tue, 24 Apr 2018 15:52:46 +0000 Received: from DM5PR12MB2440.namprd12.prod.outlook.com ([fe80::60:6669:42e6:f409]) by DM5PR12MB2440.namprd12.prod.outlook.com ([fe80::60:6669:42e6:f409%13]) with mapi id 15.20.0696.019; Tue, 24 Apr 2018 15:52:46 +0000 From: "Panariti, David" To: "Grodzovsky, Andrey" , "linux-kernel@vger.kernel.org" , "amd-gfx@lists.freedesktop.org" CC: "Deucher, Alexander" , "Koenig, Christian" , "oleg@redhat.com" , "akpm@linux-foundation.org" , "ebiederm@xmission.com" , "Grodzovsky, Andrey" Subject: RE: [PATCH 3/3] drm/amdgpu: Switch to interrupted wait to recover from ring hang. Thread-Topic: [PATCH 3/3] drm/amdgpu: Switch to interrupted wait to recover from ring hang. Thread-Index: AQHT2+GBbg9vSzZK6E2CBnt82gNLC6QQD8FQ Date: Tue, 24 Apr 2018 15:52:45 +0000 Message-ID: References: <1524583836-12130-1-git-send-email-andrey.grodzovsky@amd.com> <1524583836-12130-4-git-send-email-andrey.grodzovsky@amd.com> In-Reply-To: <1524583836-12130-4-git-send-email-andrey.grodzovsky@amd.com> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: authentication-results: spf=none (sender IP is ) smtp.mailfrom=David.Panariti@amd.com; x-originating-ip: [65.223.155.30] x-ms-publictraffictype: Email x-microsoft-exchange-diagnostics: 1;DM5PR12MB2424;7:mwIIzr13FcUL6snEjXCzpRzyouB6VUL1EEqoMfOBhzFSyzvF4Uqd5yzRsSiavXU2jNXPREJzMPWnvopNt2a0twUcjEMlYX2XhBmGJYP+UJjdFwN8mhW+vxlIJa6FfmvcdBR38+AYVb39eQ3W2uuh8Fb7a3aFAYC60bmYpaUB/UFbRl0GX7PtW3JjPBGhbkloLwbDshdugKrWraXfitLpCywbg1l6k444SonjY3Kz+/2FxMcpGWtxZUxXje7AFate;20:Pk/vD2bdtM1uNal8QAEYKE4ICmJb19YlTxR+GgwWI+QKwaGaYW9rWUBHOa0MHqh1LeicWLsRvLdBtSruGgbjkvNqvAMYYAwTNoc7r2z2A/o7TLeZn+BBYe73WZ4cxDnua/kGb3k6eJA+I2EoMPGR3Vm4yd2kmGRdIDzD5bdWMX1I2pQQO9Qy9dO9aMbNWhk8sI9A4L4OaKfpybLCJ+J1T+cgl8SuolMgCjaub3tLXLDx4Fz/Fzyhvyb0dtD3Rr2T x-ms-exchange-antispam-srfa-diagnostics: SOS;SOR; x-forefront-antispam-report: SFV:SKI;SCL:-1;SFV:NSPM;SFS:(10009020)(346002)(39860400002)(396003)(39380400002)(376002)(366004)(189003)(199004)(13464003)(25786009)(3280700002)(3660700001)(33656002)(74316002)(305945005)(7736002)(5660300001)(5250100002)(97736004)(72206003)(2900100001)(2501003)(2906002)(478600001)(229853002)(53936002)(106356001)(105586002)(55016002)(9686003)(81156014)(3846002)(8676002)(81166006)(6116002)(6246003)(14454004)(8936002)(2201001)(4326008)(86362001)(66066001)(110136005)(476003)(99286004)(11346002)(53546011)(26005)(486006)(59450400001)(68736007)(7696005)(6506007)(316002)(76176011)(6436002)(102836004)(54906003)(446003);DIR:OUT;SFP:1101;SCL:1;SRVR:DM5PR12MB2424;H:DM5PR12MB2440.namprd12.prod.outlook.com;FPR:;SPF:None;LANG:en;PTR:InfoNoRecords;MX:1;A:1; x-ms-office365-filtering-ht: Tenant x-microsoft-antispam: UriScan:;BCL:0;PCL:0;RULEID:(7020095)(4652020)(48565401081)(5600026)(2017052603328)(7153060)(7193020);SRVR:DM5PR12MB2424; x-ms-traffictypediagnostic: DM5PR12MB2424: x-microsoft-antispam-prvs: x-exchange-antispam-report-test: UriScan:(9452136761055)(767451399110)(217544274631240); x-exchange-antispam-report-cfa-test: BCL:0;PCL:0;RULEID:(8211001083)(6040522)(2401047)(5005006)(8121501046)(3002001)(10201501046)(3231232)(944501410)(52105095)(93006095)(93001095)(6055026)(6041310)(20161123558120)(20161123560045)(20161123564045)(20161123562045)(201703131423095)(201702281528075)(20161123555045)(201703061421075)(201703061406153)(6072148)(201708071742011);SRVR:DM5PR12MB2424;BCL:0;PCL:0;RULEID:;SRVR:DM5PR12MB2424; x-forefront-prvs: 0652EA5565 received-spf: None (protection.outlook.com: amd.com does not designate permitted sender hosts) x-microsoft-antispam-message-info: 0AN0/YXi0TwqO3l2R+Rz0xdjirFeAg/mOlu65w0DLjTIwZs3jCohFwylucC14bnO47psEzBuFAsu51Ub0OA3AZ89kWGlW9KWRahby8NLrVfqDVQuHAJFzltxt21XTvEkve6xLA3ixJ/SNkXaXC7pzLkHL388QmV5sCvG0QzECnowzKQV0JAZZE6TJRjYNbKA spamdiagnosticoutput: 1:99 spamdiagnosticmetadata: NSPM Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-MS-Office365-Filtering-Correlation-Id: 5b27c2c0-904c-41f9-2b2a-08d5a9fb6c43 X-OriginatorOrg: amd.com X-MS-Exchange-CrossTenant-Network-Message-Id: 5b27c2c0-904c-41f9-2b2a-08d5a9fb6c43 X-MS-Exchange-CrossTenant-originalarrivaltime: 24 Apr 2018 15:52:45.9457 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: 3dd8961f-e488-4e60-8e11-a82d994e183d X-MS-Exchange-Transport-CrossTenantHeadersStamped: DM5PR12MB2424 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, It looks like there can be an infinite loop if neither of the if()'s become= true. Is that an impossible condition? -----Original Message----- From: Andrey Grodzovsky =20 Sent: Tuesday, April 24, 2018 11:31 AM To: linux-kernel@vger.kernel.org; amd-gfx@lists.freedesktop.org Cc: Deucher, Alexander ; Koenig, Christian ; Panariti, David ; oleg@redha= t.com; akpm@linux-foundation.org; ebiederm@xmission.com; Grodzovsky, Andrey= Subject: [PATCH 3/3] drm/amdgpu: Switch to interrupted wait to recover from= ring hang. If the ring is hanging for some reason allow to recover the waiting by send= ing fatal signal. Originally-by: David Panariti Signed-off-by: Andrey Grodzovsky --- drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c | 14 ++++++++++---- 1 file changed, 10 insertions(+), 4 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c b/drivers/gpu/drm/amd/= amdgpu/amdgpu_ctx.c index eb80edf..37a36af 100644 --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_ctx.c @@ -421,10 +421,16 @@ int amdgpu_ctx_wait_prev_fence(struct amdgpu_ctx *ctx= , unsigned ring_id) =20 if (other) { signed long r; - r =3D dma_fence_wait_timeout(other, false, MAX_SCHEDULE_TIMEOUT); - if (r < 0) { - DRM_ERROR("Error (%ld) waiting for fence!\n", r); - return r; + + while (true) { + if ((r =3D dma_fence_wait_timeout(other, true, + MAX_SCHEDULE_TIMEOUT)) >=3D 0) + return 0; + + if (fatal_signal_pending(current)) { + DRM_ERROR("Error (%ld) waiting for fence!\n", r); + return r; + } } } =20 -- 2.7.4