Received: by 2002:a05:6a10:22f:0:0:0:0 with SMTP id 15csp3464374pxk; Mon, 28 Sep 2020 19:31:33 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxl4/ROEg+kN8C9vzqCHWaIBye9xKsM6vHKv0KpGJOBoH2HaAP9pvBoWY+M+stmSgSZNimW X-Received: by 2002:a50:d987:: with SMTP id w7mr941856edj.113.1601346693344; Mon, 28 Sep 2020 19:31:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1601346693; cv=none; d=google.com; s=arc-20160816; b=JJzd320PGTLi38/JqM01ICWKe8OcESDz5g2fnZaXpbrV1iVETXDWTgxHPlxOGM2VPc YAlFKv302TAdRyI1Zam5JinqUUaIww19zabr8wtQvLa6W5ksoYmi9Rio4REIy1LlwnX0 2YhQVFtI+is/YhmwiQMzjoL1B/sOETHCjfT+BA6ICpl6xqyFqmgq1MxW30R4LURXyeDN /LExGdMdk8IfOWB6+RFBpIPdIgPKpmVOcy6WPtuUfj9uWzO6KkVDLB02k2WtKHyAI9aS 3dNCjpP9NmwaL8H/500Ge4Lu/j4ReHoQpFhk8TVVG6J3ugN7+uIocxUVOzbuiTQH7iMr 28MQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=ygg9/4iRHT21kcmBGkSM1pLATrrcqMs4HjO1+e+77XE=; b=vLek8MrIBBvV9axp7z0/3GdFKqGhX1INsWZnvlZRTzHF+CTaUOLHgeyDWoON8MqPwX 4sNCL8pUyf5gpRpDUj2rAKBrDoOC5XH0qBOPVfUHiQHXXTUNc5GAK1V4FLe1pJWYiBVq NydT4E/OriXfwohAJWcn3O5JYtnLFzZrzrzXuneyxnVhub8PMDpmKNyqFYNQ9aW0oX+i JXIlXkBSzXaiTKE3U1y8M9EAcGb78cBR6O4iG6P5iRYqJLhcofk46bE9Wf8D2Xn0MyAV /G90leJwBBpl5R0G6T7HxAWC94iXuRQtqlpEagUGoeL+WNyBCqO5HfuY2696HnUrg1sh zRvg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=Vl6Hngjm; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id i7si883317edj.22.2020.09.28.19.31.03; Mon, 28 Sep 2020 19:31:33 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=Vl6Hngjm; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727314AbgI2C2Y (ORCPT + 99 others); Mon, 28 Sep 2020 22:28:24 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:38164 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727305AbgI2C2W (ORCPT ); Mon, 28 Sep 2020 22:28:22 -0400 Received: from mail-ed1-x544.google.com (mail-ed1-x544.google.com [IPv6:2a00:1450:4864:20::544]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C1546C061755; Mon, 28 Sep 2020 19:28:21 -0700 (PDT) Received: by mail-ed1-x544.google.com with SMTP id e22so4625938edq.6; Mon, 28 Sep 2020 19:28:21 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=ygg9/4iRHT21kcmBGkSM1pLATrrcqMs4HjO1+e+77XE=; b=Vl6HngjmW6fkWNnFUozmkjOJ7Aeg1U7JaEUUQe3qD7RGFCCRL2X9yWPmil47nY3Fh9 XQtiIhpFdniiqBQ/3oSBl+ChvTWQ69m8pd7tdftr9ZocGBS5AcQw7pKxYzPoQBMK2j/H NlNjhPEJ6MRLs7e9ehYzO4o06ErMlg2vQrEn6q3zZcB5BU2J9oB6zKinOmvdvoEovCBA oXxuoIKVH4BTXHp17ncBzqzUWHF17twiyb50Uh8eE2+OG6ukHGlXZmjyyGJfBI25jhRv G4G9GlIco+Sw/+c+822xbQENRwToPh9gtzIbj87X1zL9/j4NGi+z+L1EkNwcxzHJLAJm 7Y4w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=ygg9/4iRHT21kcmBGkSM1pLATrrcqMs4HjO1+e+77XE=; b=R5MX+GzQXAJskmUgNJSZUwQQo7Hn4c+15EVSS3ziSdUN4ViWuH4w737K7j2kVGDElm MZZ4eIcLCjCCW4A4fIljqvp7DSaqXTHWHRGiSdGv/tty0ADYigHhJUKI3DLdRe5OLedF ROEqzBDDBSMg3DKb+dY3zDsKfnb7OlM/9RLd1jT0F6VNNeMkR8Ud1dW//iijLHyBjwgu /QBLk5vg5UfWO8czpq9nrRF/UkT8BL0ClLXzMw7nX+ks0ir/WaKSNAMEU1LucA5MkmOQ e3WSS7jMC9ixvlIIem4PqFR6gSvqyB3CrQEc6smGHrDOSa7S7X4iBzSArxKSfCU3vUCy jy9w== X-Gm-Message-State: AOAM5329lXjqN8Irg1a51a+e7t8vstCd8L6+58Ze53UMneDZnTxkfp5x dAuE2D6Q5cBQgVvQHkIeKU/WTaSmfWiEe61d+ho= X-Received: by 2002:a05:6402:7d2:: with SMTP id u18mr942326edy.69.1601346499551; Mon, 28 Sep 2020 19:28:19 -0700 (PDT) MIME-Version: 1.0 References: <20200927032829.11321-1-haifeng.zhao@intel.com> <20200927032829.11321-3-haifeng.zhao@intel.com> <14b7d988-212b-93dc-6fa6-6b155d5c8ac3@kernel.org> <16431a60-027e-eca9-36f4-74d348e88090@kernel.org> <38cc8252-e485-ef11-93a1-7b43ad85fc2e@intel.com> In-Reply-To: <38cc8252-e485-ef11-93a1-7b43ad85fc2e@intel.com> From: Ethan Zhao Date: Tue, 29 Sep 2020 10:28:08 +0800 Message-ID: Subject: Re: [PATCH 2/5 V2] PCI: pciehp: check and wait port status out of DPC before handling DLLSC and PDC To: "Kuppuswamy, Sathyanarayanan" Cc: Sinan Kaya , "Zhao, Haifeng" , "bhelgaas@google.com" , "oohall@gmail.com" , "ruscur@russell.cc" , "lukas@wunner.de" , "andriy.shevchenko@linux.intel.com" , "stuart.w.hayes@gmail.com" , "mr.nuke.me@gmail.com" , "mika.westerberg@linux.intel.com" , Keith Busch , "linux-pci@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "Jia, Pei P" , "ashok.raj@linux.intel.com" Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Sep 29, 2020 at 12:45 AM Kuppuswamy, Sathyanarayanan wrote: > > > On 9/28/20 9:43 AM, Sinan Kaya wrote: > > On 9/28/2020 7:10 AM, Sinan Kaya wrote: > >> On 9/27/2020 10:01 PM, Zhao, Haifeng wrote: > >>> Sinan, > >>> I explained the reason why locks don't protect this case in the patch description part. > >>> Write side and read side hold different semaphore and mutex. > >>> > >> I have been thinking about it some time but is there any reason why we > >> have to handle all port AER/DPC/HP events in different threads? > >> > >> Can we go to single threaded event loop for all port drivers events? > >> > >> This will require some refactoring but it wlll eliminate the lock > >> nightmares we are having. > >> > >> This means no sleeping. All sleeps need to happen outside of the loop. > >> > >> I wanted to see what you all are thinking about this. > >> > >> It might become a performance problem if the system is > >> continuously observing a hotplug/aer/dpc events. > >> > >> I always think that these should be rare events. > > If restructuring would be too costly, the preferred solution should be > > to fix the locks in hotplug driver rather than throwing there a random > > wait call. > Since the current race condition is detected between DPC and > hotplug, I recommend synchronizing them. The locks are the first place to root cause and try to fix. but not so easy to refactor the remove-scan-semaphore and the bus-walk-mutex. too expensive work. --- rework every piece of code that uses them. Thanks, Ethan > > -- > Sathyanarayanan Kuppuswamy > Linux Kernel Developer >