Received: by 2002:a05:6a10:9848:0:0:0:0 with SMTP id x8csp3409990pxf; Mon, 5 Apr 2021 11:15:59 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwAK+j7teyL7yB7o9450nS1GYLKRu92Mb0jJFxx664jUBQzOA2iH8ryKfmuTotxDR7CSz2j X-Received: by 2002:a17:906:234b:: with SMTP id m11mr7950448eja.404.1617646559019; Mon, 05 Apr 2021 11:15:59 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1617646559; cv=none; d=google.com; s=arc-20160816; b=y390YmBrnq9p/QQO3re8meRrA+TR0Co4eoeVQqW7MmTEyUw+DBJB1ApmLzjMDUOhl3 Ft7KVhNGb2Oc1lCmcS6ezqj/nd4nMxTvKsmwh7Bb0nkf3N2thnC5KHNyqwgOuuO3dWKq TL4OSDM72IdTHf1sQEy+nBMEboyXy7SjjFbijSSglKyWaIJwOGy5+YO+YP2qe2ndWawp MGThLuVeMRDQmDuFRjpALrp1j7jKEm83VFWAkruwc4yRJlCYJ2AyQeKzRiUbdhTyUHoW pyCxnOIMVrTMSXSDSvPAl/k6UXD9S2+DIYnOH3P/Y1/oZKb1SZAFal0K98e4y7oYLOkG 88lA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:dkim-signature; bh=K58/0ButtwDm+u3SeZ0++LoII3QgC96fjrDog0fzDgY=; b=0FHlnGhSxvrgm8VbiHMk0ztigV9Nee5Ek9Ki9Fk9vow47F3RqRnatLUDHhz8rr5WPU qwRBUoc3jxwW31ZzbO0QxVoEKzdQxLFatfpG7bMGpFAbuVfQFPl0XkqYnwAKqNO4zllM trcQkJRHnSFl+6Q9Koq4vwm564L2dENtKaPLRPYeKz6x/pKoPkVz+FvACwYKyZGZGza8 qbcgfEiHedRlJvuzFWj0pDj34gjE2oTm2EUr7Fl35iPjatMruH8VzFAJ/Fbcq3jMcAfz 3RicSnmPck7viC8dtEJy6OtQFawQoZWi29rFAdCPVl+bLtkA0rZ08S5tmlw+DW1C5/XA xUbQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=lirMONbc; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id hs11si4758276ejc.421.2021.04.05.11.15.35; Mon, 05 Apr 2021 11:15:59 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=lirMONbc; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235381AbhDEMKS (ORCPT + 99 others); Mon, 5 Apr 2021 08:10:18 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:38562 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234148AbhDEMKR (ORCPT ); Mon, 5 Apr 2021 08:10:17 -0400 Received: from mail-ej1-x631.google.com (mail-ej1-x631.google.com [IPv6:2a00:1450:4864:20::631]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6BA3DC061788; Mon, 5 Apr 2021 05:10:00 -0700 (PDT) Received: by mail-ej1-x631.google.com with SMTP id hq27so16466740ejc.9; Mon, 05 Apr 2021 05:10:00 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-language:content-transfer-encoding; bh=K58/0ButtwDm+u3SeZ0++LoII3QgC96fjrDog0fzDgY=; b=lirMONbc3CuUs2bfQDM8ilAmPx70M15rzERwljV6DMHC2hX+yNcvqo8NOn2FaME9OC PAXPdG2xMf/0I8v1hgL5QKZJ0GQXbR5zNqGaKZptF28doxViu7BMoFZtK+UAhooGAJLt hRdIlhULFAYACA/bbweCQWXu8bL73ZJvAD0CGRNiqo8MD3OaNUhyp75O8ykZ1XOevDJP JqV7f1ecD+ZmAuL/d6QxSGHFgiEqhXSCyZdq1sxOBWAXb+Gr1Xx6GF1HkcP9p5W+bBOn jvU1DM4jLsv8iB3S8xKFtEA7bevoOkYGST6sx5eHx4PNpFnAUFCfgxiSaAPbgR+lHPLC kJTw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=K58/0ButtwDm+u3SeZ0++LoII3QgC96fjrDog0fzDgY=; b=LzOTZ1vo8XdCQNGAJEAgdYaCwVZ/FZiFOs/K/o6zHPsQCtDIRukczJHJKuYvOFMuGi /6JnP8H6VDU3VhYJ1fVrVLnkRw5bV68nRfCZW8rhKdzxmdsP3FA6Pwgsdik6mOgquXrW pzWMwpO49sm32zKaBXlbxgjpyCxKalU8s6pwjd9JfW4g7e5E7Jk57cT0z807UfD5Xx6B Wxx9LSvVP33K7n02FhJy4si4Uax6HQXWHZRkk5jUQ1ss+13IaSp0XJ1ipjGcs39DZ9rF pvRFINrle5iWIYBP4ie2a/jxvJTQsQBLHttL9pfv92FqOrbZdGmHPhBJ/RdmO3ywTeCn a8uA== X-Gm-Message-State: AOAM533RcyrSDKY/AJRpd9RC3a7hfZqnHLEEXycl1AYUpXDEHn9hZ0bb kXjX7IadoDcQKGCu7oeAVxg= X-Received: by 2002:a17:907:929:: with SMTP id au9mr27751863ejc.28.1617624598975; Mon, 05 Apr 2021 05:09:58 -0700 (PDT) Received: from ?IPv6:2003:ea:8f1f:bb00:417d:b38d:188d:9829? (p200300ea8f1fbb00417db38d188d9829.dip0.t-ipconnect.de. [2003:ea:8f1f:bb00:417d:b38d:188d:9829]) by smtp.googlemail.com with ESMTPSA id d19sm10922518edr.45.2021.04.05.05.09.57 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 05 Apr 2021 05:09:58 -0700 (PDT) Subject: Re: [PATCH] net: phy: fix PHY possibly unwork after MDIO bus resume back To: christian.melki@t2data.com, Joakim Zhang , andrew@lunn.ch, linux@armlinux.org.uk, davem@davemloft.net, kuba@kernel.org Cc: netdev@vger.kernel.org, linux-kernel@vger.kernel.org, linux-imx@nxp.com References: <20210404100701.6366-1-qiangqing.zhang@nxp.com> <97e486f8-372a-896f-6549-67b8fb34e623@gmail.com> From: Heiner Kallweit Message-ID: <010f896e-befb-4238-5219-01969f3581e3@gmail.com> Date: Mon, 5 Apr 2021 14:09:51 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.9.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 05.04.2021 10:43, Christian Melki wrote: > On 4/5/21 12:48 AM, Heiner Kallweit wrote: >> On 04.04.2021 16:09, Heiner Kallweit wrote: >>> On 04.04.2021 12:07, Joakim Zhang wrote: >>>> commit 4c0d2e96ba055 ("net: phy: consider that suspend2ram may cut >>>> off PHY power") invokes phy_init_hw() when MDIO bus resume, it will >>>> soft reset PHY if PHY driver implements soft_reset callback. >>>> commit 764d31cacfe4 ("net: phy: micrel: set soft_reset callback to >>>> genphy_soft_reset for KSZ8081") adds soft_reset for KSZ8081. After these >>>> two patches, I found i.MX6UL 14x14 EVK which connected to KSZ8081RNB doesn't >>>> work any more when system resume back, MAC driver is fec_main.c. >>>> >>>> It's obvious that initializing PHY hardware when MDIO bus resume back >>>> would introduce some regression when PHY implements soft_reset. When I >>> >>> Why is this obvious? Please elaborate on why a soft reset should break >>> something. >>> >>>> am debugging, I found PHY works fine if MAC doesn't support suspend/resume >>>> or phy_stop()/phy_start() doesn't been called during suspend/resume. This >>>> let me realize, PHY state machine phy_state_machine() could do something >>>> breaks the PHY. >>>> >>>> As we known, MAC resume first and then MDIO bus resume when system >>>> resume back from suspend. When MAC resume, usually it will invoke >>>> phy_start() where to change PHY state to PHY_UP, then trigger the stat> machine to run now. In phy_state_machine(), it will start/config >>>> auto-nego, then change PHY state to PHY_NOLINK, what to next is >>>> periodically check PHY link status. When MDIO bus resume, it will >>>> initialize PHY hardware, including soft_reset, what would soft_reset >>>> affect seems various from different PHYs. For KSZ8081RNB, when it in >>>> PHY_NOLINK state and then perform a soft reset, it will never complete >>>> auto-nego. >>> >>> Why? That would need to be checked in detail. Maybe chip errata >>> documentation provides a hint. >>> >> >> The KSZ8081 spec says the following about bit BMCR_PDOWN: >> >> If software reset (Register 0.15) is >> used to exit power-down mode >> (Register 0.11 = 1), two software >> reset writes (Register 0.15 = 1) are >> required. The first write clears >> power-down mode; the second >> write resets the chip and re-latches >> the pin strapping pin values. >> >> Maybe this causes the issue you see and genphy_soft_reset() isn't >> appropriate for this PHY. Please re-test with the KSZ8081 soft reset >> following the spec comment. >> > > Interesting. Never expected that behavior. > Thanks for catching it. Skimmed through the datasheets/erratas. > This is what I found (micrel.c): > > 10/100: > 8001 - Unaffected? > 8021/8031 - Double reset after PDOWN. > 8041 - Errata. PDOWN broken. Recommended do not use. Unclear if reset > solves the issue since errata says no error after reset but is also > claiming that only toggling PDOWN (may) or power will help. > 8051 - Double reset after PDOWN. > 8061 - Double reset after PDOWN. > 8081 - Double reset after PDOWN. > 8091 - Double reset after PDOWN. > > 10/100/1000: > Nothing in gigabit afaics. > > Switches: > 8862 - Not affected? > 8863 - Errata. PDOWN broken. Reset will not help. Workaround exists. > 8864 - Not affected? > 8873 - Errata. PDOWN broken. Reset will not help. Workaround exists. > 9477 - Errata. PDOWN broken. Will randomly cause link failure on > adjacent links. Do not use. > > This certainly explains a lot. > >>>> >>>> This patch changes PHY state to PHY_UP when MDIO bus resume back, it >>>> should be reasonable after PHY hardware re-initialized. Also give state >>>> machine a chance to start/config auto-nego again. >>>> >>> >>> If the MAC driver calls phy_stop() on suspend, then phydev->suspended >>> is true and mdio_bus_phy_may_suspend() returns false. As a consequence >>> phydev->suspended_by_mdio_bus is false and mdio_bus_phy_resume() >>> skips the PHY hw initialization. >>> Please also note that mdio_bus_phy_suspend() calls phy_stop_machine() >>> that sets the state to PHY_UP. >>> >> >> Forgot that MDIO bus suspend is done before MAC driver suspend. >> Therefore disregard this part for now. >> >>> Having said that the current argumentation isn't convincing. I'm not >>> aware of such issues on other systems, therefore it's likely that >>> something is system-dependent. >>> >>> Please check the exact call sequence on your system, maybe it >>> provides a hint. >>> >>>> Signed-off-by: Joakim Zhang >>>> --- >>>> drivers/net/phy/phy_device.c | 7 +++++++ >>>> 1 file changed, 7 insertions(+) >>>> >>>> diff --git a/drivers/net/phy/phy_device.c b/drivers/net/phy/phy_device.c >>>> index cc38e326405a..312a6f662481 100644 >>>> --- a/drivers/net/phy/phy_device.c >>>> +++ b/drivers/net/phy/phy_device.c >>>> @@ -306,6 +306,13 @@ static __maybe_unused int mdio_bus_phy_resume(struct device *dev) >>>> ret = phy_resume(phydev); >>>> if (ret < 0) >>>> return ret; >>>> + >>>> + /* PHY state could be changed to PHY_NOLINK from MAC controller resume >>>> + * rounte with phy_start(), here change to PHY_UP after re-initializing >>>> + * PHY hardware, let PHY state machine to start/config auto-nego again. >>>> + */ >>>> + phydev->state = PHY_UP; >>>> + >>>> no_resume: >>>> if (phydev->attached_dev && phydev->adjust_link) >>>> phy_start_machine(phydev); >>>> >>> >> > This is a quick draft of the modified soft reset for KSZ8081. Some tests would be appreciated. diff --git a/drivers/net/phy/micrel.c b/drivers/net/phy/micrel.c index a14a00328..4902235a8 100644 --- a/drivers/net/phy/micrel.c +++ b/drivers/net/phy/micrel.c @@ -1091,6 +1091,42 @@ static void kszphy_get_stats(struct phy_device *phydev, data[i] = kszphy_get_stat(phydev, i); } +int ksz8081_soft_reset(struct phy_device *phydev) +{ + int bmcr, ret, val; + + phy_lock_mdio_bus(phydev); + + bmcr = __phy_read(phydev, MII_BMCR); + if (bmcr < 0) + return bmcr; + + bmcr |= BMCR_RESET; + + if (bmcr & BMCR_PDOWN) + __phy_write(phydev, MII_BMCR, bmcr); + + if (phydev->autoneg == AUTONEG_ENABLE) + bmcr |= BMCR_ANRESTART; + + __phy_write(phydev, MII_BMCR, bmcr & ~BMCR_ISOLATE); + + phy_unlock_mdio_bus(phydev); + + phydev->suspended = 0; + + ret = phy_read_poll_timeout(phydev, MII_BMCR, val, !(val & BMCR_RESET), + 50000, 600000, true); + if (ret) + return ret; + + /* BMCR may be reset to defaults */ + if (phydev->autoneg == AUTONEG_DISABLE) + ret = genphy_setup_forced(phydev); + + return ret; +} + static int kszphy_suspend(struct phy_device *phydev) { /* Disable PHY Interrupts */ @@ -1303,7 +1339,7 @@ static struct phy_driver ksphy_driver[] = { .driver_data = &ksz8081_type, .probe = kszphy_probe, .config_init = ksz8081_config_init, - .soft_reset = genphy_soft_reset, + .soft_reset = ksz8081_soft_reset, .config_intr = kszphy_config_intr, .handle_interrupt = kszphy_handle_interrupt, .get_sset_count = kszphy_get_sset_count, -- 2.31.1