From: Martin Steigerwald
To: Martin Steigerwald
Cc: Hans de Goede, Linux Kernel Mailing List, Thorsten Leemhuis, Tejun Heo
Subject: Re: [Possible REGRESSION, 4.16-rc4] Error updating SMART data during runtime and could not connect to lvmetad at some boot attempts
Date: Thu, 15 Mar 2018 11:48:29 +0100
Message-ID: <52455167.vuuyuMgTrX@merkaba>
In-Reply-To: <3573548.kp1edD77Gq@merkaba>
References: <27165802.vQ9JbjrmvU@merkaba> <3573548.kp1edD77Gq@merkaba>
X-Mailing-List: linux-kernel@vger.kernel.org

Martin Steigerwald - 14.03.18, 12:01:
> Hans de Goede - 11.03.18, 15:37:
> > Hi Martin,
> >
> > On 11-03-18 09:20, Martin Steigerwald wrote:
> > > Hello.
> > >
> > > Since 4.16-rc4 (upgraded from 4.15.2 which worked) I have an issue
> > > with SMART checks occasionally failing like this:
> > >
> > > smartd[28017]: Device: /dev/sdb [SAT], is in SLEEP mode, suspending
> > > checks
> > > udisksd[24408]: Error performing housekeeping for drive
> > > /org/freedesktop/UDisks2/drives/INTEL_SSDSA2CW300G3_[…]: Error updating
> > > SMART data: Error sending ATA command CHECK POWER MODE: Unexpected sense
> > > data returned:#0120000: 0e 09 0c 00 00 00 ff 00 00 00 00 00 00 00 50 00
> > > ..............P.#0120010: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
> > > ................#012 (g-io-error-quark, 0)
> > > merkaba udisksd[24408]: Error performing housekeeping for drive
> > > /org/freedesktop/UDisks2/drives/Crucial_CT480M500SSD3_[…]: Error updating
> > > SMART data: Error sending ATA command CHECK POWER MODE: Unexpected sense
> > > data returned:#0120000: 01 00 1d 00 00 00 0e 09 0c 00 00 00 ff 00 00 00
> > > ................#0120010: 00 00 00 00 50 00 00 00 00 00 00 00 00 00 00 00
> > > ....P...........#012 (g-io-error-quark, 0)
> > >
> > > (Intel SSD is connected via SATA, Crucial via mSATA in a ThinkPad T520)
> > >
> > > However when I then check manually with smartctl -a | -x | -H the device
> > > reports SMART data just fine.
> > >
> > > As smartd correctly detects that the device is in sleep mode, this may be
> > > a userspace issue in udisksd.
> > >
> > > Also at some boot attempts the boot hangs with a message like "could not
> > > connect to lvmetad, scanning manually for devices". I use BTRFS RAID 1
> > > on two LVs (each on one of the SSDs). A configuration that requires a
> > > manual adaption to InitRAMFS in order to boot (basically vgchange -ay
> > > before btrfs device scan).
> > >
> > > I wonder whether that has to do with the new SATA LPM policy stuff, but
> > > as I had issues with
> > >
> > > 3 => Medium power with Device Initiated PM enabled
> > >
> > > (machine did not boot, which could also have been caused by me
> > > accidentally removing all TCP/IP network support in the kernel with that
> > > setting)
> > >
> > > I set it back to
> > >
> > > CONFIG_SATA_MOBILE_LPM_POLICY=0
> > >
> > > (firmware settings)
> >
> > Right, so at that setting the LPM policy changes are effectively
> > disabled and cannot explain your SMART issues.
> >
> > Still I would like to zoom in on this part of your bug report, because
> > for Fedora 28 we are planning to ship with CONFIG_SATA_MOBILE_LPM_POLICY=3
> > and AFAIK Ubuntu has similar plans.
> >
> > I suspect that the issue you were seeing with
> > CONFIG_SATA_MOBILE_LPM_POLICY=3 was with the Crucial disk? I've attached
> > a patch for you to test, which disables LPM for your model Crucial SSD
> > (but keeps it on for the Intel disk). If you can confirm that with that
> > patch you can run with CONFIG_SATA_MOBILE_LPM_POLICY=3 without issues,
> > that would be great.
>
> With 4.16-rc5 with CONFIG_SATA_MOBILE_LPM_POLICY=3 the system successfully
> booted three times in a row. So feel free to add tested-by.
>
> Let's see whether the blk_mq_terminate_expired or the smartd/udisks error
> messages reappear with rc5. I still think they are a different issue.

As expected, these two other issues still happen with 4.16-rc5.

Thanks,
-- 
Martin