Received: by 10.213.65.68 with SMTP id h4csp910010imn; Wed, 14 Mar 2018 04:03:13 -0700 (PDT) X-Google-Smtp-Source: AG47ELv8yiI54vENb+R6tGsh2SOS7nRgcqOGJnTbCigtdnr9Q8EIcZp7biacZ0oiN//vmRbumkLN X-Received: by 10.98.226.16 with SMTP id a16mr3846872pfi.157.1521025393455; Wed, 14 Mar 2018 04:03:13 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1521025393; cv=none; d=google.com; s=arc-20160816; b=JC5hmi2Bj7CQIk7VNL0GsnIyfPXAMhQBWCcX+MtcMJ3xLGVL0mP8HBPk+wek4VGWjV CHKmQaWfISRy4IWH2J5avrfFYp+LSPXVz4sVq5lIm1fYUBljbsiBxfcKOEMoo9GUSdhk WwvfnAnzr5Qo5pPfIp/nzi4se6UuA/J7OOUDo6vj/AyYRWY1CzfLnwo8CuwvpenYgcKl 7VymghdVOgky9bCyeNAYyNg/YULdnk9I3KqX6Rsm0Gp1YciAgvspo+13rdTEOPzbWLRs da0av2StDoTfqo7vsiAfUKgnLZAMeasPQ/+cWeZZ5jXrdmG7lJWI5yE3EJvaUzowldUD keyA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :arc-authentication-results; bh=/gZ2j7MfzFuiAIbmwukMVgurslGes4mhV7apBkogdOM=; b=qJi4HL3Za5Y1tlH4cDJjpD/oN9q5EA6SoC6XnHTG6Ht1x9bg0gsSTYe7GZByyc7APk n9vWGUu2Sm2O9lVUSyjPt+dHOjYDFurD7nKN8sZGiHh0uLnO92u4mrThqyPFK31jRdP4 pvVpLLEYX4f7EvkO7GDGSUFPa1+dQjKv3hMAqt38m+MwxySshV7sKt3toMSDstZDOZMd 5P82Pdl/dyeFPzPEje3+efTKqivknHJwd7uXevtrl2vUc7dBSnQISvnEirw9Bilv+zDS VfcgccoVGnO9OTmlmpGZC7i/rl0aX4pKLZfHXrZPRHGa2N1PSx8fUxqycM+sAeW1IQg0 viJA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e74si682960pfd.97.2018.03.14.04.02.36; Wed, 14 Mar 2018 04:03:13 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751402AbeCNLBY convert rfc822-to-8bit (ORCPT + 99 others); Wed, 14 Mar 2018 07:01:24 -0400 Received: from mondschein.lichtvoll.de ([194.150.191.11]:34091 "EHLO mail.lichtvoll.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751005AbeCNLBX (ORCPT ); Wed, 14 Mar 2018 07:01:23 -0400 Received: from merkaba.localnet (unknown [91.221.105.244]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.lichtvoll.de (Postfix) with ESMTPSA id 4507C2BE6AB; Wed, 14 Mar 2018 12:01:22 +0100 (CET) From: Martin Steigerwald To: Hans de Goede Cc: Linux Kernel Mailing List , Thorsten Leemhuis , Tejun Heo Subject: Re: [Possible REGRESSION, 4.16-rc4] Error updating SMART data during runtime and could not connect to lvmetad at some boot attempts Date: Wed, 14 Mar 2018 12:01:21 +0100 Message-ID: <3573548.kp1edD77Gq@merkaba> In-Reply-To: References: <27165802.vQ9JbjrmvU@merkaba> MIME-Version: 1.0 Content-Transfer-Encoding: 8BIT Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hans de Goede - 11.03.18, 15:37: > Hi Martin, > > On 11-03-18 09:20, Martin Steigerwald wrote: > > Hello. > > > > Since 4.16-rc4 (upgraded from 4.15.2 which worked) I have an issue > > with SMART checks occassionally failing like this: > > > > smartd[28017]: Device: /dev/sdb [SAT], is in SLEEP mode, suspending checks > > udisksd[24408]: Error performing housekeeping for drive > > /org/freedesktop/UDisks2/drives/INTEL_SSDSA2CW300G3_[…]: Error updating > > SMART data: Error sending ATA command CHECK POWER MODE: Unexpected sense > > data returned:#0120000: 0e 09 0c 00 00 00 ff 00 00 00 00 00 00 00 50 > > 00 ..............P.#0120010: 00 00 00 00 00 00 00 00 00 00 00 00 00 > > 00 00 00 ................#012 (g-io-error-quark, 0) merkaba > > udisksd[24408]: Error performing housekeeping for drive > > /org/freedesktop/UDisks2/drives/Crucial_CT480M500SSD3_[…]: Error updating > > SMART dat a: Error sending ATA command CHECK POWER MODE: Unexpected sense > > data returned:#0120000: 01 00 1d 00 00 00 0e 09 0c 00 00 00 ff 00 00 > > 00 ................#0120010: 00 0 0 00 00 50 00 00 00 00 00 00 00 > > 00 00 00 00 ....P...........#012 (g-io-error-quark, 0) > > > > (Intel SSD is connected via SATA, Crucial via mSATA in a ThinkPad T520) > > > > However when I then check manually with smartctl -a | -x | -H the device > > reports SMART data just fine. > > > > As smartd correctly detects that device is in sleep mode, this may be an > > userspace issue in udisksd. > > > > Also at some boot attempts the boot hangs with a message like "could not > > connect to lvmetad, scanning manually for devices". I use BTRFS RAID 1 > > on to LVs (each on one of the SSDs). A configuration that requires a > > manual > > adaption to InitRAMFS in order to boot (basically vgchange -ay before > > btrfs device scan). > > > > I wonder whether that has to do with the new SATA LPM policy stuff, but as > > I had issues with > > > > 3 => Medium power with Device Initiated PM enabled > > > > (machine did not boot, which could also have been caused by me > > accidentally > > removing all TCP/IP network support in the kernel with that setting) > > > > I set it back to > > > > CONFIG_SATA_MOBILE_LPM_POLICY=0 > > > > (firmware settings) > > Right, so at that settings the LPM policy changes are effectively > disabled and cannot explain your SMART issues. > > Still I would like to zoom in on this part of your bug report, because > for Fedora 28 we are planning to ship with CONFIG_SATA_MOBILE_LPM_POLICY=3 > and AFAIK Ubuntu has similar plans. > > I suspect that the issue you were seeing with > CONFIG_SATA_MOBILE_LPM_POLICY=3 were with the Crucial disk ? I've attached > a patch for you to test, which disabled LPM for your model Crucial SSD (but > keeps it on for the Intel disk) if you can confirm that with that patch you > can run with > CONFIG_SATA_MOBILE_LPM_POLICY=3 without issues that would be great. With 4.16-rc5 with CONFIG_SATA_MOBILE_LPM_POLICY=3 the system successfully booted three times in a row. So feel free to add tested-by. Let´s see whether the blk_mq_terminate_expired or the smartd/udisks error messages reappear with rc5. I still think they are a different issue. Thanks, -- Martin