Subject: Re: General protection fault with use_blk_mq=1.
To: "Zephaniah E. Loss-Cutler-Hull", "Zephaniah E. Loss-Cutler-Hull", linux-kernel@vger.kernel.org, linux-block@vger.kernel.org, linux-scsi@vger.kernel.org
Cc: Paolo Valente
From: Jens Axboe
Date: Wed, 28 Mar 2018 21:22:47 -0600
Message-ID: <735c5d75-eacf-8ed2-ba9b-9ff4b0b5290d@kernel.dk>
References: <7d8a9c62-7d3e-879c-5b5b-30707f04553e@aehallh.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On 3/28/18 9:13 PM, Zephaniah E. Loss-Cutler-Hull wrote:
> On 03/28/2018 06:02 PM, Jens Axboe wrote:
>> On 3/28/18 5:03 PM, Zephaniah E. Loss-Cutler-Hull wrote:
>>> I am not subscribed to any of the lists on the To list here, please CC
>>> me on any replies.
>>>
>>> I am encountering a fairly consistent crash anywhere from 15 minutes to
>>> 12 hours after boot with scsi_mod.use_blk_mq=1 dm_mod.use_blk_mq=1
>>>
>>> The crash looks like:
>>>
>
>>>
>>> Looking through the code, I'd guess that this is dying inside
>>> blkg_rwstat_add, which calls percpu_counter_add_batch, which is what RIP
>>> is pointing at.
>>
>> Leaving the whole thing here for Paolo - it's crashing off insertion of
>> a request coming out of SG_IO. Don't think we've seen this BFQ failure
>> case before.
>>
>> You can mitigate this by switching the scsi-mq devices to mq-deadline
>> instead.
>>
>
> I'm thinking that I should also be able to mitigate it by disabling
> CONFIG_DEBUG_BLK_CGROUP.
>
> That should remove that entire chunk of code.
>
> Of course, that won't help if this is actually a symptom of a bigger
> problem.

Yes, it's not a given that it will fully mask the issue at hand. But
turning off BFQ has a much higher chance of working for you.

This time actually CC'ing Paolo.

-- 
Jens Axboe
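
For reference, the mitigation suggested above (moving a scsi-mq device off
BFQ and onto mq-deadline) can be sketched as below. This is only an
illustration, not something from the thread: it assumes the usual
/sys/block/<device>/queue/scheduler sysfs file, where the active scheduler
is shown in square brackets, and the helper names and the device name "sda"
are hypothetical. Writing to sysfs needs root.

```python
from pathlib import Path


def parse_schedulers(text):
    """Split a sysfs scheduler line into (active, available).

    The active scheduler is the one wrapped in square brackets,
    e.g. "mq-deadline kyber [bfq] none" -> active is "bfq".
    """
    active = None
    available = []
    for token in text.split():
        if token.startswith("[") and token.endswith("]"):
            token = token[1:-1]
            active = token
        available.append(token)
    return active, available


def switch_scheduler(device, wanted="mq-deadline", sysfs_root="/sys/block"):
    """Select `wanted` for `device` if the kernel offers it.

    Returns the scheduler that was active before the call.
    `device` (e.g. "sda") is a hypothetical example name.
    """
    path = Path(sysfs_root) / device / "queue" / "scheduler"
    active, available = parse_schedulers(path.read_text())
    if wanted not in available:
        raise ValueError(f"{wanted} not offered for {device}: {available}")
    if active != wanted:
        path.write_text(wanted)
    return active
```

Calling switch_scheduler("sda") as root would select mq-deadline for that
one device only; the setting does not persist across reboots.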