Received: by 2002:a25:f815:0:0:0:0:0 with SMTP id u21csp2254873ybd; Thu, 27 Jun 2019 09:15:30 -0700 (PDT) X-Google-Smtp-Source: APXvYqwwFhpQINOiWFSaAUiK6Bz9iVeyqVpZjAkGeW7CKz4XtGnI9itWFYvt2aRqTv7mnS+HwdTN X-Received: by 2002:a63:da52:: with SMTP id l18mr4554235pgj.131.1561652130409; Thu, 27 Jun 2019 09:15:30 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1561652130; cv=none; d=google.com; s=arc-20160816; b=Y1iHSktRzmutPD+RDQvoLPJ31C0WWyoW7pw/LrCT87Z+49+0CYZsMzq66Z2VYSYgTH keQYqk4AmbFarFKMom3t6LSjncJjhPxloQSoW3LmLrlN8ToNMpXLKlGsMcMMEUjAvepJ gLzQ+QHNGu+RIMFw+EnQOQaI22IuoXxfcG/1DcEy6itnAe55h/ysRBvJl2G7KCtlxlP1 1H5r8M6onzm7C69JVUzPybKhUXELlcSHZwVTjtnG9p8SPh7mirD5U8iKEVvMfY3eKzLK ACHcPFcObTo2pwJWQtKbf3W8RpMCrsNYu0us2aQv10pnBrrGEkIAJlalwlGli8XThjVd u1/Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:date:subject:cc:to:from; bh=8R3eTa27d9gHjcH5R+px6WAn/6UnrjRyUZ+60dJSY0k=; b=RSPoDmY6F7Q6scA7qsBX9vchjKULKN9BZmY0mvlRhupHOSL9ynv0sZjrUAFOs7e/GD pDQjmBtD5Ia9ErQDsh6PzAaqBEhsep9JG0+n3BK2bAsuZqFHnoSqEJpq47QDuhUkN84D faU6tgswAhg55S01HN9c+B1EChkkZIj1HgoaQi9/zRBX554kCIz+P0PZE1i/QGTmFtAJ nUELVezOjwsWNC8puMZO9rf0GsbtbC1Jrb33w/TmjYRR4TCOdxhlJGvIacR9pzapWpMo EuamUzo6OPFMXBBqgFunI07N6MCDp2ruKXs/VF93TVVZbjhvC7usw3q9JkYxuKb1dAxz i4XQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=virtuozzo.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f4si2353831pgv.489.2019.06.27.09.15.14; Thu, 27 Jun 2019 09:15:30 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=virtuozzo.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726640AbfF0QOU (ORCPT + 99 others); Thu, 27 Jun 2019 12:14:20 -0400 Received: from relay.sw.ru ([185.231.240.75]:44580 "EHLO relay.sw.ru" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726464AbfF0QOR (ORCPT ); Thu, 27 Jun 2019 12:14:17 -0400 Received: from [10.94.4.83] (helo=finist-ce7.sw.ru) by relay.sw.ru with esmtp (Exim 4.92) (envelope-from ) id 1hgX2H-00047n-6O; Thu, 27 Jun 2019 19:14:09 +0300 From: Konstantin Khorenko To: Adaptec OEM Raid Solutions , Prasad B Munirathnam , Raghava Aditya Renukunta Cc: Konstantin Khorenko , linux-scsi@vger.kernel.org, linux-kernel@vger.kernel.org, "James E . J . Bottomley" , "Martin K . Petersen" Subject: [PATCH 0/1] aacraid: Host adapter Adaptec 6405 constantly resets under high io load Date: Thu, 27 Jun 2019 19:14:07 +0300 Message-Id: <20190627161408.10295-1-khorenko@virtuozzo.com> X-Mailer: git-send-email 2.15.1 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Problem description: ==================== A node with Adaptec 6405 controller, latest BIOS V5.3-0[19204] A lot of disks attached to the controller. Simple test: running mkfs.ext4 on many disks on the same controller in parallel (mkfs is not important here, any serious io load triggers controller aborts) Results: * no problems (controller resets) with kernels prior to 395e5df79a95 ("scsi: aacraid: Remove reference to Series-9") * latest ms kernel v5.2-rc6-15-g249155c20f9b - mkfs processes are in D state, lot of complains in logs like: [ 654.894633] aacraid: Host adapter abort request. aacraid: Outstanding commands on (0,1,43,0): [ 699.441034] aacraid: Host adapter abort request. aacraid: Outstanding commands on (0,1,40,0): [ 699.442950] aacraid: Host adapter reset request. SCSI hang ? [ 714.457428] aacraid: Host adapter reset request. SCSI hang ? ... [ 759.514759] aacraid: Host adapter reset request. SCSI hang ? [ 759.514869] aacraid 0000:03:00.0: outstanding cmd: midlevel-0 [ 759.514870] aacraid 0000:03:00.0: outstanding cmd: lowlevel-0 [ 759.514872] aacraid 0000:03:00.0: outstanding cmd: error handler-498 [ 759.514873] aacraid 0000:03:00.0: outstanding cmd: firmware-471 [ 759.514875] aacraid 0000:03:00.0: outstanding cmd: kernel-60 [ 759.514912] aacraid 0000:03:00.0: Controller reset type is 3 [ 759.515013] aacraid 0000:03:00.0: Issuing IOP reset [ 850.296705] aacraid 0000:03:00.0: IOP reset succeeded Same complains on Ubuntu kernel 4.15.0-50-generic: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1777586 Controller: =========== 03:00.0 RAID bus controller: Adaptec Series 6 - 6G SAS/PCIe 2 (rev 01) Subsystem: Adaptec Series 6 - ASR-6405 - 4 internal 6G SAS ports Test: ===== # cat dev.list /dev/sdq1 /dev/sde1 /dev/sds1 /dev/sdb1 /dev/sdk1 /dev/sdaj1 /dev/sdaf1 /dev/sdd1 /dev/sdac1 /dev/sdai1 /dev/sdz1 /dev/sdj1 /dev/sdy1 /dev/sdn1 /dev/sdae1 /dev/sdg1 /dev/sdi1 /dev/sdc1 /dev/sdf1 /dev/sdl1 /dev/sda1 /dev/sdab1 /dev/sdr1 /dev/sdo1 /dev/sdah1 /dev/sdm1 /dev/sdt1 /dev/sdp1 /dev/sdad1 /dev/sdh1 =========================================== # cat run_mkfs.sh #!/bin/bash while read i; do mkfs.ext4 $i -q -E lazy_itable_init=1 -O uninit_bg -m 0 & done ================================= # cat dev.list | ./run_mkfs.sh The issue is 100% reproducible. i've bisected to the culprit patch, it's 395e5df79a95 ("scsi: aacraid: Remove reference to Series-9") it changes arc ctrl checks for Series-6 controllers and i've checked that resurrection of original logic in arc ctrl checks eliminates controller hangs/resets. Konstantin Khorenko (1): scsi: aacraid: resurrect correct arc ctrl checks for Series-6 drivers/scsi/aacraid/aacraid.h | 11 ----------- drivers/scsi/aacraid/comminit.c | 14 ++++++++++---- drivers/scsi/aacraid/commsup.c | 4 +++- drivers/scsi/aacraid/linit.c | 7 +++++-- 4 files changed, 18 insertions(+), 18 deletions(-) -- 2.15.1