Received: by 10.213.65.68 with SMTP id h4csp891762imn; Tue, 20 Mar 2018 19:09:44 -0700 (PDT) X-Google-Smtp-Source: AG47ELsjFpuoMmOIv7qXE/05WgKqU/glFAzgVIb4725dowbTEVE3WP14eoMwo3759NcCG6FqknGd X-Received: by 2002:a17:902:30f:: with SMTP id 15-v6mr18261126pld.365.1521598184565; Tue, 20 Mar 2018 19:09:44 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1521598184; cv=none; d=google.com; s=arc-20160816; b=zZQ7Iw/iTkhmPUBQq1tCtgPvp3kgiHAZyC8pR6pARdPJB/AfBQ6BBkuZTZnjr2ld1O KwI03GMFhHNB4pzV2ekLweHwMlQ5PMIOI/qC2WjHEH711drTd8FECwNmrcAELZ0oejB8 44KCaLVCbL7so6X8fSZiPro/DknhSzA5BXc7uF5mZjHlzsERByWscYatrpJbSmbVDIY3 GfCxsY4JFi0mvxgq9DWFWobjrKhU9gOQ1O3RokE2vzRksu9ec+LPOZY7BxnX+cw3lLP3 YmSjvOxc4uEfGSPbSigDfFATAD0OMmIz5B6Sj29iFlpcrtmwwB1/WbMPbTlkRRhbdGfY 1+zQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:date:subject:cc:to:from :arc-authentication-results; bh=K/he3mW4w7+WT21beKVuWTtNh2heoVXoV9YfsJSlMik=; b=g/WZEoh1T+4kLdKT4XRyGcZc+YPEQ8HD6vvAivJkj2waL45KUNv+30HG/FBQoz1z3c IvGWOWtk24FdRmn40lHPvG68XAnoelagz4FFTTKz1U5E1/fblpJPZ/vY/dm8HoUaIm8I XPh5Jd7l3ClJ9dTmzFUPKLaHrBz599ZTnHWhbFT1AFUPPDjjF21i4UGgRgZhCC1ognF8 kP9/WG/YVUq8jSQLu21g11Y9TpEMNXym2dj1PPO4AMO1veSFazF9UIGiwGhkJSKd+LQM Yx06wcVp86APepk1hYyCLs0i7kxmQWhZ/nwyWRZo6zkouXe//SWVYdftAOWcXpP6F33x ON7w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id be9-v6si2899735plb.201.2018.03.20.19.09.29; Tue, 20 Mar 2018 19:09:44 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751697AbeCUCId (ORCPT + 99 others); Tue, 20 Mar 2018 22:08:33 -0400 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:56100 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1751459AbeCUCIb (ORCPT ); Tue, 20 Mar 2018 22:08:31 -0400 Received: from pps.filterd (m0098413.ppops.net [127.0.0.1]) by mx0b-001b2d01.pphosted.com (8.16.0.22/8.16.0.22) with SMTP id w2L24EwQ111395 for ; Tue, 20 Mar 2018 22:08:30 -0400 Received: from e35.co.us.ibm.com (e35.co.us.ibm.com [32.97.110.153]) by mx0b-001b2d01.pphosted.com with ESMTP id 2gu9qvgh5d-1 (version=TLSv1.2 cipher=AES256-SHA256 bits=256 verify=NOT) for ; Tue, 20 Mar 2018 22:08:30 -0400 Received: from localhost by e35.co.us.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Tue, 20 Mar 2018 20:08:29 -0600 Received: from b03cxnp07029.gho.boulder.ibm.com (9.17.130.16) by e35.co.us.ibm.com (192.168.1.135) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; Tue, 20 Mar 2018 20:08:26 -0600 Received: from b03ledav003.gho.boulder.ibm.com (b03ledav003.gho.boulder.ibm.com [9.17.130.234]) by b03cxnp07029.gho.boulder.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id w2L28Qoc6685128; Tue, 20 Mar 2018 19:08:26 -0700 Received: from b03ledav003.gho.boulder.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 05ECE6A03B; Tue, 20 Mar 2018 20:08:26 -0600 (MDT) Received: from tuxmaker.boeblingen.de.ibm.com (unknown [9.152.85.9]) by b03ledav003.gho.boulder.ibm.com (Postfix) with ESMTP id 937A96A042; Tue, 20 Mar 2018 20:08:24 -0600 (MDT) From: Dong Jia Shi To: linux-kernel@vger.kernel.org, linux-s390@vger.kernel.org, kvm@vger.kernel.org Cc: cohuck@redhat.com, borntraeger@de.ibm.com, bjsdjshi@linux.vnet.ibm.com, pasic@linux.vnet.ibm.com, pmorel@linux.vnet.ibm.com Subject: [PATCH 0/4] vfio: ccw: error handling fixes and improvements Date: Wed, 21 Mar 2018 03:08:18 +0100 X-Mailer: git-send-email 2.13.5 X-TM-AS-GCONF: 00 x-cbid: 18032102-0012-0000-0000-000015EDBA46 X-IBM-SpamModules-Scores: X-IBM-SpamModules-Versions: BY=3.00008713; HX=3.00000241; KW=3.00000007; PH=3.00000004; SC=3.00000254; SDB=6.01006058; UDB=6.00512229; IPR=6.00785341; MB=3.00020153; MTD=3.00000008; XFM=3.00000015; UTC=2018-03-21 02:08:27 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 18032102-0013-0000-0000-000051F466D1 Message-Id: <20180321020822.86255-1-bjsdjshi@linux.vnet.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:,, definitions=2018-03-21_01:,, signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 priorityscore=1501 malwarescore=0 suspectscore=0 phishscore=0 bulkscore=0 spamscore=0 clxscore=1011 lowpriorityscore=0 impostorscore=0 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1709140000 definitions=main-1803210024 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Conny, Halil reported a host crash when using vfio-ccw. The root cause of the problem is that vfio_pin_pages fails with EINVAL for reasons unknown. He has experienced such failures after online-ing a dasd in the guest (the dasd has 3 partitions, hat may or may not have any significance). The problem isn't experienced on every attempt to online the dasd, and breaking at css_do_ssch seems to make things work. One thing is sure: the host kernel should not crash under the described circumstances. To fix the problem, the first patch of this series fixes the cleanup when cp_prefetch fails in the higher level. The 2nd and the 3rd patches provide correctness and denfensive actions for the interfaces in the lower level. The 4th patch is trying to add tracepoints for vfio-ccw, so that we can debug such issue easier in future. For details see the commit message portions of the inividual patches. Thanks. Dong Jia Shi (2): vfio: ccw: refactor and improve pfn_array_alloc_pin() vfio: ccw: set ccw->cda to NULL defensively Halil Pasic (2): vfio: ccw: fix cleanup if cp_prefetch fails vfio: ccw: add traceponits for interesting error paths drivers/s390/cio/Makefile | 1 + drivers/s390/cio/vfio_ccw_cp.c | 121 ++++++++++++++++++++------------------ drivers/s390/cio/vfio_ccw_fsm.c | 13 ++++ drivers/s390/cio/vfio_ccw_trace.h | 86 +++++++++++++++++++++++++++ 4 files changed, 163 insertions(+), 58 deletions(-) create mode 100644 drivers/s390/cio/vfio_ccw_trace.h -- 2.13.5