Received: by 10.192.165.148 with SMTP id m20csp3690812imm; Mon, 30 Apr 2018 04:53:13 -0700 (PDT) X-Google-Smtp-Source: AB8JxZpjV/Qy1nYbSDp/Ly/8oILQKryzALxKhB6Pb1wDvqwyXIMiI9wn8v2oKrqDFN2FDcpY71b2 X-Received: by 2002:a65:668f:: with SMTP id b15-v6mr9598464pgw.183.1525089193281; Mon, 30 Apr 2018 04:53:13 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1525089193; cv=none; d=google.com; s=arc-20160816; b=Yth0Xu30KQuyZO6+vbnto3/56+4kKnRWqhsnHzKqWfhBZs5477i4B9QSKGVwfe49pQ co3O6iz+Yn/MvbB5IG4iyeyrCzy1Fx032J7mSA1qGH+zZKoilAwAiIfm/uJk26HqdnMh B7Otb1+K+7Ip2Ytnij74Be05TF0gp9RSetBy7opxzFyhZ8UTI4mGYboLBE0TZfAipJce orkEn/u2t5vZqFvNFvO73g6ZO2tYrevypF2SyEQiu9ddUisVdQz46us9msnsTzGaCer6 klAgjSL+7dk1s2tG+m/vjkuhUdoSrc1grAIoL8JQ+vNyuPtknicvjLJjHCmFduLN7Dj7 lNQw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :organization:references:in-reply-to:message-id:subject:cc:to:from :date:arc-authentication-results; bh=6uWMnRyYFskolfxBQSN5Iu0jSk2SlALb3eAJ/D70Rls=; b=1C5oMIoffXF5/k6qcgxquQ0+pjuoiKtepaPe5BBRJbnXnok3t/rcT0VpNE+6GPamjS FW2cbqhaCAQieAETttVoAn2pQSFWnYVK7PPWb9o8jL7BrYPMBWOkLgViJWqpvP6BDysE plqwtZ3aMo2MK+l6UJeXHInsS5mkY3CLMFsRU/7QqXuzQHGAtEU9n1lOLzmEbUvXBFPS tVTb9mbM0KqL0RPcNrn1yTaX6NcV19xokHmo5yZALajAukN52okYUoIf4FsiMQfI0jax zaXlEvoJ6ekYM7YGq3ZhUhCzCgfm3VB26JZi5O6/Ed03Zc4azE8uIUSrpwMRDvKRpZ67 zt4A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id h4-v6si7129293pls.490.2018.04.30.04.52.59; Mon, 30 Apr 2018 04:53:13 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754337AbeD3LwB (ORCPT + 99 others); Mon, 30 Apr 2018 07:52:01 -0400 Received: from mx3-rdu2.redhat.com ([66.187.233.73]:59246 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1753878AbeD3Lv6 (ORCPT ); Mon, 30 Apr 2018 07:51:58 -0400 Received: from smtp.corp.redhat.com (int-mx06.intmail.prod.int.rdu2.redhat.com [10.11.54.6]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id A6523400ADD2; Mon, 30 Apr 2018 11:51:56 +0000 (UTC) Received: from gondolin (dhcp-192-222.str.redhat.com [10.33.192.222]) by smtp.corp.redhat.com (Postfix) with ESMTP id 43AAD2166BAD; Mon, 30 Apr 2018 11:51:55 +0000 (UTC) Date: Mon, 30 Apr 2018 13:51:53 +0200 From: Cornelia Huck To: Dong Jia Shi , Halil Pasic Cc: linux-kernel@vger.kernel.org, linux-s390@vger.kernel.org, kvm@vger.kernel.org, borntraeger@de.ibm.com, bjsdjshi@linux.ibm.com, pasic@linux.ibm.com, pmorel@linux.ibm.com Subject: Re: [PATCH v2 5/5] vfio: ccw: add traceponits for interesting error paths Message-ID: <20180430135153.1d108675.cohuck@redhat.com> In-Reply-To: <20180428055023.GS5428@bjsdjshi@linux.vnet.ibm.com> References: <20180423110113.59385-1-bjsdjshi@linux.vnet.ibm.com> <20180423110113.59385-6-bjsdjshi@linux.vnet.ibm.com> <20180427121353.4453bdc2.cohuck@redhat.com> <20180428055023.GS5428@bjsdjshi@linux.vnet.ibm.com> Organization: Red Hat GmbH MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Scanned-By: MIMEDefang 2.78 on 10.11.54.6 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.6]); Mon, 30 Apr 2018 11:51:57 +0000 (UTC) X-Greylist: inspected by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.6]); Mon, 30 Apr 2018 11:51:57 +0000 (UTC) for IP:'10.11.54.6' DOMAIN:'int-mx06.intmail.prod.int.rdu2.redhat.com' HELO:'smtp.corp.redhat.com' FROM:'cohuck@redhat.com' RCPT:'' Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, 28 Apr 2018 13:50:23 +0800 Dong Jia Shi wrote: > * Cornelia Huck [2018-04-27 12:13:53 +0200]: > > > On Mon, 23 Apr 2018 13:01:13 +0200 > > Dong Jia Shi wrote: > > > > typo in subject: s/traceponits/tracepoints/ > > > > > From: Halil Pasic > > > > > > Add some tracepoints so we can inspect what is not working as is should. > > > > > > Signed-off-by: Halil Pasic > > > Signed-off-by: Dong Jia Shi > > > --- > > > drivers/s390/cio/Makefile | 1 + > > > drivers/s390/cio/vfio_ccw_fsm.c | 16 +++++++- > > > drivers/s390/cio/vfio_ccw_trace.h | 77 +++++++++++++++++++++++++++++++++++++++ > > > 3 files changed, 93 insertions(+), 1 deletion(-) > > > create mode 100644 drivers/s390/cio/vfio_ccw_trace.h > > > > > > > @@ -135,6 +142,8 @@ static void fsm_io_request(struct vfio_ccw_private *private, > > > goto err_out; > > > > > > io_region->ret_code = cp_prefetch(&private->cp); > > > + trace_vfio_ccw_cp_prefetch(get_schid(private), > > > + io_region->ret_code); > > > if (io_region->ret_code) { > > > cp_free(&private->cp); > > > goto err_out; > > > @@ -142,11 +151,13 @@ static void fsm_io_request(struct vfio_ccw_private *private, > > > > > > /* Start channel program and wait for I/O interrupt. */ > > > io_region->ret_code = fsm_io_helper(private); > > > + trace_vfio_ccw_fsm_io_helper(get_schid(private), > > > + io_region->ret_code); > > > if (io_region->ret_code) { > > > cp_free(&private->cp); > > > goto err_out; > > > } > > > - return; > > > + goto out; > > > } else if (scsw->cmd.fctl & SCSW_FCTL_HALT_FUNC) { > > > /* XXX: Handle halt. */ > > > io_region->ret_code = -EOPNOTSUPP; > > > @@ -159,6 +170,9 @@ static void fsm_io_request(struct vfio_ccw_private *private, > > > > > > err_out: > > > private->state = VFIO_CCW_STATE_IDLE; > > > +out: > > > + trace_vfio_ccw_io_fctl(scsw->cmd.fctl, get_schid(private), > > > + io_region->ret_code); > > > } > > > > > > /* > > > > I really don't want to bikeshed, especially as some tracepoints are > > better than no tracepoints, but... > > > > We now trace fctl/schid/ret_code unconditionally (good). > > > > We trace the outcome of cp_prefetch() and fsm_io_helper() > > unconditionally. We don't, however, trace all things that may go wrong. > > We have the tracepoint at the end, but it cannot tell us where the > > error came from. Should we have tracepoints in every place (in this > > function) that may generate an error? Only if there is an actual error? > > Are the two enough for common debug scenarios? > Trace actual error sounds like a better idea than trace unconditionally > of these two functions. > These two are not enough for common debug scenarios. For example, we > cann't tell if a -EOPNOTSUPP is a orb->tm.b problem, or error code > returned by cp_init(). > > Idea to improve: > 1. Trace actual error. > 2. Define a trace event and add error trace for cp_init(). Hm. Going from what I have done in the past when doing printk debugging: - stick in a message that is always hit, with some information about parameters, if it makes sense - stick in a message "foo happened!" in the error branches - or, alternatively, trace the called functions So tracing on failure only might be more useful? Have all failure paths under a common knob to turn on/off? > > Opinions? We can just go ahead with this and improve things later > > on, I guess. > > > I think it's also fine to do this - better something than nothing. We > could at least have a code base to be improved to make everybody > happier in future. Maybe keep the patch as it is now, except trace the errors only (keeping the fctl trace point)? Halil, as you wrote the patch (and I presume you found it helpful): What is your opinion?