Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp3667888pxu; Tue, 8 Dec 2020 19:26:47 -0800 (PST) X-Google-Smtp-Source: ABdhPJznEzM0Yc8oN+0EiEkNbnGfNpenvq91CxH3o06w8tBcG/HWoMOyp8iX0pN6aXVpt0A7xO3B X-Received: by 2002:a17:906:e18:: with SMTP id l24mr363530eji.434.1607484407041; Tue, 08 Dec 2020 19:26:47 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1607484407; cv=none; d=google.com; s=arc-20160816; b=qbTCVvhM/ke1hmpJZxMaDvo7tNHTOzS/BTsAkEJNpCdHJc7fu1LsewjdvI4QM+fqGy GDmv3FRDzP5LT26mecuHNNXWUgx81qv7Hjs0Q9SnDHpBb+yaAmeRGgI5dWk61sLOEQh/ u3pQNAOH0vjrhtGZ5hjIuRzYLfx1hL6sM3yazgg8GRK9SenVD+anLTUrBUCtPltIhlWE BwXVa8D6mGuAFofSFUaNdiv3nlbzUbnuxjuq9Q7gQmGFXATUDBms8dusB+jJEzHcWlT5 8C0HReJYfp8spglLgn3M2/y4COEnubknoEj4zcOIXyFLQ+kuORL2WlagsII75CiFZ3zV BNow== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=K8aClBMRVYokWWuO1V9mT0g9fN0gk6W4m2aWoAT7LVY=; b=FnmmHB7eWDF04SoyOeY3f36pWBa3bhukEJr8F9V8UhXaqO1b2yfKLRgbEt0ueaUQJZ SB7PMY5notEgEzRXOzHfMNBgeqIN/YkSwJ9SdbYFkLRc1MMr+97vQjy3JEyYTWNSdnuA o4kch3u0YblhAmSSSd09kqafyF5APd8jabelXHX5FxcnoGkf5m8awM+5OqUoiSyt6akf pd+S5eeIGAr51Mh2Tcf2vXdVNYxxrzj6+rWEeMNoerKfbt53XmO4SR3Tag0uqsrPnBrx +qdIRcN+ZILlaeubmQjqiinx5NsM7QKEDXBWeJED3pInf8iq3YlQzjr7H496ySlLkg6q CazQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2020-01-29 header.b=K0+F92ZR; spf=pass (google.com: domain of linux-ext4-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id u2si83280ejo.423.2020.12.08.19.26.14; Tue, 08 Dec 2020 19:26:47 -0800 (PST) Received-SPF: pass (google.com: domain of linux-ext4-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2020-01-29 header.b=K0+F92ZR; spf=pass (google.com: domain of linux-ext4-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727136AbgLIDZX (ORCPT + 99 others); Tue, 8 Dec 2020 22:25:23 -0500 Received: from userp2130.oracle.com ([156.151.31.86]:44690 "EHLO userp2130.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725789AbgLIDZX (ORCPT ); Tue, 8 Dec 2020 22:25:23 -0500 Received: from pps.filterd (userp2130.oracle.com [127.0.0.1]) by userp2130.oracle.com (8.16.0.42/8.16.0.42) with SMTP id 0B93Nugk045915; Wed, 9 Dec 2020 03:24:30 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=oracle.com; h=date : from : to : cc : subject : message-id : references : mime-version : content-type : in-reply-to; s=corp-2020-01-29; bh=K8aClBMRVYokWWuO1V9mT0g9fN0gk6W4m2aWoAT7LVY=; b=K0+F92ZRPR5GPBEgsOD7pzt6dLRm7gICFNJJsrgHAO1M1e5VPTKDR1Ggw8XibHLBGQak jvz20R0Jn9Y22JdNZJC9D4Pg49h1B6dLf0QujJiMSdF5r+v5UxhbW8XFsmgik/SgSmmz l6rRHVHks+DPCnTgmOyYHiMA+AUoNAPD6OOOiPIIZFVNhvYVkOPR05zyBFLFdgPD11e2 RargWX+WBVayf8T+FL3qr1D8ZT0Q78+zy9dC0P2dM+6+podoYqtl4rN3E/2jnCCItQ1w WRWwjwUylOsdxZ0wJFWea6sRbgfYW1GE0ds4R5eGklrsyGL8AilPhh36GtG+z8PW1pEV NQ== Received: from userp3020.oracle.com (userp3020.oracle.com [156.151.31.79]) by userp2130.oracle.com with ESMTP id 3581mqx04k-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=FAIL); Wed, 09 Dec 2020 03:24:30 +0000 Received: from pps.filterd (userp3020.oracle.com [127.0.0.1]) by userp3020.oracle.com (8.16.0.42/8.16.0.42) with SMTP id 0B93KfTx141591; Wed, 9 Dec 2020 03:24:29 GMT Received: from aserv0121.oracle.com (aserv0121.oracle.com [141.146.126.235]) by userp3020.oracle.com with ESMTP id 358kytxcrh-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Wed, 09 Dec 2020 03:24:29 +0000 Received: from abhmp0018.oracle.com (abhmp0018.oracle.com [141.146.116.24]) by aserv0121.oracle.com (8.14.4/8.13.8) with ESMTP id 0B93ORx1019135; Wed, 9 Dec 2020 03:24:27 GMT Received: from localhost (/10.159.152.73) by default (Oracle Beehive Gateway v4.0) with ESMTP ; Tue, 08 Dec 2020 19:24:27 -0800 Date: Tue, 8 Dec 2020 19:24:25 -0800 From: "Darrick J. Wong" To: Gabriel Krisman Bertazi Cc: David Howells , viro@zeniv.linux.org.uk, tytso@mit.edu, khazhy@google.com, adilger.kernel@dilger.ca, linux-ext4@vger.kernel.org, linux-fsdevel@vger.kernel.org, kernel@collabora.com Subject: Re: [PATCH 5/8] vfs: Include origin of the SB error notification Message-ID: <20201209032425.GD106255@magnolia> References: <20201208003117.342047-6-krisman@collabora.com> <20201208003117.342047-1-krisman@collabora.com> <952750.1607431868@warthog.procyon.org.uk> <87r1o05ua6.fsf@collabora.com> <20201208184123.GC106255@magnolia> <87lfe85c6b.fsf@collabora.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <87lfe85c6b.fsf@collabora.com> X-Proofpoint-Virus-Version: vendor=nai engine=6000 definitions=9829 signatures=668683 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 adultscore=0 spamscore=0 mlxscore=0 malwarescore=0 suspectscore=1 mlxlogscore=999 bulkscore=0 phishscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2009150000 definitions=main-2012090021 X-Proofpoint-Virus-Version: vendor=nai engine=6000 definitions=9829 signatures=668683 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 suspectscore=1 mlxlogscore=999 clxscore=1015 malwarescore=0 priorityscore=1501 adultscore=0 lowpriorityscore=0 phishscore=0 spamscore=0 impostorscore=0 mlxscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2009150000 definitions=main-2012090022 Precedence: bulk List-ID: X-Mailing-List: linux-ext4@vger.kernel.org On Tue, Dec 08, 2020 at 04:29:32PM -0300, Gabriel Krisman Bertazi wrote: > "Darrick J. Wong" writes: > > > On Tue, Dec 08, 2020 at 09:58:25AM -0300, Gabriel Krisman Bertazi wrote: > >> David Howells writes: > >> > >> > Gabriel Krisman Bertazi wrote: > >> > > >> >> @@ -130,6 +131,8 @@ struct superblock_error_notification { > > > > FWIW I wonder if this really should be inode_error_notification? > > > > If (for example) ext4 discovered an error in the blockgroup descriptor > > and wanted to report it, the inode and block numbers would be > > irrelevant, but the blockgroup number would be nice to have. > > A previous RFC had superblock_error_notification and > superblock_inode_error_notification split, I think we can recover that. > > > > >> >> __u32 error_cookie; > >> >> __u64 inode; > >> >> __u64 block; > >> >> + char function[SB_NOTIFICATION_FNAME_LEN]; > >> >> + __u16 line; > >> >> char desc[0]; > >> >> }; > >> > > >> > As Darrick said, this is a UAPI breaker, so you shouldn't do this (you can, > >> > however, merge this ahead a patch). Also, I would put the __u16 before the > >> > char[]. > >> > > >> > That said, I'm not sure whether it's useful to include the function name and > >> > line. Both fields are liable to change over kernel commits, so it's not > >> > something userspace can actually interpret. I think you're better off dumping > >> > those into dmesg. > >> > > >> > Further, this reduces the capacity of desc[] significantly - I don't know if > >> > that's a problem. > >> > >> Yes, that is a big problem as desc is already quite limited. I don't > > > > How limited? > > The largest notification is 128 bytes, the one with the biggest header > is superblock_error_notification which leaves 56 bytes for description. > > > > >> think it is a problem for them to change between kernel versions, as the > >> monitoring userspace can easily associate it with the running kernel. > > > > How do you make that association? $majordistro's 4.18 kernel is not the > > same as the upstream 4.18. Wouldn't you rather the notification message > > be entirely self-describing rather than depending on some external > > information about the sender? > > True. I was thinking on my use case where the customer controls their > infrastructure and would specialize their userspace tools, but that is > poor design on my part. A self describing mechanism would be better. > > > > >> The alternative would be generating something like unique IDs for each > >> error notification in the filesystem, no? > >> > >> > And yet further, there's no room for addition of new fields with the desc[] > >> > buffer on the end. Now maybe you're planning on making use of desc[] for > >> > text-encoding? > >> > >> Yes. I would like to be able to provide more details on the error, > >> without having a unique id. For instance, desc would have the formatted > >> string below, describing the warning: > >> > >> ext4_warning(inode->i_sb, "couldn't mark inode dirty (err %d)", err); > > > > Depending on the upper limit on the length of messages, I wonder if you > > could split the superblock notification and the description string into > > separate messages (with maybe the error cookie to tie them together) so > > that the struct isn't limited by having a VLA on the end, and the > > description can be more or less an arbitrary string? > > > > (That said I'm not familiar with the watch queue system so I have no > > idea if chained messages even make sense here, or are already > > implemented in some other way, or...) > > I don't see any support for chaining messages in the current watch_queue > implementation, I'd need to extend the interface to support it. I > considered this idea before, given the small description size, but I > thought it would be over-complicated, even though much more future > proof. I will look into that. > > What about the kernel exporting a per-filesystem table, as a build > target or in /sys/fs//errors, that has descriptions strings for each > error? Then the notification can have only the FS type, index to the > table and params. This won't exactly be self-describing as you wanted > but, differently from function:line, it removes the need for the source > code, and allows localization. The per-filesystem table would be > stable ABI, of course. Yikes. I don't think people are going to be ok with a message table where we can never remove the strings. I bet GregKH won't like that either (one value per sysfs file). (Maybe I misread that and all you meant by stable ABI is the fact that the table exists at a given path and the notification message gives you a index into ... wherever we put it.) --D > > -- > Gabriel Krisman Bertazi