Received: by 2002:ac0:946b:0:0:0:0:0 with SMTP id j40csp2693195imj; Mon, 11 Feb 2019 07:06:52 -0800 (PST) X-Google-Smtp-Source: AHgI3Iaz/ITEASf/38OH/2ko+wuZn9veKuPjLz5E8NWMYHbfWtqsMJKihGlzxE9YhzwBUzdpQ3dj X-Received: by 2002:a17:902:f095:: with SMTP id go21mr14015633plb.199.1549897611975; Mon, 11 Feb 2019 07:06:51 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1549897611; cv=none; d=google.com; s=arc-20160816; b=ioVihuDRdLqv+j70hSlilAalBi2Gnn2wWP/ls5mWf20Jm8EfpFOqASjUUE9Uy1WZ1h dTuD2dOlhZFfvTK0RCRap2ksjenAE8DQfiKVdQf4+JkYIrEDkTnWI/+1LQWa0lh1XPfq V75SAURmJcR0U60m05rBVNc6XWHfBaXmRQq+78sVwfGjtJL+KLXbPku2Il71ltj8vhoS tZdPYS6q4SeCDPuJnz5e7xLqt3GjXRZqWRHX2zyJglT6HLS3cPVCIl3DqyQwW/FAiPO7 HkFJmla2suPASnSJ7zSAfNmIipvCTM5udFXZ/+51ghGwxGB+9ksB8FOmTt+cJYh2jUsA 1RJg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=dXv3iWekTrGjoswDsg9CYoRhzxIAgrEbomFHszraLVs=; b=i5Ud+NFxKKWdTI6xyv5CGiZ5oIvwH+o4ANSuAHR0OxCZL/PNzck1c+6AaiuVReMqpA PpfjqMmhosx6DIiFTUE9/V5qXZ04SUdhtJQIOkQ5X2pLzawKoX4EMS0AM17rGonuEKQm vMuqfIKqmmkepzZlqI9Zv8uS4/Yf977k2vx8O6MuOorCqCAs1orqVj6tOXx99PVql6G+ z/bS95QlKiP7QGrZEUbgEw30KD+aJ0555eodi4EMud4RyppgvycO6+flP2O+PPnTLaT3 QET7u2Z3YAxXsgac/xyq9L9nWASsQXt12z+MqQ+Vty4oosCx3VJPfhPLDLpca1tobTCp UjaQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 3si10636353plo.217.2019.02.11.07.06.34; Mon, 11 Feb 2019 07:06:51 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727482AbfBKPFc (ORCPT + 99 others); Mon, 11 Feb 2019 10:05:32 -0500 Received: from charlotte.tuxdriver.com ([70.61.120.58]:57537 "EHLO smtp.tuxdriver.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2390996AbfBKPF3 (ORCPT ); Mon, 11 Feb 2019 10:05:29 -0500 Received: from cpe-2606-a000-111b-405a-9816-2c85-c514-8f7a.dyn6.twc.com ([2606:a000:111b:405a:9816:2c85:c514:8f7a] helo=localhost) by smtp.tuxdriver.com with esmtpsa (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1gtD8u-0006Ah-Nu; Mon, 11 Feb 2019 10:05:21 -0500 Date: Mon, 11 Feb 2019 10:04:32 -0500 From: Neil Horman To: Marcelo Ricardo Leitner Cc: David Miller , julien@arista.com, netdev@vger.kernel.org, linux-sctp@vger.kernel.org, linux-kernel@vger.kernel.org, vyasevich@gmail.com, lucien.xin@gmail.com Subject: Re: [PATCH net] sctp: make sctp_setsockopt_events() less strict about the option length Message-ID: <20190211150432.GA13525@hmswarspite.think-freely.org> References: <20190206201430.18830-1-julien@arista.com> <20190206203754.GC13621@localhost.localdomain> <20190209.151217.175627323493244750.davem@davemloft.net> <20190210124616.GG13621@localhost.localdomain> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20190210124616.GG13621@localhost.localdomain> User-Agent: Mutt/1.10.1 (2018-07-13) X-Spam-Score: -2.9 (--) X-Spam-Status: No Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, Feb 10, 2019 at 10:46:16AM -0200, Marcelo Ricardo Leitner wrote: > On Sat, Feb 09, 2019 at 03:12:17PM -0800, David Miller wrote: > > From: Marcelo Ricardo Leitner > > Date: Wed, 6 Feb 2019 18:37:54 -0200 > > > > > On Wed, Feb 06, 2019 at 12:14:30PM -0800, Julien Gomes wrote: > > >> Make sctp_setsockopt_events() able to accept sctp_event_subscribe > > >> structures longer than the current definitions. > > >> > > >> This should prevent unjustified setsockopt() failures due to struct > > >> sctp_event_subscribe extensions (as in 4.11 and 4.12) when using > > >> binaries that should be compatible, but were built with later kernel > > >> uapi headers. > > > > > > Not sure if we support backwards compatibility like this? > > > > What a complete mess we have here. > > > > Use new socket option numbers next time, do not change the size and/or > > layout of existing socket options. > > What about reusing the same socket option, but defining a new struct? > Say, MYSOCKOPT supports struct mysockopt, struct mysockopt2, struct > mysockopt3... > > That way we have a clear definition of the user's intent. > Thats possible, but I think thats pretty equivalaent to what daves saying, in that he wants us to identify all the sizes of this struct and the git history and act on them accordingly. Having internal versions of the struct seems like a fine way to get there, but I think we need to consider how we got to this situations before we go down the implementation path. > > > > This whole thread, if you read it, is basically "if we compatability > > this way, that breaks, and if we do compatability this other way oh > > shit this other thing doesn't work." > > > > I think we really need to specifically check for the difference sizes > > that existed one by one, clear out the part not given by the user, and > > backport this as far back as possible in a way that in the older kernels > > we see if the user is actually trying to use the new features and if so > > error out. > > I'm afraid clearing out may not be enough, though seems it's the best > we can do so far. If the struct is allocated but not fully initialized > via a memset, but by setting its fields one by one, the remaining new > fields will be left uninitinialized. > I'm not sure this even makes sense. Currently (as I understood it), the issue we are facing is the one in which an application is built against a newer kernel and run on an older one, the implication there being that the application will pass in a buffer that is larger than what the kernel expects. In that situation, clearing isn't needed, all thats needed (I think), is a memcmp of the space between the sizeof(kernel struct version), and sizeof(userspace struct version) to see if any bits are non-zero. If they are, we error out, otherwise, we ignore the space and move forward as though that overage doesn't exist. Mind you, I'm not (yet) advocating for that approach, just trying to clarify whats needed. > > > > Which, btw, is terrible behavior. Newly compiled apps should work on > > older kernels if they don't try to use the new features, and if they > > One use case here is: a given distro is using kernel X and app Foo is > built against it. Then upgrades to X+1, Foo is patched to fix an issue > and is rebuilt against X+1. The user upgrades Foo package but for > whatever reason, doesn't upgrade kernel or reboot the system. Here, > Foo doesn't work anymore until the new kernel is also running. > Yes, thats the use case that we're trying to address. > > can the ones that want to try to use the new features should be able > > to fall back when that feature isn't available in a non-ambiguous > > and precisely defined way. > > > > The fact that the use of the new feature is hidden in the new > > structure elements is really rotten. > > > > This patch, at best, needs some work and definitely a longer and more > > detailed commit message. > FWIW, before we decide on a course of action, I think I need to point out that, over the last 10 years, we've extended this structure 6 times, in the following commits: 0f3fffd8ab1db 7e8616d8e7731 e1cdd553d482c 35ea82d611da5 c95129d127c6d b444153fb5a64 The first two I believe were modifications during a period when sctp was actually getting integrated to the kernel, but the last 4 were definately done during more recent development periods and wen't in without any commentary about the impact to UAPI compatibility. The check for optlen > sizeof(struct sctp_event_subscribe) was made back in 2008, and while not spelled out, seems pretty clearly directed at enforcing compatibility with older appliations, not compatibility with newer applications running on older kernels. I really worry about situations in which we need to support applications expecting features that the running kernel doesn't have. In this particular situation it seems like a fixable thing, but I could envision situations in which we just can't do it, and I don't want to set that expectation when we can't consistently meet it. So, if the consensus is that we need to support applications built on newer kernels, but run on older kernels (and I'd like to get verbal consensus on that), then we need to identify a method to fix this. I'm still hesitant to do anything that involves us accepting any size buffer over the kernel expected size, as that puts us in a position to have to read large amounts of user data (i.e. possible DOS), and just picking an arbitrary large number to limit the buffer size seems wrong. What if, on receipt of a structure from a newer kernel (implying a size larger than what the kernel expects), we clamp optlen to the kernel size, and put_user it back to the application? i.e. we don't check any data above and beyond what the the kernel knows about, but we use the optlen as an indicator to user space that not all the data was processed? That allows the kernel to ignore the overage safely, and while its not in the socket api extension RFC, its not violating anything, and is something we can document in the sctp(7) man page as a linux only behavior. Thoughts? Neil