Received: by 2002:ac0:98c7:0:0:0:0:0 with SMTP id g7-v6csp933231imd; Thu, 1 Nov 2018 07:48:56 -0700 (PDT) X-Google-Smtp-Source: AJdET5eF1owZGQpUqHtl77FcQ7vu9y8IzDAEiyVtBoy3x76S+rA82TWAnb9bnA88jxhqSnmAd1Pi X-Received: by 2002:a17:902:a5c6:: with SMTP id t6-v6mr7994402plq.31.1541083736881; Thu, 01 Nov 2018 07:48:56 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1541083736; cv=none; d=google.com; s=arc-20160816; b=dhfwzreMbWDkmF9JZehI0cHuq9voCW7N+ZsMOwokUnZbIvGu2zlU3wmei2NAOGKazQ FLyDjEzFzEa/EK4WLpXSoVhtjU6BMasuCCu84Lr2oIPI8whpJhT7XvDe94IJGr5/YWVL Zrn8KUJp8M7rJWdCVp89YXhZ5Q+5NZL4ZHi8ximD0QuFYnnXX8ZxfTQBVtb6YIwQQjjQ uOgLCmFUrxm6xyZ76A1HfP6ddAQN5pBiRMdTKYdnrtyV4IpmU46JN5Q9IMMNsz4+6rgu zcf/AMGRs9rnqBbyzoLKegJS6VGx6lvvOpyXh09308n1/3X3KvaVSKhXTu9eo46NMFMC LZvQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=eAt7h6WuZlXy+9SW3pgwyJPBb1bKS+PDEgrlI0CDrJM=; b=J6UGMAK+uv4MK3UbOzH2FqqXMGj89yPCtZ7GTg7XZ+Gns/A7Vyh5g8LaCoaCCtoGjM aks4Hy1ooVn14/K38N1h11MOZGCyDtPrbXEOE0/4ljUvRihNAKS0nBi3TUiJCnH2c+f+ 6xqL4pBLN6Sb68+ab6mFHbrJGu4F4X7mxUEYMmorrlDIO8qoe32Pl6J2uAWff9lzIE40 vej7GSwAY28b2k/7UaFSB66f20QMSDQykjhTi5tLNGQ6tLEOKaZtWbVG8oWWJbw5YBVY FgZTHYx3PKBSt664QrCNbiNrfcjadD3svzTV/M6J3oy6sFIdXpICp8AEl4PwXvxbk96Q 3c4g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e2-v6si28608013pgm.95.2018.11.01.07.48.40; Thu, 01 Nov 2018 07:48:56 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728649AbeKAXv1 (ORCPT + 99 others); Thu, 1 Nov 2018 19:51:27 -0400 Received: from mx1.redhat.com ([209.132.183.28]:33156 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728264AbeKAXv1 (ORCPT ); Thu, 1 Nov 2018 19:51:27 -0400 Received: from smtp.corp.redhat.com (int-mx02.intmail.prod.int.phx2.redhat.com [10.5.11.12]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id DB2E958E35; Thu, 1 Nov 2018 14:48:09 +0000 (UTC) Received: from dhcp-27-174.brq.redhat.com (unknown [10.43.17.31]) by smtp.corp.redhat.com (Postfix) with SMTP id 4D85E60C47; Thu, 1 Nov 2018 14:48:07 +0000 (UTC) Received: by dhcp-27-174.brq.redhat.com (nbSMTP-1.00) for uid 1000 oleg@redhat.com; Thu, 1 Nov 2018 15:48:08 +0100 (CET) Date: Thu, 1 Nov 2018 15:48:05 +0100 From: Oleg Nesterov To: Tycho Andersen Cc: Kees Cook , Andy Lutomirski , "Eric W . Biederman" , "Serge E . Hallyn" , Christian Brauner , Tyler Hicks , Akihiro Suda , Aleksa Sarai , linux-kernel@vger.kernel.org, containers@lists.linux-foundation.org, linux-api@vger.kernel.org Subject: Re: [PATCH v8 1/2] seccomp: add a return code to trap to userspace Message-ID: <20181101144804.GD23232@redhat.com> References: <20181029224031.29809-1-tycho@tycho.ws> <20181029224031.29809-2-tycho@tycho.ws> <20181030143235.GA3385@redhat.com> <20181030153231.GB7343@cisco> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20181030153231.GB7343@cisco> User-Agent: Mutt/1.5.24 (2015-08-30) X-Scanned-By: MIMEDefang 2.79 on 10.5.11.12 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.39]); Thu, 01 Nov 2018 14:48:10 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 10/30, Tycho Andersen wrote: > > > I am not sure I understand the value of signaled/SECCOMP_NOTIF_FLAG_SIGNALED... > > I mean, why it is actually useful? > > > > Sorry if this was already discussed. > > :) no problem, many people have complained about this. This is an > implementation of Andy's suggestion here: > https://lkml.org/lkml/2018/3/15/1122 > > You can see some more detailed discussion here: > https://lkml.org/lkml/2018/9/21/138 Cough, sorry, I simply can't understand what are you talking about ;) It seems that I need to read all the previous emails... So let me ask a stupid question below. > > But my main concern is that either way wait_for_completion_killable() allows > > to trivially create a process which doesn't react to SIGSTOP, not good... > > > > Note also that this can happen if, say, both the tracer and tracee run in the > > same process group and SIGSTOP is sent to their pgid, if the tracer gets the > > signal first the tracee won't stop. > > > > Of freezer. try_to_freeze_tasks() can fail if it freezes the tracer before > > it does SECCOMP_IOCTL_NOTIF_SEND. > > I think in general the way this is intended to be used these things > wouldn't happen. Why? > was malicious and had the ability to create a user namespace to > exhaust pids this way, Not sure I understand how this connects to my question... nevermind. > so perhaps we should drop this part of the > patch. I have no real need for it, but perhaps Andy can elaborate? Yes I think it would be nice to avoid wait_for_completion_killable(). So please help me to understand the problem. Once again, why can not seccomp_do_user_notification() use wait_for_completion_interruptible() only? This is called before the task actually starts the syscall, so -ERESTARTNOINTR if signal_pending() can't hurt. Now lets suppose seccomp_do_user_notification() simply does err = wait_for_completion_interruptible(&n.ready); if (err < 0 && state != SECCOMP_NOTIFY_REPLIED) { syscall_set_return_value(ERESTARTNOINTR); list_del(&n.list); return -1; } (I am ignoring the locking/etc). Now the obvious problem is that the listener doing SECCOMP_IOCTL_NOTIF_SEND can't distinguish -ENOENT from the case when the tracee was killed, yes? Is it that important? Any other problem? Oleg.