Date: Tue, 24 Sep 2019 22:07:36 +0200
From: Christian Brauner
To: "Michael Kerrisk (man-pages)"
Cc: Florian Weimer, Oleg Nesterov, Jann Horn, "Eric W.
 Biederman", Daniel Colascione, Joel Fernandes, linux-man, Linux API, lkml
Subject: Re: For review: pidfd_send_signal(2) manual page
Message-ID: <20190924200735.2dvqhan7ynnmfc7s@wittgenstein>
References: <87pnjr9rth.fsf@mid.deneb.enyo.de>
 <20190923142325.jowzbnwjw7g7si7j@wittgenstein>
 <90dd38d5-34b3-b72f-8e5a-b51f944f22fb@gmail.com>
 <20190924195701.7pw2olbviieqsg5q@wittgenstein>
In-Reply-To: <20190924195701.7pw2olbviieqsg5q@wittgenstein>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, Sep 24, 2019 at 09:57:04PM +0200, Christian Brauner wrote:
> On Tue, Sep 24, 2019 at 09:44:49PM +0200, Michael Kerrisk (man-pages) wrote:
> > Hello Christian,
> >
> > On 9/23/19 4:23 PM, Christian Brauner wrote:
> > > On Mon, Sep 23, 2019 at 01:26:34PM +0200, Florian Weimer wrote:
> > >> * Michael Kerrisk:
> > >>
> > >>> SYNOPSIS
> > >>>     int pidfd_send_signal(int pidfd, int sig, siginfo_t *info,
> > >>>                           unsigned int flags);
> > >>
> > >> This probably should reference a header for siginfo_t.
> > >
> > > Agreed.
> > >
> > >>
> > >>> ESRCH The target process does not exist.
> > >>
> > >> If the descriptor is valid, does this mean the process has been waited
> > >> for? Maybe this can be made more explicit.
> > >
> > > If by valid you mean "refers to a process/thread-group leader" aka is a
> > > pidfd then yes: Getting ESRCH means that the process has exited and has
> > > already been waited upon.
> > > If it had only exited but not waited upon aka is a zombie, then sending
> > > a signal will just work because that's currently how sending signals to
> > > zombies works, i.e. if you only send a signal and don't do any
> > > additional checks you won't notice a difference between a process being
> > > alive and a process being a zombie.
> > > The userspace visible behavior in terms of signaling them is identical.
> >
> > (Thanks for the clarification. I added the text "(i.e., it has
> > terminated and been waited on)" to the ESRCH error.)
> >
> > >>> The pidfd_send_signal() system call allows the avoidance of race
> > >>> conditions that occur when using traditional interfaces (such as
> > >>> kill(2)) to signal a process. The problem is that the traditional
> > >>> interfaces specify the target process via a process ID (PID), with
> > >>> the result that the sender may accidentally send a signal to the
> > >>> wrong process if the originally intended target process has
> > >>> terminated and its PID has been recycled for another process. By
> > >>> contrast, a PID file descriptor is a stable reference to a specific
> > >>> process; if that process terminates, then the file descriptor
> > >>> ceases to be valid and the caller of pidfd_send_signal() is
> > >>> informed of this fact via an ESRCH error.
> > >>
> > >> It would be nice to explain somewhere how you can avoid the race using
> > >> a PID descriptor. Is there anything else besides CLONE_PIDFD?
> > >
> > > If you're the parent of the process you can do this without CLONE_PIDFD:
> > > pid = fork();
> > > pidfd = pidfd_open();
> > > ret = pidfd_send_signal(pidfd, 0, NULL, 0);
> > > if (ret < 0 && errno == ESRCH)
> > >         /* pidfd refers to another, recycled process */
> >
> > Although there is still the race between the fork() and the
> > pidfd_open(), right?
>
> Actually no and my code is even too complex.
> If you are the parent, and this is really a sequence that obeys the
> ordering pidfd_open() before waiting:
>
> pid = fork();
> if (pid == 0)
>         exit(EXIT_SUCCESS);
> pidfd = pidfd_open(pid, 0);
> waitid(pid, ...);
>
> Then you are guaranteed that pidfd will refer to pid.
> No recycling can happen since the process has not been waited upon yet
> (That is, excluding special cases such as where you have a mainloop where a
> callback reacts to a SIGCHLD event and waits on the child behind your
> back and your next callback in the mainloop calls pidfd_open() while the
> pid has been recycled etc.).

If we wanted to be super nitpicky, you could also end up in that situation
if you do:

signal(SIGCHLD, SIG_IGN);

// or

struct sigaction sa;
sa.sa_handler = SIG_IGN;
sigemptyset(&sa.sa_mask);
sa.sa_flags = 0;
sigaction(SIGCHLD, &sa, NULL);

pid = fork();
if (pid == 0)
        exit(EXIT_SUCCESS);
pidfd = pidfd_open(pid, 0);

because then the process gets autoreaped and its pid can be recycled before
pidfd_open() runs. But again, that's just bad form and in that scenario one
should again use clone(CLONE_PIDFD) instead of fork().

Christian
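
For anyone who wants to try the parent-side sequence discussed above, here
is a rough, untested-in-anger sketch. It is only an illustration under a few
assumptions: the wrapper names sys_pidfd_open() and sys_pidfd_send_signal()
and the helper check_pidfd_before_and_after_wait() are made up for this
example, the fallback syscall numbers (434 and 424) are the ones used by
architectures on the common syscall table such as x86-64 and arm64, and the
kernel must be new enough to provide both syscalls. It sends signal 0 once
before the child is reaped and once after, expecting success and then ESRCH.

```c
#define _GNU_SOURCE
#include <errno.h>
#include <signal.h>
#include <stdlib.h>
#include <sys/syscall.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Assumption: fallback numbers for older headers (x86-64, arm64, ...). */
#ifndef SYS_pidfd_send_signal
#define SYS_pidfd_send_signal 424
#endif
#ifndef SYS_pidfd_open
#define SYS_pidfd_open 434
#endif

/* Hypothetical thin wrappers around the raw syscalls. */
static int sys_pidfd_open(pid_t pid, unsigned int flags)
{
        return (int)syscall(SYS_pidfd_open, pid, flags);
}

static int sys_pidfd_send_signal(int pidfd, int sig, siginfo_t *info,
                                 unsigned int flags)
{
        return (int)syscall(SYS_pidfd_send_signal, pidfd, sig, info, flags);
}

/*
 * Returns 0 if signal 0 succeeds before the child is reaped and fails
 * with ESRCH afterwards, 1 if the signal unexpectedly still succeeds
 * after reaping, and a negative errno value on any other failure.
 */
static int check_pidfd_before_and_after_wait(void)
{
        siginfo_t info;
        int pidfd, ret = 0;
        pid_t pid;

        /* Make sure the child is not autoreaped (the SIG_IGN case). */
        signal(SIGCHLD, SIG_DFL);

        pid = fork();
        if (pid < 0)
                return -errno;
        if (pid == 0)
                _exit(EXIT_SUCCESS);

        /* Not waited on yet, so pid cannot have been recycled. */
        pidfd = sys_pidfd_open(pid, 0);
        if (pidfd < 0) {
                ret = -errno;
                waitid(P_PID, pid, &info, WEXITED);
                return ret;
        }

        /* Alive or zombie: signal 0 is expected to succeed either way. */
        if (sys_pidfd_send_signal(pidfd, 0, NULL, 0) < 0)
                ret = -errno;

        /* Reap the child; only from here on may the pid be recycled. */
        if (waitid(P_PID, pid, &info, WEXITED) < 0 && ret == 0)
                ret = -errno;

        /* The pidfd still refers to the old, now reaped process. */
        if (ret == 0) {
                if (sys_pidfd_send_signal(pidfd, 0, NULL, 0) == 0)
                        ret = 1;
                else if (errno != ESRCH)
                        ret = -errno;
        }

        close(pidfd);
        return ret;
}
```

Calling check_pidfd_before_and_after_wait() from main() and checking for a
return value of 0 should be enough to see the behavior; a result of -ENOSYS
would suggest the running kernel predates one of the two syscalls.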