From: Christian Brauner
To: linux-kernel@vger.kernel.org
Cc: Christian Brauner, "Peter Zijlstra (Intel)", Ingo Molnar,
    Thomas Gleixner, Oleg Nesterov, "Eric W. Biederman", Kees Cook,
    Sargun Dhillon, Aleksa Sarai, linux-kselftest@vger.kernel.org,
    Josh Triplett, Jens Axboe, linux-api@vger.kernel.org,
    Christian Brauner
Subject: [PATCH v2 0/4] Support non-blocking pidfds
Date: Wed, 2 Sep 2020 12:21:26 +0200
Message-Id: <20200902102130.147672-1-christian.brauner@ubuntu.com>
X-Mailing-List: linux-kernel@vger.kernel.org

Hi,

Passing a non-blocking pidfd to waitid() currently has no effect, i.e. it is
not supported.
There are users who would like to mix O_NONBLOCK pidfds with blocking pidfds
and pass both kinds to waitid(). The expected behavior is to have waitid()
return -EAGAIN for non-blocking pidfds and to block for blocking pidfds,
without needing to perform any additional checks for flags set on the pidfd
before passing it to waitid(). Non-blocking pidfds will return EAGAIN from
waitid() when no child process is ready yet. Returning -EAGAIN for
non-blocking pidfds makes it easier for event loops that handle EAGAIN
specially.

It also makes the API more consistent and uniform. In essence, waitid() is
treated like a read on a non-blocking pidfd or a recvmsg() on a non-blocking
socket. With the addition of support for non-blocking pidfds we support the
same functionality that sockets do. For sockets, recvmsg() supports
MSG_DONTWAIT; for pidfds, waitid() supports WNOHANG. Both flags are per-call
options. In contrast, non-blocking pidfds and non-blocking sockets are a
setting on an open file description, affecting all threads in the calling
process as well as other processes that hold file descriptors referring to
the same open file description. Both behaviors, per call and per open file
description, have genuine use-cases.

A concrete use-case that was brought up on-list (see [1]) was Josh's async
pidfd library. Ever since the introduction of pidfds and more advanced async
io, various programming languages such as Rust have grown support for async
event libraries. These libraries are created to help build epoll-based event
loops around file descriptors. A common pattern is to automatically set
O_NONBLOCK on all file descriptors they manage. For such libraries the EAGAIN
error code is treated specially. When a function is called that returns
EAGAIN, the function isn't called again until the event loop indicates that
the file descriptor is ready. Supporting EAGAIN when waiting on pidfds makes
such libraries just work with little effort.

Thanks!
Christian

[1]: https://lore.kernel.org/lkml/20200811181236.GA18763@localhost/

Christian Brauner (4):
  pidfd: support PIDFD_NONBLOCK in pidfd_open()
  exit: support non-blocking pidfds
  tests: port pidfd_wait to kselftest harness
  tests: add waitid() tests for non-blocking pidfds

 include/uapi/linux/pidfd.h                 |  12 +
 kernel/exit.c                              |  15 +-
 kernel/pid.c                               |  12 +-
 tools/testing/selftests/pidfd/pidfd.h      |   4 +
 tools/testing/selftests/pidfd/pidfd_wait.c | 298 +++++++++------------
 5 files changed, 157 insertions(+), 184 deletions(-)
 create mode 100644 include/uapi/linux/pidfd.h

base-commit: d012a7190fc1fd72ed48911e77ca97ba4521bccd
-- 
2.28.0
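
For illustration only, here is a minimal userspace sketch of the behavior the
cover letter describes; it is not code from the series. It assumes a kernel
with these patches applied, that PIDFD_NONBLOCK has the same value as
O_NONBLOCK, and that the libc headers provide SYS_pidfd_open. The names
sys_pidfd_open() and nonblocking_pidfd_demo() are hypothetical helpers made up
for this example. The child is parked on a pipe so that the first waitid() is
certain to find nothing to reap and should fail with EAGAIN rather than block;
the parent then waits for the pidfd to poll readable before retrying, which is
roughly what an event loop would do.

```c
#define _GNU_SOURCE
#include <assert.h>
#include <errno.h>
#include <fcntl.h>
#include <poll.h>
#include <signal.h>
#include <sys/syscall.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

#ifndef P_PIDFD
#define P_PIDFD 3	/* waitid() idtype for pidfds; older libcs lack it */
#endif
#ifndef PIDFD_NONBLOCK
#define PIDFD_NONBLOCK O_NONBLOCK	/* assumption: same value as O_NONBLOCK */
#endif

/* Hypothetical wrapper around the raw syscall. */
static int sys_pidfd_open(pid_t pid, unsigned int flags)
{
	return (int)syscall(SYS_pidfd_open, pid, flags);
}

/*
 * Hypothetical demo. Forks a child that stays alive until the parent closes
 * a pipe, so the first waitid() cannot find an exited child.
 * Returns 0 on success, or -errno on failure (for example -EINVAL or
 * -ENOSYS on kernels without PIDFD_NONBLOCK or pidfd_open()).
 * *first_errno receives errno from the first waitid() (0 if it succeeded);
 * *exit_status receives the child's exit status.
 */
static int nonblocking_pidfd_demo(int *first_errno, int *exit_status)
{
	struct pollfd pfd = { .events = POLLIN };
	siginfo_t info = { 0 };
	int pipefd[2], pidfd, err;
	pid_t pid;
	char c;

	assert(first_errno && exit_status);
	if (pipe(pipefd) < 0)
		return -errno;

	pid = fork();
	if (pid < 0) {
		err = errno;
		close(pipefd[0]);
		close(pipefd[1]);
		return -err;
	}
	if (pid == 0) {
		close(pipefd[1]);
		/* Blocks until the parent closes its write end (EOF). */
		if (read(pipefd[0], &c, 1) < 0)
			_exit(1);
		_exit(7);
	}
	close(pipefd[0]);

	pidfd = sys_pidfd_open(pid, PIDFD_NONBLOCK);
	if (pidfd < 0) {
		err = errno;
		close(pipefd[1]);	/* let the child exit, then reap it */
		waitpid(pid, NULL, 0);
		return -err;
	}

	/* Child still running: expect failure with EAGAIN, without blocking. */
	*first_errno = waitid(P_PIDFD, pidfd, &info, WEXITED) < 0 ? errno : 0;

	close(pipefd[1]);	/* release the child */

	/* Event-loop style: retry only after the pidfd polls readable. */
	pfd.fd = pidfd;
	err = *first_errno;
	while (err == EAGAIN) {
		if (poll(&pfd, 1, -1) < 0 && errno != EINTR) {
			err = errno;
			break;
		}
		err = waitid(P_PIDFD, pidfd, &info, WEXITED) < 0 ? errno : 0;
	}
	close(pidfd);
	if (err)
		return -err;
	*exit_status = info.si_status;
	return 0;
}
```

Called from a main() as nonblocking_pidfd_demo(&first_errno, &status), a
patched kernel should report first_errno == EAGAIN and a status of 7. The
caller never inspects the pidfd's flags before calling waitid(); the same
waitid() call would simply block had the pidfd been opened without
PIDFD_NONBLOCK.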