Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp3775059ybv; Tue, 25 Feb 2020 07:06:57 -0800 (PST) X-Google-Smtp-Source: APXvYqxSMKGKlNC5iEfOmCJML2M2SZ0bIc9pRIo7csoj6adwUweHjG8AO8eg2PwEjeTF1gk+7CxL X-Received: by 2002:aca:2112:: with SMTP id 18mr3743583oiz.155.1582643217098; Tue, 25 Feb 2020 07:06:57 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1582643217; cv=none; d=google.com; s=arc-20160816; b=mH9Ty98lSKSdy26SF0uKBzQVi8rO7f5TblkIWUhLjopFzfp1JNY3OhpWZTdhkm1Noi E32uEyiSqNZn3ZdddYPPC0BRkN03lJyC6KJGCB5+nCo6r1T1pY0IMsuhJjNpTdMXCTUs Pt9HHeVGMeYWYsvpFt4UW+ztZ8CP7JJWb4d1jX1W+SMxDZ6VI+4kfwC2V4w4JuzW4gBk AcrTolDcnSDSaMsKXrZwfB4q49acHKmUM1rxVcJthqTB/TC2Q2cNN4J3lw3TFfvSNbro 7ZRxN8OtW4v73HiTR+69UryaKVDRx1mXr3sPLBfowENDb/IG0N2hFDO3o94Zfy7zxS6E ex2w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from:dkim-signature; bh=QMvNXoW6psDe1PxL4MB925CUo6es0He3gLgCnJQuh1A=; b=M81+ZvkqpwfzZxVNeLC3ncmeQfkA1HQzOfCHq88Plq+2NE+Oh/77yCnp2WwBZ51DRO a2eWdtbSlRD4U/4kzYpoNSAbrix4O1DFR+PruYpCXRpgLAgOY/vIr2AbTLWG4vrjHLCh cRWwfzxKhHvEccWYIbT01vjCBlR+/7qpkw06irYtT16sWcGzXeBigE/QaRHSJcCfJKFt hOW80rKM/bD9mZ4jXzNovFjLHYKADIg/lX4laQ2zEwFIuIPRduD+j6IQJAeojqm+tiox uW8XnXm7RizoB1U/MewenjOAyz24XRw6RuKp8+kXpqTGrURhgjRXxcuGIpQd7+5f7YYE HHhw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=c3bkCqEm; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id l15si7292809oic.220.2020.02.25.07.06.34; Tue, 25 Feb 2020 07:06:57 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=c3bkCqEm; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730910AbgBYOyh (ORCPT + 99 others); Tue, 25 Feb 2020 09:54:37 -0500 Received: from us-smtp-delivery-1.mimecast.com ([205.139.110.120]:50661 "EHLO us-smtp-1.mimecast.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1729510AbgBYOyh (ORCPT ); Tue, 25 Feb 2020 09:54:37 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1582642475; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version: content-transfer-encoding:content-transfer-encoding; bh=QMvNXoW6psDe1PxL4MB925CUo6es0He3gLgCnJQuh1A=; b=c3bkCqEmPrVmJaJ+fZHMWu/HL1vfXxQAqvcqfDCFg6n6mJBsIEqlRdp8PqTQ1mjOguuKAU EW40y584/p/1WHJgiCMGJW5UukUwNT6S3lBj2NJsgaz5i45GP/ooYdBdfXQRcxA9XYTBAy okFg7x1nLgpX5Y9jP7QMNtYENcGuYpM= Received: from mimecast-mx01.redhat.com (mimecast-mx01.redhat.com [209.132.183.4]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-266-hv_p7hdhOdCQZbWjrxSTCQ-1; Tue, 25 Feb 2020 09:54:28 -0500 X-MC-Unique: hv_p7hdhOdCQZbWjrxSTCQ-1 Received: from smtp.corp.redhat.com (int-mx05.intmail.prod.int.phx2.redhat.com [10.5.11.15]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx01.redhat.com (Postfix) with ESMTPS id 45DEC102CE15; Tue, 25 Feb 2020 14:54:27 +0000 (UTC) Received: from lithium.redhat.com (unknown [10.36.118.11]) by smtp.corp.redhat.com (Postfix) with ESMTP id 545361BC6D; Tue, 25 Feb 2020 14:54:25 +0000 (UTC) From: Giuseppe Scrivano To: linux-kernel@vger.kernel.org Cc: rcu@vger.kernel.org, ebiederm@xmission.com, paulmck@kernel.org, viro@zeniv.linux.org.uk Subject: [PATCH v3] ipc: use a work queue to free_ipc Date: Tue, 25 Feb 2020 15:54:19 +0100 Message-Id: <20200225145419.527994-1-gscrivan@redhat.com> MIME-Version: 1.0 X-Scanned-By: MIMEDefang 2.79 on 10.5.11.15 Content-Transfer-Encoding: quoted-printable Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org the reason is to avoid a delay caused by the synchronize_rcu() call in kern_umount() when the mqueue mount is freed. the code: \#define _GNU_SOURCE \#include \#include \#include \#include int main() { int i; for (i =3D 0; i < 1000; i++) if (unshare (CLONE_NEWIPC) < 0) error (EXIT_FAILURE, errno, "unshare"); } gets from: Command being timed: "./ipc-namespace" User time (seconds): 0.00 System time (seconds): 0.06 Percent of CPU this job got: 0% Elapsed (wall clock) time (h:mm:ss or m:ss): 0:08.05 to: Command being timed: "./ipc-namespace" User time (seconds): 0.00 System time (seconds): 0.02 Percent of CPU this job got: 96% Elapsed (wall clock) time (h:mm:ss or m:ss): 0:00.03 Signed-off-by: Giuseppe Scrivano --- v3: - added comment for free_ipc_work - commit message rephrased v2: https://lkml.org/lkml/2020/2/17/839 - comment added in free_ipc_ns() v1: https://lkml.org/lkml/2020/2/11/692 include/linux/ipc_namespace.h | 2 ++ ipc/namespace.c | 24 ++++++++++++++++++++++-- 2 files changed, 24 insertions(+), 2 deletions(-) diff --git a/include/linux/ipc_namespace.h b/include/linux/ipc_namespace.= h index c309f43bde45..a06a78c67f19 100644 --- a/include/linux/ipc_namespace.h +++ b/include/linux/ipc_namespace.h @@ -68,6 +68,8 @@ struct ipc_namespace { struct user_namespace *user_ns; struct ucounts *ucounts; =20 + struct llist_node mnt_llist; + struct ns_common ns; } __randomize_layout; =20 diff --git a/ipc/namespace.c b/ipc/namespace.c index b3ca1476ca51..c0470aef41a0 100644 --- a/ipc/namespace.c +++ b/ipc/namespace.c @@ -117,6 +117,10 @@ void free_ipcs(struct ipc_namespace *ns, struct ipc_= ids *ids, =20 static void free_ipc_ns(struct ipc_namespace *ns) { + /* mq_put_mnt() waits for a grace period as kern_unmount() + * uses synchronize_rcu(). + */ + mq_put_mnt(ns); sem_exit_ns(ns); msg_exit_ns(ns); shm_exit_ns(ns); @@ -127,6 +131,21 @@ static void free_ipc_ns(struct ipc_namespace *ns) kfree(ns); } =20 +static LLIST_HEAD(free_ipc_list); +static void free_ipc(struct work_struct *unused) +{ + struct llist_node *node =3D llist_del_all(&free_ipc_list); + struct ipc_namespace *n, *t; + + llist_for_each_entry_safe(n, t, node, mnt_llist) + free_ipc_ns(n); +} + +/* + * The work queue is used to avoid the cost of synchronize_rcu in kern_u= nmount. + */ +static DECLARE_WORK(free_ipc_work, free_ipc); + /* * put_ipc_ns - drop a reference to an ipc namespace. * @ns: the namespace to put @@ -148,8 +167,9 @@ void put_ipc_ns(struct ipc_namespace *ns) if (refcount_dec_and_lock(&ns->count, &mq_lock)) { mq_clear_sbinfo(ns); spin_unlock(&mq_lock); - mq_put_mnt(ns); - free_ipc_ns(ns); + + if (llist_add(&ns->mnt_llist, &free_ipc_list)) + schedule_work(&free_ipc_work); } } =20 --=20 2.24.1