Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1163038AbdD0QXF (ORCPT ); Thu, 27 Apr 2017 12:23:05 -0400 Received: from mx1.redhat.com ([209.132.183.28]:60990 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751819AbdD0QW7 (ORCPT ); Thu, 27 Apr 2017 12:22:59 -0400 DMARC-Filter: OpenDMARC Filter v1.3.2 mx1.redhat.com E918923E6F1 Authentication-Results: ext-mx05.extmail.prod.ext.phx2.redhat.com; dmarc=none (p=none dis=none) header.from=redhat.com Authentication-Results: ext-mx05.extmail.prod.ext.phx2.redhat.com; spf=pass smtp.mailfrom=oleg@redhat.com DKIM-Filter: OpenDKIM Filter v2.11.0 mx1.redhat.com E918923E6F1 Date: Thu, 27 Apr 2017 18:22:54 +0200 From: Oleg Nesterov To: Kirill Tkhai Cc: serge@hallyn.com, ebiederm@xmission.com, agruenba@redhat.com, linux-api@vger.kernel.org, linux-kernel@vger.kernel.org, paul@paul-moore.com, viro@zeniv.linux.org.uk, avagin@openvz.org, linux-fsdevel@vger.kernel.org, mtk.manpages@gmail.com, akpm@linux-foundation.org, luto@amacapital.net, gorcunov@openvz.org, mingo@kernel.org, keescook@chromium.org Subject: Re: [PATCH 2/2] pid_ns: Introduce ioctl to set vector of ns_last_pid's on ns hierarhy Message-ID: <20170427162254.GB19579@redhat.com> References: <149245014695.17600.12640895883798122726.stgit@localhost.localdomain> <149245057248.17600.1341652606136269734.stgit@localhost.localdomain> <20170426155352.GA12131@redhat.com> <785e1986-da03-72aa-06c0-234ed2dbc0fd@virtuozzo.com> <20170427161255.GA19350@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.24 (2015-08-30) X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.29]); Thu, 27 Apr 2017 16:22:59 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1888 Lines: 48 On 04/27, Kirill Tkhai wrote: > > On 27.04.2017 19:12, Oleg Nesterov wrote: > > On 04/26, Kirill Tkhai wrote: > >> > >> On 26.04.2017 18:53, Oleg Nesterov wrote: > >>> > >>>> +static long set_last_pid_vec(struct pid_namespace *pid_ns, > >>>> + struct pidns_ioc_req *req) > >>>> +{ > >>>> + char *str, *p; > >>>> + int ret = 0; > >>>> + pid_t pid; > >>>> + > >>>> + read_lock(&tasklist_lock); > >>>> + if (!pid_ns->child_reaper) > >>>> + ret = -EINVAL; > >>>> + read_unlock(&tasklist_lock); > >>>> + if (ret) > >>>> + return ret; > >>> > >>> why do you need to check ->child_reaper under tasklist_lock? this looks pointless. > >>> > >>> In fact I do not understand how it is possible to hit pid_ns->child_reaper == NULL, > >>> there must be at least one task in this namespace, otherwise you can't open a file > >>> which has f_op == ns_file_operations, no? > >> > >> Sure, it's impossible to pick a pid_ns, if there is no the pid_ns's tasks. I added > >> it under impression of > >> https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/commit/?id=dfda351c729733a401981e8738ce497eaffcaa00 > >> but here it's completely wrong. It will be removed in v2. > > > > Hmm. But if I read this commit correctly then we really need to check > > pid_ns->child_reaper != NULL ? > > > > Currently we can't pick an "empty" pid_ns. But after the commit above a task > > can do sys_unshare(CLONE_NEWPID), another (or the same) task can open its > > /proc/$pid/ns/pid_for_children and call ns_ioctl() before the 1st alloc_pid() ? > > Another task can't open /proc/$pid/ns/pid_for_children before the 1st alloc_pid(), > because pid_for_children is available to open only after the 1st alloc_pid(). > So, it's impossible to call ioctl() on it. Ah, OK, I didn't notice the ns->child_reaper check in pidns_for_children_get(). But note that it doesn't need tasklist_lock too. Oleg.