Received: by 2002:a25:c205:0:0:0:0:0 with SMTP id s5csp2009257ybf; Sun, 1 Mar 2020 23:48:44 -0800 (PST) X-Google-Smtp-Source: APXvYqw64+Te7XuvoVujGI3F75wvWwkWeTeliiKXLjopT9Ex5VsgQdUBP/5nkTZnrDC1APpY5f1U X-Received: by 2002:a05:6830:147:: with SMTP id j7mr12230898otp.12.1583135324644; Sun, 01 Mar 2020 23:48:44 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1583135324; cv=none; d=google.com; s=arc-20160816; b=jgrc02sJVKhR3frY155vCu+KRhxoUhNyudODygQnjZerH2Dx5JoId7mTsXjj+VJzFS ZvSkMQcbnYtAMcBoMlf4geuHqCPgP+8gxRjvRawgF0cF1B/KEylkZHBKHi2oEzAAB3Yq Ha8bD9511Pq5/4TIauthZ/Fq4H0F+PlTHNZ9mmBqh5Z6yNVhH8SC+2TryhCbnOYYZfR9 BbsDxOJGzTibGTo8n2IKWJLEWzhmiFwtan0VdZ7Rm6txE59G04AVi2pcurzUPvenlsA3 90OTXCnZiqAFoaawbsaNQ0A2a2NLOsDZFx4q/WqqzVS8anOdBrgRcuNLiLoVXioMZFHK yCrA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=JH74PY+qldTvxy2zqoakZ8OsCtPGaRrrxovMfRq8vsY=; b=JkdWSNw5SCz8rJBqVP6hqQ9b6hfQghhbJWE73wqGVq/2NzVipKjVzMKmHzfAOy9V1/ HFls0P2uJTjXbe+fC5ziTqu0Hu9g0eKKw2iVsuSCzc6vlH5EGrPcrb/Zjzg83FsQo3pK pRBCOeoilztuEl282ror9+i9Auds5kcx89SEiQ6hpwo5KGxWBhUSpGikbsvxjrde6Fot LvHxEFVEjbrl0haJZDZ9URcAksi0ZFCxRLKrcgmdHr80fwM/PdT/JgQnjt5z7eO5sXFj 8QBFwBVSaf9KfdZMThE6K5fIjvehooSn2SHWQF8+2ZIEnPJzMCpbCG9IYImaY/dapSWH YRuw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 45si5898660otg.7.2020.03.01.23.48.32; Sun, 01 Mar 2020 23:48:44 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726956AbgCBHs2 (ORCPT + 99 others); Mon, 2 Mar 2020 02:48:28 -0500 Received: from youngberry.canonical.com ([91.189.89.112]:47552 "EHLO youngberry.canonical.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725446AbgCBHs2 (ORCPT ); Mon, 2 Mar 2020 02:48:28 -0500 Received: from ip5f5bf7ec.dynamic.kabel-deutschland.de ([95.91.247.236] helo=wittgenstein) by youngberry.canonical.com with esmtpsa (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.86_2) (envelope-from ) id 1j8fns-0001Sk-QB; Mon, 02 Mar 2020 07:47:52 +0000 Date: Mon, 2 Mar 2020 08:47:51 +0100 From: Christian Brauner To: Jann Horn Cc: Bernd Edlinger , Jonathan Corbet , Alexander Viro , Andrew Morton , Alexey Dobriyan , "Eric W. Biederman" , Thomas Gleixner , Oleg Nesterov , Frederic Weisbecker , Andrei Vagin , Ingo Molnar , "Peter Zijlstra (Intel)" , Yuyang Du , David Hildenbrand , Sebastian Andrzej Siewior , Anshuman Khandual , David Howells , James Morris , Kees Cook , Greg Kroah-Hartman , Shakeel Butt , Jason Gunthorpe , Christian Kellner , Andrea Arcangeli , Aleksa Sarai , "Dmitry V. Levin" , "linux-doc@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "linux-fsdevel@vger.kernel.org" , "linux-mm@kvack.org" Subject: Re: [PATCH] exec: Fix a deadlock in ptrace Message-ID: <20200302074751.evhnq3b5zvtbaqu4@wittgenstein> References: <20200301185244.zkofjus6xtgkx4s3@wittgenstein> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sun, Mar 01, 2020 at 09:00:22PM +0100, Jann Horn wrote: > On Sun, Mar 1, 2020 at 7:52 PM Christian Brauner > wrote: > > On Sun, Mar 01, 2020 at 07:21:03PM +0100, Jann Horn wrote: > > > On Sun, Mar 1, 2020 at 12:27 PM Bernd Edlinger > > > wrote: > > > > The proposed solution is to have a second mutex that is > > > > used in mm_access, so it is allowed to continue while the > > > > dying threads are not yet terminated. > > > > > > Just for context: When I proposed something similar back in 2016, > > > https://lore.kernel.org/linux-fsdevel/20161102181806.GB1112@redhat.com/ > > > was the resulting discussion thread. At least back then, I looked > > > through the various existing users of cred_guard_mutex, and the only > > > places that couldn't be converted to the new second mutex were > > > PTRACE_ATTACH and SECCOMP_FILTER_FLAG_TSYNC. > > > > > > > > > The ideal solution would IMO be something like this: Decide what the > > > new task's credentials should be *before* reaching de_thread(), > > > install them into a second cred* on the task (together with the new > > > dumpability), drop the cred_guard_mutex, and let ptrace_may_access() > > > check against both. After that, some further restructuring might even > > > > Hm, so essentially a private ptrace_access_cred member in task_struct? > > And a second dumpability field, because that changes together with the > creds during execve. (Btw, currently the dumpability is in the > mm_struct, but that's kinda wrong. The mm_struct is removed from a > task on exit while access checks can still be performed against it, and > currently ptrace_may_access() just lets the access go through in that > case, which weakens the protection offered by PR_SET_DUMPABLE when > used for security purposes. I think it ought to be moved over into the > task_struct.) > > > That would presumably also involve altering various LSM hooks to look at > > ptrace_access_cred. > > When I tried to implement this in the past, I changed the LSM hook to > take the target task's cred* as an argument, and then called the LSM > hook twice from ptrace_may_access(). IIRC having the target task's > creds as an argument works for almost all the LSMs, with the exception > of Yama, which doesn't really care about the target task's creds, so > you have to pass in both the task_struct* and the cred*. It seems we should try PoCing this. Christian