Received: by 2002:ac0:8c9a:0:0:0:0:0 with SMTP id r26csp5519982ima; Tue, 5 Feb 2019 13:11:44 -0800 (PST) X-Google-Smtp-Source: AHgI3Ib0xDipkVXzx4RoBmFTkr36GrPxF3TQrWUMk9714/akRP6oPZntSSqxkNVnM/QBQlqaxeEr X-Received: by 2002:a17:902:eb03:: with SMTP id cw3mr7186229plb.130.1549401104631; Tue, 05 Feb 2019 13:11:44 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1549401104; cv=none; d=google.com; s=arc-20160816; b=Uf5xOHe9qDzaHXUrGMNYEZ9mfpKn2PX8eHbDP5mRYdmoN5F9FvhDUR3F/Gi4eMdQ1T nWevZKf3BH3PdDJT+WpSTlWTSDDN/MjivAcnuzKbgMX3g/GhUqS5HUhKj73zNdKG+fH2 Kn2ILXiOwRbx+RZPKeV6H1W/w1XcZ2qOp3OJTbcL8qnY3xTmuNiU8JgOqvviIGgT49xa Jra6FNQi8i1pse+3ncxvNdbB3Ih6CmOJ6OkS0UJFe7beraTLKKY0H5RAUiOI/2r8FINp VK0id5bZ++HYihFsTTsIq76WwFUsOhguz40l4udOSq603U3Fl29O3gvY1SC99qmwOlyw i/2g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:date; bh=cjKwRsT1iza9J1uasInPZMTAoUm4GywmlFpCbDtraMY=; b=m67qGfA5Pata5XrVpJA33EJoiF+XeFg8QyxrecAeYQGOMg0RArX4wt9J8/d5gqQNhu zrFMIFtXUS4rSdDl3+tR/pGoQw1zviv/OVOkI6mtMJwz7q+IiTcTmzTrbSNpIXZjwfvr 05FcwFBy2kZL08NNDSKCuVwQX85bgz9oovq9qT35oXSgUYjec9KE+JrWQbtnxX9pYnMV 6iQGouWmPfG8k6R1ktTpGbU1j2iMS8x1uMcMTgmPrU8MCvdqYl8g/5QJM6ye7n1pWXvc JNTMMKY+N7z0zzUwSn8yjDjIrly9JeWaHYAa/cPmoVi1DMfDsKrbX0fJHarTBYW6mdqx XKyA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id v10si4308773plg.82.2019.02.05.13.11.28; Tue, 05 Feb 2019 13:11:44 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728199AbfBEVLW (ORCPT + 99 others); Tue, 5 Feb 2019 16:11:22 -0500 Received: from mail.linuxfoundation.org ([140.211.169.12]:47098 "EHLO mail.linuxfoundation.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727232AbfBEVLW (ORCPT ); Tue, 5 Feb 2019 16:11:22 -0500 Received: from akpm3.svl.corp.google.com (unknown [104.133.8.65]) by mail.linuxfoundation.org (Postfix) with ESMTPSA id 8BDCCA6C1; Tue, 5 Feb 2019 21:11:20 +0000 (UTC) Date: Tue, 5 Feb 2019 13:11:19 -0800 From: Andrew Morton To: Ivan Delalande Cc: Al Viro , Dmitry Safonov <0x7f454c46@gmail.com>, Oleg Nesterov , linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, Andy Lutomirski Subject: Re: [PATCH v2] exec: don't force_sigsegv processes with a pending fatal signal Message-Id: <20190205131119.3e388a0a1a69c0a041ed87ef@linux-foundation.org> In-Reply-To: <20190205025308.GA24455@visor> References: <20190205025308.GA24455@visor> X-Mailer: Sylpheed 3.6.0 (GTK+ 2.24.31; x86_64-pc-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 4 Feb 2019 18:53:08 -0800 Ivan Delalande wrote: > We were seeing unexplained segfaults in coreutils processes and other > basic utilities on systems with print-fatal-signals enabled: > > [ 311.001986] potentially unexpected fatal signal 11. > [ 311.001993] CPU: 3 PID: 4565 Comm: tail Tainted: P O 4.9.100.Ar-8497547.eostrunkkernel49 #1 > [ 311.001995] task: ffff88021431b400 task.stack: ffffc90004cec000 > [ 311.001997] RIP: 0023:[<00000000f7722c09>] [<00000000f7722c09>] 0xf7722c09 > [ 311.002003] RSP: 002b:00000000ffcc8aa4 EFLAGS: 00000296 > [ 311.002004] RAX: fffffffffffffff2 RBX: 0000000057efc530 RCX: 0000000057efdb68 > [ 311.002006] RDX: 0000000057effb60 RSI: 0000000057efdb68 RDI: 00000000f768f000 > [ 311.002007] RBP: 0000000057efc530 R08: 0000000000000000 R09: 0000000000000000 > [ 311.002008] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 > [ 311.002009] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 > [ 311.002011] FS: 0000000000000000(0000) GS:ffff88021e980000(0000) knlGS:0000000000000000 > [ 311.002013] CS: 0010 DS: 002b ES: 002b CR0: 0000000080050033 > [ 311.002014] CR2: 00000000f77bf097 CR3: 0000000150f6f000 CR4: 00000000000406f0 > > We tracked these crashes down to binfmt_elf failing to load segments > for ld.so inside the kernel. Digging further, the actual problem > seems to occur when a process gets sigkilled while it is still being > loaded by the kernel. In our case when _do_page_fault goes for a retry > it will return early as it first checks for fatal_signal_pending(), so > load_elf_interp also returns with error and as a result > search_binary_handler will force_sigsegv() which is pretty confusing as > nothing actually failed here. > > > v2: add a message when load_binary fails, add a check for fatal signals > in signal_delivered (avoiding a single check in force_sigsegv as other > architectures use it directly and may have different expectations). > > Thanks to Dmitry Safonov and Oleg Nesterov for their comments and > suggestions. > > ... > > --- a/fs/exec.c > +++ b/fs/exec.c > @@ -1660,7 +1660,12 @@ int search_binary_handler(struct linux_binprm *bprm) > if (retval < 0 && !bprm->mm) { > /* we got to flush_old_exec() and failed after it */ > read_unlock(&binfmt_lock); > - force_sigsegv(SIGSEGV, current); > + if (!fatal_signal_pending(current)) { > + if (print_fatal_signals) > + pr_info("load_binary() failed: %d\n", > + retval); Should we be using print_fatal_signal() here? > + force_sigsegv(SIGSEGV, current); > + } > return retval; > } > if (retval != -ENOEXEC || !bprm->file) { > diff --git a/kernel/signal.c b/kernel/signal.c > index e1d7ad8e6ab1..674076e63624 100644 > --- a/kernel/signal.c > +++ b/kernel/signal.c > @@ -2552,10 +2552,10 @@ static void signal_delivered(struct ksignal *ksig, int stepping) > > void signal_setup_done(int failed, struct ksignal *ksig, int stepping) > { > - if (failed) > - force_sigsegv(ksig->sig, current); > - else > + if (!failed) > signal_delivered(ksig, stepping); > + else if (!fatal_signal_pending(current)) > + force_sigsegv(ksig->sig, current); > }