Subject: Re: [PATCH] kernel/signal: Signal-based pre-coredump notification
Cc: Thomas Gleixner, Ingo Molnar, Borislav Petkov, "H. Peter Anvin",
 x86@kernel.org, Peter Zijlstra, Arnd Bergmann, "Eric W. Biederman",
 Khalid Aziz, Kate Stewart, Helge Deller, Greg Kroah-Hartman, Al Viro,
 Andrew Morton, Christian Brauner, Catalin Marinas, Will Deacon,
 Dave Martin, Mauro Carvalho Chehab, Michal Hocko, Rik van Riel,
 "Kirill A. Shutemov", Roman Gushchin, Marcos Paulo de Souza,
 Dominik Brodowski, Cyrill Gorcunov, Yang Shi, Jann Horn, Kees Cook,
 linux-kernel@vger.kernel.org, linux-arch@vger.kernel.org,
 "Victor Kamensky (kamensky)", xe-linux-external@cisco.com,
 Stefan Strogin, Eugene Syromiatnikov, Enke Chen
References: <20181015120521.GA10146@redhat.com> <20398328-4ee1-96b2-5723-4b7eed55f0a2@cisco.com>
From: Enke Chen
To: Oleg Nesterov
Message-ID: <079f5939-8a74-9026-8ec9-91af0b827a69@cisco.com>
Date: Mon, 15 Oct 2018 12:26:00 -0700
In-Reply-To: <20398328-4ee1-96b2-5723-4b7eed55f0a2@cisco.com>
X-Mailing-List: linux-kernel@vger.kernel.org

Hi, Oleg:

>> probably ->predump_signal should be cleared on exec?

As I replied to Jann, will do.

Thanks.  -- Enke

On 10/15/18 12:17 PM, Enke Chen wrote:
> Hi, Oleg:
>
> I missed some of your comments in my previous reply.
>
> On 10/15/18 5:05 AM, Oleg Nesterov wrote:
>> On 10/12, Enke Chen wrote:
>>>
>>> For simplicity and consistency, this patch provides an implementation
>>> for signal-based fault notification prior to the coredump of a child
>>> process.
>>> A new prctl command, PR_SET_PREDUMP_SIG, is defined that can
>>> be used by an application to express its interest and to specify the
>>> signal (SIGCHLD or SIGUSR1 or SIGUSR2) for such a notification. A new
>>> signal code (si_code), CLD_PREDUMP, is also defined for SIGCHLD.
>>
>> To be honest, I can't say I like this new feature...
>>
>>> --- a/include/linux/sched.h
>>> +++ b/include/linux/sched.h
>>> @@ -696,6 +696,10 @@ struct task_struct {
>>>  	int				exit_signal;
>>>  	/* The signal sent when the parent dies: */
>>>  	int				pdeath_signal;
>>> +
>>> +	/* The signal sent prior to a child's coredump: */
>>> +	int				predump_signal;
>>> +
>>
>> At least, I think predump_signal should live in signal_struct, not
>> task_struct.
>>
>> (pdeath_signal too, but it is too late to change (fix) this awkward API).
>>
>>> +static void do_notify_parent_predump(struct task_struct *tsk)
>>> +{
>>> +	struct sighand_struct *sighand;
>>> +	struct task_struct *parent;
>>> +	struct kernel_siginfo info;
>>> +	unsigned long flags;
>>> +	int sig;
>>> +
>>> +	parent = tsk->real_parent;
>>
>> So, the debugger won't be notified, only real_parent...
>>
>>> +	sig = parent->predump_signal;
>>
>> probably ->predump_signal should be cleared on exec?
>
>
> Is this not enough in "copy_process()"?
>
> @@ -1985,6 +1985,7 @@ static __latent_entropy struct task_struct *copy_process(
>  	p->dirty_paused_when = 0;
>
>  	p->pdeath_signal = 0;
> +	p->predump_signal = 0;
>
>>
>>> +	/* Check again with tasklist_lock locked by the caller */
>>> +	if (!valid_predump_signal(sig))
>>> +		return;
>>
>> I don't understand why we need valid_predump_signal() at all.
>
> Most of the signals have well-defined semantics, and would not be appropriate
> for this purpose. That is why it is limited to only SIGCHLD, SIGUSR1, SIGUSR2.
>
>>
>>>  bool get_signal(struct ksignal *ksig)
>>>  {
>>>  	struct sighand_struct *sighand = current->sighand;
>>> @@ -2497,6 +2535,19 @@ bool get_signal(struct ksignal *ksig)
>>>  		current->flags |= PF_SIGNALED;
>>>
>>>  		if (sig_kernel_coredump(signr)) {
>>> +			/*
>>> +			 * Notify the parent prior to the coredump if the
>>> +			 * parent is interested in such a notification.
>>> +			 */
>>> +			int p_sig = current->real_parent->predump_signal;
>>> +
>>> +			if (valid_predump_signal(p_sig)) {
>>> +				read_lock(&tasklist_lock);
>>> +				do_notify_parent_predump(current);
>>> +				read_unlock(&tasklist_lock);
>>> +				cond_resched();
>>
>> perhaps this should be called by do_coredump() after coredump_wait() kills
>> all the sub-threads?
>
> proc_coredump_connector(current) is located here; they should stay together.
>
> Thanks.  -- Enke
>
>>
>>> +static int prctl_set_predump_signal(struct task_struct *tsk, pid_t pid, int sig)
>>> +{
>>> +	struct task_struct *p;
>>> +	int error;
>>> +
>>> +	/* 0 is valid for disabling the feature */
>>> +	if (sig && !valid_predump_signal(sig))
>>> +		return -EINVAL;
>>> +
>>> +	/* For the current task, the common case */
>>> +	if (pid == 0) {
>>> +		tsk->predump_signal = sig;
>>> +		return 0;
>>> +	}
>>> +
>>> +	error = -ESRCH;
>>> +	rcu_read_lock();
>>> +	p = find_task_by_vpid(pid);
>>> +	if (p) {
>>> +		if (!set_predump_signal_perm(p))
>>> +			error = -EPERM;
>>> +		else {
>>> +			error = 0;
>>> +			p->predump_signal = sig;
>>> +		}
>>> +	}
>>> +	rcu_read_unlock();
>>> +	return error;
>>> +}
>>
>> Why? I mean, why do we really want to support the pid != 0 case?
>>
>> Oleg.
>>
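
For readers following the valid_predump_signal() question, the argument check at the top of prctl_set_predump_signal() can be modelled in user space. This is only a sketch and not code from the patch. The quoted hunks call valid_predump_signal() but never show its body, so the body below is an assumption based on the statement that the feature is limited to SIGCHLD, SIGUSR1 and SIGUSR2. The name check_predump_signal_arg() is hypothetical and stands in for the first test in prctl_set_predump_signal().

```c
#include <errno.h>
#include <signal.h>
#include <stdbool.h>

/*
 * Assumed body: the quoted patch only calls valid_predump_signal().
 * The accepted set is taken from the discussion in the thread.
 */
static bool valid_predump_signal(int sig)
{
	return sig == SIGCHLD || sig == SIGUSR1 || sig == SIGUSR2;
}

/*
 * Hypothetical user-space stand-in for the argument check in
 * prctl_set_predump_signal(): 0 is accepted because it disables the
 * feature, and any other signal outside the accepted set is rejected.
 */
static int check_predump_signal_arg(int sig)
{
	if (sig && !valid_predump_signal(sig))
		return -EINVAL;
	return 0;
}
```

With these definitions, check_predump_signal_arg(0) and check_predump_signal_arg(SIGUSR1) both return 0, while check_predump_signal_arg(SIGTERM) returns -EINVAL. Note that 0 passes the prctl check but fails valid_predump_signal(), which is why the get_signal() hunk skips the notification when the parent has not set a signal.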