Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp2089624yba; Fri, 19 Apr 2019 11:56:16 -0700 (PDT) X-Google-Smtp-Source: APXvYqxcG8bhnGl+W7muaDzOnT96fCJHODUgX1TY59KeUuzXyGfgT9n8rvkN3HPMKFzy8MaqN1tM X-Received: by 2002:a63:195e:: with SMTP id 30mr5415170pgz.312.1555700176878; Fri, 19 Apr 2019 11:56:16 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1555700176; cv=none; d=google.com; s=arc-20160816; b=t8x0WOVq39N/bfj+FH3NCJ0ndTNlOtLyCi33Tuj5z7pJWDbIlIbOHHMkNLdBJVqvsX g+11z5EDj+Z+mbjqLNc9T/BZBFUrahVebHR0lkc2YPQr2+bufl0+4ciGdaFheVeMw0HQ trWEx4olWyPjr89tPMRnLLO96nMuASBvnqBS/5ULuwQu/98oII+Ctf0ZsTU96BKB0ydS RjgSsMAuV1jg9t4BwrRoJ/08RlGx9NWAUrkItDPCFcvUyMLC++vCV6J6YTXoXIoRjyDW QE3c/8pwvI/ukvOWjaI1pJxsNxOuG/mzeqsrkfMlfk5iW9aqSsLYidOKkDSO75D62yRE 6CvQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=Fk75dbl/le5yDucTbM5af5yCCyE0PHHquu5ZXOG5sVE=; b=Hi6pveqKs81+PHoBhtW+ohcAoFA6H7HEnA89igZUverg9hbcwE7JEcNd4KBPU2yw0D q2Kvj2BTKhCHegPkViYzD31eFiAnNzdXdbw4EsaMtIF7PA+lHRsCRFVC/Pk1fHF/fmzr vBzLuVj7RFma5+3Mre0oBbAeBr400rT3hvRKB7ePZTO2/nI9u6fpotshJwUV0hoPoXfA EsMRmj4Kxj8BZkgZth2DgFeGAgHq2szcCwEQTzc3eziz2KZU9UpzFFR/5KbgqT/dwV6u rN65CCj+L8GFkxG29XVtys36U2yjIPyuEVyOSm8VFvZSyUR0U2Nmuon4k+e7MZZhZowC RbVg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 61si5844503plq.2.2019.04.19.11.56.01; Fri, 19 Apr 2019 11:56:16 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728466AbfDSSyb (ORCPT + 99 others); Fri, 19 Apr 2019 14:54:31 -0400 Received: from mx1.redhat.com ([209.132.183.28]:41622 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726417AbfDSSya (ORCPT ); Fri, 19 Apr 2019 14:54:30 -0400 Received: from smtp.corp.redhat.com (int-mx02.intmail.prod.int.phx2.redhat.com [10.5.11.12]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id D9B1F19D223; Fri, 19 Apr 2019 15:19:14 +0000 (UTC) Received: from dhcp-27-174.brq.redhat.com (unknown [10.43.17.38]) by smtp.corp.redhat.com (Postfix) with SMTP id 9124360BF7; Fri, 19 Apr 2019 15:19:13 +0000 (UTC) Received: by dhcp-27-174.brq.redhat.com (nbSMTP-1.00) for uid 1000 oleg@redhat.com; Fri, 19 Apr 2019 17:19:14 +0200 (CEST) Date: Fri, 19 Apr 2019 17:19:12 +0200 From: Oleg Nesterov To: Roman Gushchin Cc: Tejun Heo , kernel-team@fb.com, cgroups@vger.kernel.org, linux-kernel@vger.kernel.org, Roman Gushchin Subject: Re: [PATCH v10 4/9] cgroup: cgroup v2 freezer Message-ID: <20190419151912.GA12152@redhat.com> References: <20190405174708.1010-1-guro@fb.com> <20190405174708.1010-5-guro@fb.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20190405174708.1010-5-guro@fb.com> User-Agent: Mutt/1.5.24 (2015-08-30) X-Scanned-By: MIMEDefang 2.79 on 10.5.11.12 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.29]); Fri, 19 Apr 2019 15:19:14 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 04/05, Roman Gushchin wrote: > > +void cgroup_leave_frozen(bool always_leave) > +{ > + struct cgroup *cgrp; > + > + spin_lock_irq(&css_set_lock); > + cgrp = task_dfl_cgroup(current); > + if (always_leave || !test_bit(CGRP_FREEZE, &cgrp->flags)) { > + cgroup_dec_frozen_cnt(cgrp); > + cgroup_update_frozen(cgrp); > + WARN_ON_ONCE(!current->frozen); > + current->frozen = false; > + } > + spin_unlock_irq(&css_set_lock); > + > + if (unlikely(current->frozen)) { > + /* > + * If the task remained in the frozen state, > + * make sure it won't reach userspace without > + * entering the signal handling loop. > + */ > + spin_lock_irq(¤t->sighand->siglock); > + recalc_sigpending(); > + spin_unlock_irq(¤t->sighand->siglock); I still can't understand this logic. Once again, suppose we race with CGRP_FREEZE. If JOBCTL_TRAP_FREEZE is already set then signal_pending() must be already T and we do not need recalc_sigpending? If JOBCTL_TRAP_FREEZE is not set yet, how can recalc_sigpending() help? > +static void cgroup_freeze_task(struct task_struct *task, bool freeze) > +{ > + unsigned long flags; > + > + /* If the task is about to die, don't bother with freezing it. */ > + if (!lock_task_sighand(task, &flags)) > + return; > + > + if (freeze) { > + task->jobctl |= JOBCTL_TRAP_FREEZE; > + signal_wake_up(task, false); > + } else { > + task->jobctl &= ~JOBCTL_TRAP_FREEZE; > + wake_up_process(task); wake_up_interruptible() ? > static int ptrace_signal(int signr, kernel_siginfo_t *info) > { > /* > @@ -2442,6 +2483,10 @@ bool get_signal(struct ksignal *ksig) > ksig->info.si_signo = signr = SIGKILL; > sigdelset(¤t->pending.signal, SIGKILL); > recalc_sigpending(); > + current->jobctl &= ~JOBCTL_TRAP_FREEZE; > + spin_unlock_irq(&sighand->siglock); > + if (unlikely(cgroup_task_frozen(current))) > + cgroup_leave_frozen(true); Oh, and another leave_frozen below... I feel this must be simplified somehow, but nothing comes to my mind right now. > + /* > + * If the task is leaving the frozen state, let's update > + * cgroup counters and reset the frozen bit. > + */ > + if (unlikely(cgroup_task_frozen(current))) { > spin_unlock_irq(&sighand->siglock); > + cgroup_leave_frozen(true); > goto relock; > } afaics cgroup_leave_frozen(false) makes more sense here. Oleg.