Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp1629746pxb; Thu, 4 Mar 2021 16:56:56 -0800 (PST) X-Google-Smtp-Source: ABdhPJxLEA3P8XV9DiISjE5Y9awnvLR5Bh2bpXZBeQglaMl9Lbq7Q6Tp9DGEY7hko1GbGC0gMr8Y X-Received: by 2002:a6b:8b0e:: with SMTP id n14mr6006708iod.199.1614905816501; Thu, 04 Mar 2021 16:56:56 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1614905816; cv=none; d=google.com; s=arc-20160816; b=AyUoQ5d/7yF9Ahff+KAz8o73NfFi7gT1jHoaNxyFZZRsweHJM+SHRq+QiEe/koceVw CEUH+Q7d5et/JJeAJDiw7ytT3w7XTSwFwhMgIogqOjOdXM47IBDbwwFCgO4w3G8Lbk/r n4rO8bByV4Y775FyvtvvOjBDwMnXC3dCcXcdVZGGOJ8NyFKGFVKMcMJjrc23FKdzP33r oHMdCRsNHcqv4Uy8xdl30j3/KTwZbFxgw4ihMd8cdOF3kd5+I1A+4/a0uF/S9rvzX9bh jJ4m0Q4YHtCUrE8xQ88ALj44eRAZA4GeqZvI9eehg30GK+ho9d3FXMg7A7yhZXOYHKGT +jeA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :ironport-sdr:ironport-sdr; bh=mjCv9wrt+nv7zRkl7hsIRY45R0guZ2rEAOIoAp2TY0c=; b=zPVgSCSfGhGeVUHu5lq3DYSQeBWwCt3ratFXiU5cHyTI6oA66jDaHwODFzuofpoKNe 4zyNfB/xyFphjD2lisJs/j0RchBVW69l3uKdM/rSmDXpub+FHsvq19I13NKYfy9J+UK9 M6gk401uiE7HGTUfvjiAiJgADDWR7TifcQMvNvG5SALN5u/Nt9CvnmVSf5mmNm2ruxU2 jDKXuleU6GZ/PtypFfxI8cMEZRYHCDU1HzDwA94BiHLdjXLl70MbitBtwjWbsPWK2VLd VWCDCgGnFzLPW7TntCs1yfrdVNpauxJytnGCIt9HeHs9NgYw1DhA830WQdwR8H1tIEXP KsIw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id t15si487491jal.2.2021.03.04.16.56.43; Thu, 04 Mar 2021 16:56:56 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236681AbhCDUUG (ORCPT + 99 others); Thu, 4 Mar 2021 15:20:06 -0500 Received: from mga05.intel.com ([192.55.52.43]:6280 "EHLO mga05.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S239044AbhCDUTr (ORCPT ); Thu, 4 Mar 2021 15:19:47 -0500 IronPort-SDR: +Cav2cRedfECRZpSM1QkUegtnvB8nk+IfCN/YbXBWjG5S3pMg5T0qQyKfzLLBvMKnNsumlojX8 /dPWbUJCnyKQ== X-IronPort-AV: E=McAfee;i="6000,8403,9913"; a="272504381" X-IronPort-AV: E=Sophos;i="5.81,223,1610438400"; d="scan'208";a="272504381" Received: from orsmga008.jf.intel.com ([10.7.209.65]) by fmsmga105.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 04 Mar 2021 12:19:06 -0800 IronPort-SDR: 8IHLP432SVJfpEPtW/B8iO9EiS88EjkED9rCFG5tHm21VS10ACysIMBbq6/KSSpgZMsG8e92Hf /hvZwcnYUSeA== X-IronPort-AV: E=Sophos;i="5.81,223,1610438400"; d="scan'208";a="407966663" Received: from iweiny-desk2.sc.intel.com (HELO localhost) ([10.3.52.147]) by orsmga008-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 04 Mar 2021 12:19:06 -0800 Date: Thu, 4 Mar 2021 12:19:06 -0800 From: Ira Weiny To: Andy Lutomirski Cc: x86@kernel.org, LKML , Mark Rutland , Josh Poimboeuf Subject: Re: [PATCH v3 04/11] x86/kthread,dumpstack: Set task_pt_regs->cs.RPL=3 for kernel threads Message-ID: <20210304201906.GM3014244@iweiny-DESK2.sc.intel.com> References: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.11.1 (2018-12-01) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Mar 04, 2021 at 11:05:57AM -0800, Andy Lutomirski wrote: > For kernel threads, task_pt_regs is currently all zeros, a valid user state > (if kernel_execve() has been called), or some combination thereof during > execution of kernel_execve(). If a stack trace is printed, the unwinder > might get confused and treat task_pt_regs as a kernel state. Indeed, > forcing a stack dump results in an attempt to display _kernel_ code bytes > from a bogus address at the very top of kernel memory. > > Fix the confusion by setting cs=3 so that user_mode(task_pt_regs(...)) == > true for kernel threads. > > Also improve the error when failing to fetch code bytes to show which type > of fetch failed. This makes it much easier to understand what's happening. > > Cc: Josh Poimboeuf > Signed-off-by: Andy Lutomirski > --- > arch/x86/kernel/dumpstack.c | 4 ++-- > arch/x86/kernel/process.c | 13 +++++++++++++ > 2 files changed, 15 insertions(+), 2 deletions(-) > > diff --git a/arch/x86/kernel/dumpstack.c b/arch/x86/kernel/dumpstack.c > index 55cf3c8325c6..9b7b69bb12e5 100644 > --- a/arch/x86/kernel/dumpstack.c > +++ b/arch/x86/kernel/dumpstack.c > @@ -128,8 +128,8 @@ void show_opcodes(struct pt_regs *regs, const char *loglvl) > /* No access to the user space stack of other tasks. Ignore. */ > break; > default: > - printk("%sCode: Unable to access opcode bytes at RIP 0x%lx.\n", > - loglvl, prologue); > + printk("%sCode: Unable to access %s opcode bytes at RIP 0x%lx.\n", > + loglvl, user_mode(regs) ? "user" : "kernel", prologue); > break; > } > } > diff --git a/arch/x86/kernel/process.c b/arch/x86/kernel/process.c > index 145a7ac0c19a..f6f16df04cb9 100644 > --- a/arch/x86/kernel/process.c > +++ b/arch/x86/kernel/process.c > @@ -163,6 +163,19 @@ int copy_thread(unsigned long clone_flags, unsigned long sp, unsigned long arg, > /* Kernel thread ? */ > if (unlikely(p->flags & PF_KTHREAD)) { > memset(childregs, 0, sizeof(struct pt_regs)); > + > + /* > + * Even though there is no real user state here, these > + * are were user regs belong, and kernel_execve() will ^^^^^ where? Ira > + * overwrite them with user regs. Put an obviously > + * invalid value that nonetheless causes user_mode(regs) > + * to return true in CS. > + * > + * This also prevents the unwinder from thinking that there > + * is invalid kernel state at the top of the stack. > + */ > + childregs->cs = 3; > + > kthread_frame_init(frame, sp, arg); > return 0; > } > -- > 2.29.2 >