Received: by 2002:a05:6a10:6006:0:0:0:0 with SMTP id w6csp1074046pxa; Fri, 28 Aug 2020 02:58:27 -0700 (PDT) X-Google-Smtp-Source: ABdhPJw5SkaUmKuAzICuswywf1+ewZ+AbPEr9HRMB8sLFEqvZAsdV+tbZTjxvQicKalWl6g1ehZ/ X-Received: by 2002:a05:6402:1c95:: with SMTP id cy21mr978105edb.151.1598608707053; Fri, 28 Aug 2020 02:58:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1598608707; cv=none; d=google.com; s=arc-20160816; b=wAtl8TgYY5gRe94Y82iMSFe78vk7lkjMmrjoyFrn5/cBY6bpBuiX+S4pZMpEcZF0fx cwYTFaHNVsqYBY+/DSqLln+UyUuYm+7FFL9pBxr9ULbpl4Qmwba0ygZv6D01Gu6/UEni iuvnY3tyxzM0EccoR0rg3sowhNv7i5jiEd1gnjxcn36WQbFwXLZN4IgqovchOxLwSn9j qEdGd8adtlfOLa1FH8xShLLge1yyReiwsIM1BWFhvQ8CoyE0aVAjtUWXhBCoE+T1Kfkr wYUJphvdi9TPoOgb5w7Ab1Ruft26Y59MHpMn5nvvJrDDvQBi/ovM3YdKJ14YkMBc5kq1 w7RQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=G6SBICsoo4KmrgAoqg80mPXmPRpG8yWjGtFzEw2Nki8=; b=XwJN/emcnRKgrkMjJN8ehpOyRyJo81FpuvHVBft1oNMe7yaMHmVBvG3DHc7oVot0VN W/XaeYQfQ+762C6AHS/bwEk55AlLnrEw4Wi9LPB7P5xSp6PDSa/ZiBJm5uB/mfEkWUPX KFCu5e8rm9NpakuDy6KczOB2+mPIVV2WUs2UEkZfQVxDUG6CBa3uFyAUQGSgaqAryCvS lwkX0Z33a9FaQH3zAgtT93zYmT+laVykso67RAkJYW3nVTmsH53hVXbmPuTpVkvdM2pe qS/L1m59fIM1FpzEgnSSHGg74pXvhFhiDnBccmMJzaJXgCffiJzQ3WXAwm+S1TN3nKRR Co4w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id z2si320786eju.353.2020.08.28.02.58.04; Fri, 28 Aug 2020 02:58:27 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728985AbgH1J4t (ORCPT + 99 others); Fri, 28 Aug 2020 05:56:49 -0400 Received: from mail.kernel.org ([198.145.29.99]:36244 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728016AbgH1J4r (ORCPT ); Fri, 28 Aug 2020 05:56:47 -0400 Received: from gaia (unknown [46.69.195.127]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 3CAEB208D5; Fri, 28 Aug 2020 09:56:44 +0000 (UTC) Date: Fri, 28 Aug 2020 10:56:41 +0100 From: Catalin Marinas To: Evgenii Stepanov Cc: Andrey Konovalov , Vincenzo Frascino , Dmitry Vyukov , kasan-dev , Andrey Ryabinin , Alexander Potapenko , Marco Elver , Elena Petrova , Branislav Rankov , Kevin Brodsky , Will Deacon , Andrew Morton , Linux ARM , Linux Memory Management List , LKML Subject: Re: [PATCH 21/35] arm64: mte: Add in-kernel tag fault handler Message-ID: <20200828095641.GD3169@gaia> References: <20200827095429.GC29264@gaia> <20200827131045.GM29264@gaia> <20200827145642.GO29264@gaia> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Aug 27, 2020 at 12:14:26PM -0700, Evgenii Stepanov wrote: > On Thu, Aug 27, 2020 at 7:56 AM Catalin Marinas wrote: > > On Thu, Aug 27, 2020 at 03:34:42PM +0200, Andrey Konovalov wrote: > > > On Thu, Aug 27, 2020 at 3:10 PM Catalin Marinas wrote: > > > > On Thu, Aug 27, 2020 at 02:31:23PM +0200, Andrey Konovalov wrote: > > > > > On Thu, Aug 27, 2020 at 11:54 AM Catalin Marinas > > > > > wrote: > > > > > > On Fri, Aug 14, 2020 at 07:27:03PM +0200, Andrey Konovalov wrote: > > > > > > > +static int do_tag_recovery(unsigned long addr, unsigned int esr, > > > > > > > + struct pt_regs *regs) > > > > > > > +{ > > > > > > > + report_tag_fault(addr, esr, regs); > > > > > > > + > > > > > > > + /* Skip over the faulting instruction and continue: */ > > > > > > > + arm64_skip_faulting_instruction(regs, AARCH64_INSN_SIZE); > > > > > > > > > > > > Ooooh, do we expect the kernel to still behave correctly after this? I > > > > > > thought the recovery means disabling tag checking altogether and > > > > > > restarting the instruction rather than skipping over it. > > [...] > > > > > Can we disable MTE, reexecute the instruction, and then reenable MTE, > > > > > or something like that? > > > > > > > > If you want to preserve the MTE enabled, you could single-step the > > > > instruction or execute it out of line, though it's a bit more convoluted > > > > (we have a similar mechanism for kprobes/uprobes). > > > > > > > > Another option would be to attempt to set the matching tag in memory, > > > > under the assumption that it is writable (if it's not, maybe it's fine > > > > to panic). Not sure how this interacts with the slub allocator since, > > > > presumably, the logical tag in the pointer is wrong rather than the > > > > allocation one. > > > > > > > > Yet another option would be to change the tag in the register and > > > > re-execute but this may confuse the compiler. > > > > > > Which one of these would be simpler to implement? > > > > Either 2 or 3 would be simpler (re-tag the memory location or the > > pointer) with the caveats I mentioned. Also, does the slab allocator > > need to touch the memory on free with a tagged pointer? Otherwise slab > > may hit an MTE fault itself. > > Changing the memory tag can cause faults in other threads, and that > could be very confusing. It could indeed trigger a chain of faults. It's not even other threads, it could be the same thread in a different function. > Probably the safest thing is to retag the register, single step and > then retag it back, but be careful with the instructions that change > the address register (like ldr x0, [x0]). This gets complicated if you have to parse the opcode. If you can single-step, just set PSTATE.TCO for the instruction. But the single-step machinery gets more complicated, probably interacts badly with kprobes. I think the best option is to disable the MTE checks in TCF on an _unhandled_ kernel fault, report and continue. For the KASAN tests, add accessors similar to get_user/put_user which are able to handle the fault and return an error. Such accessors, since they have a fixup handler, would not lead to the MTE checks being disabled. -- Catalin