Received: by 2002:a5b:505:0:0:0:0:0 with SMTP id o5csp694772ybp; Fri, 4 Oct 2019 03:32:19 -0700 (PDT) X-Google-Smtp-Source: APXvYqxFK7C90llcwjAnSfADGIOi3Qugpzm4LDk2bKUMxFIJQW9XyHa2JSF7drqOzyjLhCT7k0ni X-Received: by 2002:a17:906:81c6:: with SMTP id e6mr11644699ejx.284.1570185139188; Fri, 04 Oct 2019 03:32:19 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1570185139; cv=none; d=google.com; s=arc-20160816; b=BhePnBflBXcT7QORRZVMpiNETv2EvSCXvJ8TngIPyKtovBoWy0nijGWd25U9jCdGJT qR+L19ZQBLfOvxZo6YFS1NlUXsr1dG+GK517URKnaKpUUUFUh990x9GAspisxSgvbbT+ oKz5Kt/7hIuEAXLjJjBvzwzu09vtiOiqiGL7JwmmJD6yYwLhGYikwu5miMcrTRbokcLN YMgCnCq12WxOzkLnG5a5wwMeNSgeKRecH/K3R/MMcuewF1IsUTrpEALbUAeQ6jwu3pYs 3iYtzABOmK3Qu3Khur/H/GIT8YpbegYNshLtN1ArWdbxXe0uxmnDLRXxeqRpfkQCDfS3 +05Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=lYr8ibckse0BmUaAjEkti+4k6yIQOCn91r/LNpsvMM0=; b=LIIZiEbXPs4oUC4PqIRkbsLJ/8XSNjwpk/gWzCD85aiFEELm5bEM1nBxOfmZOhiS+h /15gNCSalJlrnC4A7nZNh0DQIaAxMDwEpPmm+zq1MxWEXFmyvbMMxzfdlfKeJJ9k+T0j SPR0zVuBbrQTClU8PvV6bwrit5HHelHXBx1uqqNIN5ujlJk+fay3XX85ZAGt4mwD/XvW zvOsZZVqxJ5SCRo4DG/P+37Lkaum97/5cY3Rb5hyIj0/U2belktafQfQE2bJKWgvNC5T tDtRI44l59nG8u4EFN5fZvxF95/fxPaRtcdxruhhszDzbhar4aaO/me/VKG1Fp3Dr0v4 PAeA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=m1g0d+tr; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id ca4si2698055ejb.39.2019.10.04.03.31.54; Fri, 04 Oct 2019 03:32:19 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=m1g0d+tr; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2388051AbfJDJyS (ORCPT + 99 others); Fri, 4 Oct 2019 05:54:18 -0400 Received: from mail-qt1-f195.google.com ([209.85.160.195]:44205 "EHLO mail-qt1-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729922AbfJDJyR (ORCPT ); Fri, 4 Oct 2019 05:54:17 -0400 Received: by mail-qt1-f195.google.com with SMTP id u40so7683934qth.11 for ; Fri, 04 Oct 2019 02:54:17 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=lYr8ibckse0BmUaAjEkti+4k6yIQOCn91r/LNpsvMM0=; b=m1g0d+trfTdQ+ECUs0dLmouJWy/sQUda0BcPSAoUH+uG/QUwZXL1daxAdV6i1F5yxG qOd6Ud4kS1pfJvJeoZyYt8cmLUe7/HdEIxIwxA6h0r+KyO2oZE1CJTC5kej+WAmDGvhs g1q8Kkh1QysPK7nYlXgGO7K9MWJMpJeN+e4wpXF0kFv8xW4SCpVUMgljBivP86cPFceq tOOnxKMU64CeKbO5cJXoehMdcqq14zIMx2l3tVumblXB8VU6U63dZLLpfqQJTrt3FZnE pSUY7GqQees/Ikygqqy/PogJWe5MVlogZAiHYdjBtGqG6XrU346eySqNqR8gldgwuJVz t3rg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=lYr8ibckse0BmUaAjEkti+4k6yIQOCn91r/LNpsvMM0=; b=K9nFQFZodRxPXXRtdIlAsbo5TUVHo4gAQBPaJ5YrRn6MdHFbz+giKYC7wmYkz1VXfr WqGlzFsEr8DxeSulXmFS49WTfqM9NdqjxNwmEKc0aQQb0VhhG7hNqrDNtqyL7Llvb5TN nUE368t+GHlBYGPmdrw4XW9tKytMiE4ZAnCunDedHyjDlbaFxX91PeK7ETkpdgzQ68Aw uzh1RvELy01n1Ln1CAFw2FKVJgsK0+ZDLTBpg2R+/4eupFYufJivR1Rh9XD3sfQGWUiQ HzBdXQu4fImx0RSF3J6Myl4Vso4gqq6YgusGDPsUmi6wcozKRRUSZJDKqVP+9ZA2HnQ8 cWBg== X-Gm-Message-State: APjAAAXN49l73qZgkPtncU6WAPEOIvdRXQYbZrGJ6HiqxacTy4KEMcEl zylcbIKuDCiH42aX3Hvb7bd/olBz4neR+BAQHcNyfA== X-Received: by 2002:ac8:108b:: with SMTP id a11mr14866143qtj.380.1570182856217; Fri, 04 Oct 2019 02:54:16 -0700 (PDT) MIME-Version: 1.0 References: <20190927034338.15813-1-walter-zh.wu@mediatek.com> <1569594142.9045.24.camel@mtksdccf07> <1569818173.17361.19.camel@mtksdccf07> <1570018513.19702.36.camel@mtksdccf07> <1570069078.19702.57.camel@mtksdccf07> <1570095525.19702.59.camel@mtksdccf07> <1570110681.19702.64.camel@mtksdccf07> <1570164140.19702.97.camel@mtksdccf07> <1570176131.19702.105.camel@mtksdccf07> <1570182257.19702.109.camel@mtksdccf07> In-Reply-To: <1570182257.19702.109.camel@mtksdccf07> From: Dmitry Vyukov Date: Fri, 4 Oct 2019 11:54:04 +0200 Message-ID: Subject: Re: [PATCH] kasan: fix the missing underflow in memmove and memcpy with CONFIG_KASAN_GENERIC=y To: Walter Wu Cc: Andrey Ryabinin , Alexander Potapenko , Matthias Brugger , LKML , kasan-dev , Linux-MM , Linux ARM , linux-mediatek@lists.infradead.org, wsd_upstream Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Oct 4, 2019 at 11:44 AM Walter Wu wrote: > > On Fri, 2019-10-04 at 11:18 +0200, Dmitry Vyukov wrote: > > On Fri, Oct 4, 2019 at 10:02 AM Walter Wu wrote: > > > > > > On Fri, 2019-10-04 at 12:42 +0800, Walter Wu wrote: > > > > On Thu, 2019-10-03 at 16:53 +0200, Dmitry Vyukov wrote: > > > > > On Thu, Oct 3, 2019 at 3:51 PM Walter Wu wrote:> > > > > > > > > > > > > static void print_error_description(struct kasan_access_info *info) > > > > > > { > > > > > > - pr_err("BUG: KASAN: %s in %pS\n", > > > > > > - get_bug_type(info), (void *)info->ip); > > > > > > - pr_err("%s of size %zu at addr %px by task %s/%d\n", > > > > > > - info->is_write ? "Write" : "Read", info->access_size, > > > > > > - info->access_addr, current->comm, task_pid_nr(current)); > > > > > > + if ((long)info->access_size < 0) { > > > > > > + pr_err("BUG: KASAN: invalid size %zu in %pS\n", > > > > > > + info->access_size, (void *)info->ip); > > > > > > > > > > I would not introduce a new bug type. > > > > > These are parsed and used by some systems, e.g. syzbot. If size is > > > > > user-controllable, then a new bug type for this will mean 2 bug > > > > > reports. > > > > > It also won't harm to print Read/Write, definitely the address, so no > > > > > reason to special case this out of a dozen of report formats. > > > > > This can qualify as out-of-bounds (definitely will cross some > > > > > bounds!), so I would change get_bug_type() to return > > > > > "slab-out-of-bounds" (as the most common OOB) in such case (with a > > > > > comment). > > > > > > > > > Print Read/Write and address information, it is ok. > > > > But if we can directly point to the root cause of this problem, why we > > > > not do it? see 1) and 2) to get a point, if we print OOB, then user > > > > needs one minute to think what is root case of this problem, but if we > > > > print invalid size, then user can directly get root case. this is my > > > > original thinking. > > > > 1)Invalid size is true then OOB is true. > > > > 2)OOB is true then invalid size may be true or false. > > > > > > > > But I see you say some systems have used bug report so that avoid this > > > > trouble, i will print the wrong type is "out-of-bound" in a unified way > > > > when size<0. > > > > > > > > > > Updated my patch, please help to review it. > > > thanks. > > > > > > commit 13e10a7e4264eb25c5a14193068027afc9c261f6 > > > Author: Walter-zh Wu > > > Date: Fri Oct 4 15:27:17 2019 +0800 > > > > > > kasan: detect negative size in memory operation function > > > > > > It is an undefined behavior to pass a negative value to > > > memset()/memcpy()/memmove() > > > , so need to be detected by KASAN. > > > > > > If size is negative value, then it will be larger than ULONG_MAX/2, > > > so that we will qualify as out-of-bounds issue. > > > > > > KASAN report: > > > > > > BUG: KASAN: out-of-bounds in kmalloc_memmove_invalid_size+0x70/0xa0 > > > Read of size 18446744073709551608 at addr ffffff8069660904 by task > > > cat/72 > > > > > > CPU: 2 PID: 72 Comm: cat Not tainted > > > 5.4.0-rc1-next-20191004ajb-00001-gdb8af2f372b2-dirty #1 > > > Hardware name: linux,dummy-virt (DT) > > > Call trace: > > > dump_backtrace+0x0/0x288 > > > show_stack+0x14/0x20 > > > dump_stack+0x10c/0x164 > > > print_address_description.isra.9+0x68/0x378 > > > __kasan_report+0x164/0x1a0 > > > kasan_report+0xc/0x18 > > > check_memory_region+0x174/0x1d0 > > > memmove+0x34/0x88 > > > kmalloc_memmove_invalid_size+0x70/0xa0 > > > > > > [1] https://bugzilla.kernel.org/show_bug.cgi?id=199341 > > > > > > Signed-off-by: Walter Wu > > > Reported -by: Dmitry Vyukov > > > Suggested-by: Dmitry Vyukov > > > > > > diff --git a/lib/test_kasan.c b/lib/test_kasan.c > > > index 49cc4d570a40..06942cf585cc 100644 > > > --- a/lib/test_kasan.c > > > +++ b/lib/test_kasan.c > > > @@ -283,6 +283,23 @@ static noinline void __init > > > kmalloc_oob_in_memset(void) > > > kfree(ptr); > > > } > > > > > > +static noinline void __init kmalloc_memmove_invalid_size(void) > > > +{ > > > + char *ptr; > > > + size_t size = 64; > > > + > > > + pr_info("invalid size in memmove\n"); > > > + ptr = kmalloc(size, GFP_KERNEL); > > > + if (!ptr) { > > > + pr_err("Allocation failed\n"); > > > + return; > > > + } > > > + > > > + memset((char *)ptr, 0, 64); > > > + memmove((char *)ptr, (char *)ptr + 4, -2); > > > + kfree(ptr); > > > +} > > > + > > > static noinline void __init kmalloc_uaf(void) > > > { > > > char *ptr; > > > @@ -773,6 +790,7 @@ static int __init kmalloc_tests_init(void) > > > kmalloc_oob_memset_4(); > > > kmalloc_oob_memset_8(); > > > kmalloc_oob_memset_16(); > > > + kmalloc_memmove_invalid_size(); > > > kmalloc_uaf(); > > > kmalloc_uaf_memset(); > > > kmalloc_uaf2(); > > > diff --git a/mm/kasan/common.c b/mm/kasan/common.c > > > index 6814d6d6a023..97dd6eecc3e7 100644 > > > --- a/mm/kasan/common.c > > > +++ b/mm/kasan/common.c > > > @@ -102,7 +102,8 @@ EXPORT_SYMBOL(__kasan_check_write); > > > #undef memset > > > void *memset(void *addr, int c, size_t len) > > > { > > > - check_memory_region((unsigned long)addr, len, true, _RET_IP_); > > > + if (!check_memory_region((unsigned long)addr, len, true, _RET_IP_)) > > > + return NULL; > > > > > > return __memset(addr, c, len); > > > } > > > @@ -110,7 +111,8 @@ void *memset(void *addr, int c, size_t len) > > > #undef memmove > > > void *memmove(void *dest, const void *src, size_t len) > > > { > > > - check_memory_region((unsigned long)src, len, false, _RET_IP_); > > > + if (!check_memory_region((unsigned long)src, len, false, _RET_IP_)) > > > + return NULL; > > > check_memory_region((unsigned long)dest, len, true, _RET_IP_); > > > > I would check both calls. > > The current code seems to be over-specialized for handling of invalid > > size (you assume that if it's invalid size, then the first > > check_memory_region will detect it and checking the second one is > > pointless, right?). > > But check_memory_region can return false in other cases too. > > Also seeing first call checked, but the second not checked just hurts > > my eyes when reading code (whenever I will read such code my first > > reaction will be "why?"). > > > I can't agree with you any more about second point. > > #undef memmove > void *memmove(void *dest, const void *src, size_t len) > { > if (!check_memory_region((unsigned long)src, len, false, _RET_IP_) > ||) > !check_memory_region((unsigned long)dest, len, true, _RET_IP_); > return NULL; > > return __memmove(dest, src, len); > } > > > > > > > > > return __memmove(dest, src, len); > > > @@ -119,7 +121,8 @@ void *memmove(void *dest, const void *src, size_t > > > len) > > > #undef memcpy > > > void *memcpy(void *dest, const void *src, size_t len) > > > { > > > - check_memory_region((unsigned long)src, len, false, _RET_IP_); > > > + if (!check_memory_region((unsigned long)src, len, false, _RET_IP_)) > > > + return NULL; > > > check_memory_region((unsigned long)dest, len, true, _RET_IP_); > > > > > > return __memcpy(dest, src, len); > > > diff --git a/mm/kasan/generic.c b/mm/kasan/generic.c > > > index 616f9dd82d12..02148a317d27 100644 > > > --- a/mm/kasan/generic.c > > > +++ b/mm/kasan/generic.c > > > @@ -173,6 +173,11 @@ static __always_inline bool > > > check_memory_region_inline(unsigned long addr, > > > if (unlikely(size == 0)) > > > return true; > > > > > > + if (unlikely((long)size < 0)) { > > > + kasan_report(addr, size, write, ret_ip); > > > + return false; > > > + } > > > + > > > if (unlikely((void *)addr < > > > kasan_shadow_to_mem((void *)KASAN_SHADOW_START))) { > > > kasan_report(addr, size, write, ret_ip); > > > diff --git a/mm/kasan/generic_report.c b/mm/kasan/generic_report.c > > > index 36c645939bc9..ae9596210394 100644 > > > --- a/mm/kasan/generic_report.c > > > +++ b/mm/kasan/generic_report.c > > > @@ -107,6 +107,13 @@ static const char *get_wild_bug_type(struct > > > kasan_access_info *info) > > > > > > const char *get_bug_type(struct kasan_access_info *info) > > > { > > > + /* > > > + * if access_size < 0, then it will be larger than ULONG_MAX/2, > > > + * so that this can qualify as out-of-bounds. > > > + */ > > > + if ((long)info->access_size < 0) > > > + return "out-of-bounds"; > > > > "out-of-bounds" is the _least_ frequent KASAN bug type. So saying > > "out-of-bounds" has downsides of both approaches and won't prevent > > duplicate reports by syzbot... > > > maybe i should add your comment into the comment in get_bug_type? Yes, that's exactly what I meant above: "I would change get_bug_type() to return "slab-out-of-bounds" (as the most common OOB) in such case (with a comment)." ;) > > > + > > > if (addr_has_shadow(info->access_addr)) > > > return get_shadow_bug_type(info); > > > return get_wild_bug_type(info); > > > diff --git a/mm/kasan/tags.c b/mm/kasan/tags.c > > > index 0e987c9ca052..b829535a3ad7 100644 > > > --- a/mm/kasan/tags.c > > > +++ b/mm/kasan/tags.c > > > @@ -86,6 +86,11 @@ bool check_memory_region(unsigned long addr, size_t > > > size, bool write, > > > if (unlikely(size == 0)) > > > return true; > > > > > > + if (unlikely((long)size < 0)) { > > > + kasan_report(addr, size, write, ret_ip); > > > + return false; > > > + } > > > + > > > tag = get_tag((const void *)addr); > > > > > > /* > > > diff --git a/mm/kasan/tags_report.c b/mm/kasan/tags_report.c > > > index 969ae08f59d7..1e1ca81214b5 100644 > > > --- a/mm/kasan/tags_report.c > > > +++ b/mm/kasan/tags_report.c > > > @@ -36,6 +36,13 @@ > > > > > > const char *get_bug_type(struct kasan_access_info *info) > > > { > > > + /* > > > + * if access_size < 0, then it will be larger than ULONG_MAX/2, > > > + * so that this can qualify as out-of-bounds. > > > + */ > > > + if ((long)info->access_size < 0) > > > + return "out-of-bounds"; > > > + > > > #ifdef CONFIG_KASAN_SW_TAGS_IDENTIFY > > > struct kasan_alloc_meta *alloc_meta; > > > struct kmem_cache *cache;