Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id E583CC433EF for ; Fri, 19 Nov 2021 14:09:42 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id C3E9B61B66 for ; Fri, 19 Nov 2021 14:09:42 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235700AbhKSOMn (ORCPT ); Fri, 19 Nov 2021 09:12:43 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:54950 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229879AbhKSOMl (ORCPT ); Fri, 19 Nov 2021 09:12:41 -0500 Received: from mail-il1-x134.google.com (mail-il1-x134.google.com [IPv6:2607:f8b0:4864:20::134]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id F2CEFC061574 for ; Fri, 19 Nov 2021 06:09:39 -0800 (PST) Received: by mail-il1-x134.google.com with SMTP id j7so3027466ilk.13 for ; Fri, 19 Nov 2021 06:09:39 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc:content-transfer-encoding; bh=uH5+LijC2//yZ3zDrCNfSRI5ao+iauirzsE3A/1TQsY=; b=is7eXdz4qfpteEjiaagClNYuz3XlZjSoUKAwzEAPFEmQiKECAf/Pun22IBZ5fVKNHH MR5ylOp1OTwGcGGecyFz/fhQaRm0qguNxLPcEn8uBFgic7J9NbyFMQ/do2PucOxGNrUw zIBJdjHXB2Zr9SjGLuAawUDiGg2Dl2JZPv8xgDwAQR61FpO2qg3ZLuuQQLSFl58nh87/ wCfb0gFvyK8JZLaPb9E3ggWjUuc2yQ+7Nd5J3E37CQ1nLJkJsP/N7q+PYVCvjnpUxi3p 1CRofm6bgSJFjhwTwsyd3DySO+6o2GUC985YlFWrQCBS+/TpjKW/kShwtAYqlLjbvbeg C49w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc:content-transfer-encoding; bh=uH5+LijC2//yZ3zDrCNfSRI5ao+iauirzsE3A/1TQsY=; b=HjFdxP/DkSYIVqhQGzpGqgOoExS3JTsHBoUNW7O3mavIbAvnZUdhWeUnmsGDo6pnVg CPLwBivHUywa3hNmeaLBKDFfDdsti//YsC2HzPFYu3awkMRhBWAKhBj8u5FzxrhmxIOK SmJx3cYqShLaoDHxqqLZhoztJiZjx7oyvEfGNLNcYqCysgu3n0UHoNjRt7cMuLNHy9IK 7de0FFRGLMZN9BI/tMJJxHovfdfoTPV/kwExmDOcANEKo12S6P9LPolu4EJOaxt1YXOr OJGpZ7/t7tUxyN/Ypyy9O7SB+J6ci+8FjIG1npn4UOutaFa9pxntdSvLV62DdSqD/V0q WZVQ== X-Gm-Message-State: AOAM533TbtxHBMQhBCHRM1SHwB5BwBm1Cne3+eshTuyNJiv/EEah2IE1 DpEnFhu8unkFmDwrHHKH+6Ajf/gl6IqewHLDjCMkIxZXcbI= X-Google-Smtp-Source: ABdhPJzcPs8BTDEQDatUVFnv80eNNwO8+7JZDfwin1ICbJWyjZMykoKICiD7FN8nmkSVbqGSccsUxLL5oRPpK9kC+Wg= X-Received: by 2002:a05:6e02:1605:: with SMTP id t5mr802646ilu.233.1637330979385; Fri, 19 Nov 2021 06:09:39 -0800 (PST) MIME-Version: 1.0 References: <1637130234-57238-1-git-send-email-quic_jiangenj@quicinc.com> In-Reply-To: From: Andrey Konovalov Date: Fri, 19 Nov 2021 15:09:28 +0100 Message-ID: Subject: Re: [PATCH] kcov: add KCOV_PC_RANGE to limit pc range To: Dmitry Vyukov Cc: "JianGen Jiao (QUIC)" , syzkaller , "kasan-dev@googlegroups.com" , LKML , Alexander Lochmann , "Likai Ding (QUIC)" , Kaipeng Zeng , Hangbin Liu Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Nov 19, 2021 at 2:07 PM Dmitry Vyukov wrote: > > On Fri, 19 Nov 2021 at 13:55, Dmitry Vyukov wrote: > > > > +Kaipeng, Hangbin who contributed the coverage filter to syzkaller. > > This is a discussion about adding a similar filter to the kernel. You > > can see whole discussion here: > > https://groups.google.com/g/kasan-dev/c/TQmYUdUC08Y > > Joey, what do you think in general about passing a filter bitmap to the k= ernel? > > Since the bitmap is large, it can make sense to reuse it across > different KCOV instances. > I am thinking about something along the following lines: > > kcov_fd =3D open("/debugfs/kcov"); > filter_fd =3D ioctl(kcov_fd, KCOV_CREATE_FILTER, &{... some args > specifying start/end ...}); > filter =3D mmap(..., filter_fd); > ... write to the filter ... > > ... > kcov_fd2 =3D open("/debugfs/kcov"); > ioctl(kcov_fd2, KCOV_ATTACH_FILTER, filter_fd); > ioctl(kcov_fd2, KCOV_ENABLE); > > > This would allow us to create 2 filters: > 1. One the interesting subsystems > 2. Second only for yet uncovered PCs in the interesting subsystems > (updated as we discover more coverage) > > During fuzzing we attach the second filter to KCOV. > But when we want to obtain full program coverage, we attach the first one= . > > The filters (bitmaps) are reused across all threads in all executor > processes (so that we have only 2 filters globally per VM). > > KCOV_CREATE_FILTER could also accept how many bytes each bit > represents (that scaling factor, as hardcoding 4, 8, 16 may be bad for > a stable kernel interface). > > But I am still not sure how to support both the main kernel and > modules. We could allow setting up multiple filters for different PC > ranges. Or may be just 2 (one for kernel and one for modules range). > Or maybe 1 bitmap can cover both kernel and modules? > > Thoughts? Throwing in a thought without a concrete design suggestion: how about en eBPF-based filter? The flexibility would allow covering as many PC ranges as one wants. And, perhaps, do other things. > > > > On Fri, 19 Nov 2021 at 12:21, JianGen Jiao (QUIC) > > wrote: > > > > > > Yes, on x86_64, module address space is after kernel. But like below = on arm64, it's different. > > > > > > # grep stext /proc/kallsyms > > > ffffffc010010000 T _stext > > > # cat /proc/modules |sort -k 6 | tail -2 > > > Some_module_1 552960 0 - Live 0xffffffc00ca05000 (O) > > > Some_module_1 360448 0 - Live 0xffffffc00cb8f000 (O) # cat /proc/modu= les |sort -k 6 | head -2 > > > Some_module_3 16384 1 - Live 0xffffffc009430000 > > > > > > -----Original Message----- > > > From: Dmitry Vyukov > > > Sent: Friday, November 19, 2021 6:38 PM > > > To: JianGen Jiao (QUIC) > > > Cc: andreyknvl@gmail.com; kasan-dev@googlegroups.com; LKML ; Alexander Lochmann ; Lika= i Ding (QUIC) > > > Subject: Re: [PATCH] kcov: add KCOV_PC_RANGE to limit pc range > > > > > > WARNING: This email originated from outside of Qualcomm. Please be wa= ry of any links or attachments, and do not enable macros. > > > > > > On Fri, 19 Nov 2021 at 04:17, JianGen Jiao (QUIC) wrote: > > > > > > > > Hi Dmitry, > > > > I'm using the start, end pc from cover filter, which currently is t= he fast way compared to the big bitmap passing from syzkaller solution, as = I only set the cover filter to dirs/files I care about. > > > > > > I see. > > > But if we are unlucky and our functions of interest are at the very l= ow and high addresses, start/end will cover almost all kernel code... > > > > > > > I checked > > > > https://groups.google.com/g/kasan-dev/c/oVz3ZSWaK1Q/m/9ASztdzCAAAJ, > > > > The bitmap seems not the same as syzkaller one, which one will be u= sed finally? > > > > > > I don't know yet. We need to decide. > > > In syzkaller we are more flexible and can change code faster, while k= ernel interfaces are stable and need to be kept forever. So I think we need= to concentrate more on the good kernel interface and then support it in sy= zkaller. > > > > > > > ``` Alexander's one > > > > + pos =3D (ip - canonicalize_ip((unsigned long)&_stext)) / 4; idx = =3D pos > > > > + % BITS_PER_LONG; pos /=3D BITS_PER_LONG; if (likely(pos < > > > > + t->kcov_size)) WRITE_ONCE(area[pos], READ_ONCE(area[pos]) | 1L << > > > > + idx); > > > > ``` > > > > Pc offset is divided by 4 and start is _stext. But for some arch, p= c is less than _stext. > > > > > > You mean that modules can have PC < _stext? > > > > > > > ``` https://github.com/google/syzkaller/blob/master/syz-manager/cov= filter.go#L139-L154 > > > > data :=3D make([]byte, 8+((size>>4)/8+1)) > > > > order :=3D binary.ByteOrder(binary.BigEndian) > > > > if target.LittleEndian { > > > > order =3D binary.LittleEndian > > > > } > > > > order.PutUint32(data, start) > > > > order.PutUint32(data[4:], size) > > > > > > > > bitmap :=3D data[8:] > > > > for pc :=3D range pcs { > > > > // The lowest 4-bit is dropped. > > > > pc =3D uint32(backend.NextInstructionPC(target, uin= t64(pc))) > > > > pc =3D (pc - start) >> 4 > > > > bitmap[pc/8] |=3D (1 << (pc % 8)) > > > > } > > > > return data > > > > ``` > > > > Pc offset is divided by 16 and start is cover filter start pc. > > > > > > > > I think divided by 8 is more reasonable? Because there is at least = one instruction before each __sanitizer_cov_trace_pc call. > > > > 0000000000000160 R_AARCH64_CALL26 __sanitizer_cov_trace_pc > > > > 0000000000000168 R_AARCH64_CALL26 __sanitizer_cov_trace_pc > > > > > > > > I think we still need my patch because we still need a way to keep = the trace_pc call and post-filter in syzkaller doesn't solve trace_pc dropp= ing, right? > > > > > > Yes, the in-kernel filter solves the problem of trace capacity/overfl= ows. > > > > > > > > > > But for sure I can use the bitmap from syzkaller. > > > > > > > > THX > > > > Joey > > > > -----Original Message----- > > > > From: Dmitry Vyukov > > > > Sent: Thursday, November 18, 2021 10:00 PM > > > > To: JianGen Jiao (QUIC) > > > > Cc: andreyknvl@gmail.com; kasan-dev@googlegroups.com; LKML > > > > ; Alexander Lochmann > > > > > > > > Subject: Re: [PATCH] kcov: add KCOV_PC_RANGE to limit pc range > > > > > > > > WARNING: This email originated from outside of Qualcomm. Please be = wary of any links or attachments, and do not enable macros. > > > > > > > > ,On Wed, 17 Nov 2021 at 07:24, Joey Jiao wrote: > > > > > > > > > > Sometimes we only interested in the pcs within some range, while > > > > > there are cases these pcs are dropped by kernel due to `pos >=3D > > > > > t->kcov_size`, and by increasing the map area size doesn't help. > > > > > > > > > > To avoid disabling KCOV for these not intereseted pcs during buil= d > > > > > time, adding this new KCOV_PC_RANGE cmd. > > > > > > > > Hi Joey, > > > > > > > > How do you use this? I am concerned that a single range of PCs is t= oo restrictive. I can only see how this can work for single module (continu= ous in memory) or a single function. But for anything else (something in th= e main kernel, or several modules), it won't work as PCs are not continuous= . > > > > > > > > Maybe we should use a compressed bitmap of interesting PCs? It allo= ws to support all cases and we already have it in syz-executor, then syz-ex= ecutor could simply pass the bitmap to the kernel rather than post-filter. > > > > It's also overlaps with the KCOV_MODE_UNIQUE mode that +Alexander p= roposed here: > > > > https://groups.google.com/g/kasan-dev/c/oVz3ZSWaK1Q/m/9ASztdzCAAAJ > > > > It would be reasonable if kernel uses the same bitmap format for th= ese > > > > 2 features. > > > > > > > > > > > > > > > > > An example usage is to use together syzkaller's cov filter. > > > > > > > > > > Change-Id: I954f6efe1bca604f5ce31f8f2b6f689e34a2981d > > > > > Signed-off-by: Joey Jiao > > > > > --- > > > > > Documentation/dev-tools/kcov.rst | 10 ++++++++++ > > > > > include/uapi/linux/kcov.h | 7 +++++++ > > > > > kernel/kcov.c | 18 ++++++++++++++++++ > > > > > 3 files changed, 35 insertions(+) > > > > > > > > > > diff --git a/Documentation/dev-tools/kcov.rst > > > > > b/Documentation/dev-tools/kcov.rst > > > > > index d83c9ab..fbcd422 100644 > > > > > --- a/Documentation/dev-tools/kcov.rst > > > > > +++ b/Documentation/dev-tools/kcov.rst > > > > > @@ -52,9 +52,15 @@ program using kcov: > > > > > #include > > > > > #include > > > > > > > > > > + struct kcov_pc_range { > > > > > + uint32 start; > > > > > + uint32 end; > > > > > + }; > > > > > + > > > > > #define KCOV_INIT_TRACE _IOR('c', 1, unsi= gned long) > > > > > #define KCOV_ENABLE _IO('c', 100) > > > > > #define KCOV_DISABLE _IO('c', 101) > > > > > + #define KCOV_TRACE_RANGE _IOW('c', 103, st= ruct kcov_pc_range) > > > > > #define COVER_SIZE (64<<10) > > > > > > > > > > #define KCOV_TRACE_PC 0 > > > > > @@ -64,6 +70,8 @@ program using kcov: > > > > > { > > > > > int fd; > > > > > unsigned long *cover, n, i; > > > > > + /* Change start and/or end to your interested pc range. = */ > > > > > + struct kcov_pc_range pc_range =3D {.start =3D 0, .end = =3D > > > > > + (uint32)(~((uint32)0))}; > > > > > > > > > > /* A single fd descriptor allows coverage collection on a= single > > > > > * thread. > > > > > @@ -79,6 +87,8 @@ program using kcov: > > > > > PROT_READ | PROT_WRITE, MAP_= SHARED, fd, 0); > > > > > if ((void*)cover =3D=3D MAP_FAILED) > > > > > perror("mmap"), exit(1); > > > > > + if (ioctl(fd, KCOV_PC_RANGE, pc_range)) > > > > > + dprintf(2, "ignore KCOV_PC_RANGE error.\n"); > > > > > /* Enable coverage collection on the current thread. */ > > > > > if (ioctl(fd, KCOV_ENABLE, KCOV_TRACE_PC)) > > > > > perror("ioctl"), exit(1); diff --git > > > > > a/include/uapi/linux/kcov.h b/include/uapi/linux/kcov.h index > > > > > 1d0350e..353ff0a 100644 > > > > > --- a/include/uapi/linux/kcov.h > > > > > +++ b/include/uapi/linux/kcov.h > > > > > @@ -16,12 +16,19 @@ struct kcov_remote_arg { > > > > > __aligned_u64 handles[0]; > > > > > }; > > > > > > > > > > +#define PC_RANGE_MASK ((__u32)(~((u32) 0))) struct kcov_pc_range= { > > > > > + __u32 start; /* start pc & 0xFFFFFFFF = */ > > > > > + __u32 end; /* end pc & 0xFFFFFFFF */ > > > > > +}; > > > > > + > > > > > #define KCOV_REMOTE_MAX_HANDLES 0x100 > > > > > > > > > > #define KCOV_INIT_TRACE _IOR('c', 1, unsi= gned long) > > > > > #define KCOV_ENABLE _IO('c', 100) > > > > > #define KCOV_DISABLE _IO('c', 101) > > > > > #define KCOV_REMOTE_ENABLE _IOW('c', 102, struct kco= v_remote_arg) > > > > > +#define KCOV_PC_RANGE _IOW('c', 103, struct kco= v_pc_range) > > > > > > > > > > enum { > > > > > /* > > > > > diff --git a/kernel/kcov.c b/kernel/kcov.c index 36ca640..5955045= 0 > > > > > 100644 > > > > > --- a/kernel/kcov.c > > > > > +++ b/kernel/kcov.c > > > > > @@ -36,6 +36,7 @@ > > > > > * - initial state after open() > > > > > * - then there must be a single ioctl(KCOV_INIT_TRACE) call > > > > > * - then, mmap() call (several calls are allowed but not usefu= l) > > > > > + * - then, optional to set trace pc range > > > > > * - then, ioctl(KCOV_ENABLE, arg), where arg is > > > > > * KCOV_TRACE_PC - to trace only the PCs > > > > > * or > > > > > @@ -69,6 +70,8 @@ struct kcov { > > > > > * kcov_remote_stop(), see the comment there. > > > > > */ > > > > > int sequence; > > > > > + /* u32 Trace PC range from start to end. */ > > > > > + struct kcov_pc_range pc_range; > > > > > }; > > > > > > > > > > struct kcov_remote_area { > > > > > @@ -192,6 +195,7 @@ static notrace unsigned long > > > > > canonicalize_ip(unsigned long ip) void notrace > > > > > __sanitizer_cov_trace_pc(void) { > > > > > struct task_struct *t; > > > > > + struct kcov_pc_range pc_range; > > > > > unsigned long *area; > > > > > unsigned long ip =3D canonicalize_ip(_RET_IP_); > > > > > unsigned long pos; > > > > > @@ -199,6 +203,11 @@ void notrace __sanitizer_cov_trace_pc(void) > > > > > t =3D current; > > > > > if (!check_kcov_mode(KCOV_MODE_TRACE_PC, t)) > > > > > return; > > > > > + pc_range =3D t->kcov->pc_range; > > > > > + if (pc_range.start < pc_range.end && > > > > > + ((ip & PC_RANGE_MASK) < pc_range.start || > > > > > + (ip & PC_RANGE_MASK) > pc_range.end)) > > > > > + return; > > > > > > > > > > area =3D t->kcov_area; > > > > > /* The first 64-bit word is the number of subsequent PCs.= */ > > > > > @@ -568,6 +577,7 @@ static int kcov_ioctl_locked(struct kcov *kco= v, unsigned int cmd, > > > > > int mode, i; > > > > > struct kcov_remote_arg *remote_arg; > > > > > struct kcov_remote *remote; > > > > > + struct kcov_pc_range *pc_range; > > > > > unsigned long flags; > > > > > > > > > > switch (cmd) { > > > > > @@ -589,6 +599,14 @@ static int kcov_ioctl_locked(struct kcov *kc= ov, unsigned int cmd, > > > > > kcov->size =3D size; > > > > > kcov->mode =3D KCOV_MODE_INIT; > > > > > return 0; > > > > > + case KCOV_PC_RANGE: > > > > > + /* Limit trace pc range. */ > > > > > + pc_range =3D (struct kcov_pc_range *)arg; > > > > > + if (copy_from_user(&kcov->pc_range, pc_range, siz= eof(kcov->pc_range))) > > > > > + return -EINVAL; > > > > > + if (kcov->pc_range.start >=3D kcov->pc_range.end) > > > > > + return -EINVAL; > > > > > + return 0; > > > > > case KCOV_ENABLE: > > > > > /* > > > > > * Enable coverage for the current task. > > > > > -- > > > > > 2.7.4