Received: by 2002:a05:6a10:2726:0:0:0:0 with SMTP id ib38csp1211956pxb; Wed, 6 Apr 2022 11:26:51 -0700 (PDT) X-Google-Smtp-Source: ABdhPJweMW4f8VRQoS3lKg0hajujVtceMFpZD0ERgp8aqrLjdW8D6zvC4kkxXMpK22Y+W/7zWHCv X-Received: by 2002:a65:490c:0:b0:382:692a:b1fa with SMTP id p12-20020a65490c000000b00382692ab1famr8394559pgs.150.1649269611619; Wed, 06 Apr 2022 11:26:51 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1649269611; cv=none; d=google.com; s=arc-20160816; b=xXPD1uX6qTni+t2QHxUSFLsB2murMMNjkMCYYZ5lkHN8rjdsrXpVT+GA4x9cHJRodL Q03hNtpN5V7AHBzromMwBbggnIUIskF8aM2FVyzk9WgRlh3HiF0h0eEzoxAlBBcSajct tklBQBKNBahixI6RmguGpHPvQb/cq/5GLqsIiJcr/qHelTsw8tBnO7d2GkqxLuicyJ76 gLRSH/n29hz1h3icRNLcd4aMulUk3dpTtVw0DcjHScuMYE4Mzyp1ku6SOmND30Y0qqUQ WOvfpmyXwGvxkNqPzSbMIkAq+SG7vSCp40SCpwIF+Yv0NXFdbuCa6bpv/0YoRCkO0NSn OmWg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=KUGFoNnUGScUomYsF7wtqgIA+7i5QVqibaScVacsjnU=; b=U98h8qV9/iB0hTEcOHaxvCghazIViTNR2dIlcGhvXny4ASlx3nrOKIYjJpHNvJfViP fZFVAnVa/vJEWseYGzjlwPm3CIRzTLN93jigMRAtIlhNvD00LN1Ft5C9W2qgESe4LlYO oi13Oy0xmxfJ0HHDdSLy+EDyTuBCbB2VdiPxUlIjOoap5yoQdAT0wTl0AYJeNi5hx/GX jXpNPZ1MILpTk0Vry8tH5V0PpcRPFrC4q5T3M4RYOTtbryIHUHHbiT3MqH8CiXnbEdqf dBPI/yH3c/tTqPL2puEqMxI8pcY+03DlmmurhcSI0rtdB0BJkTGAycrAHiUXIUbcQMBY s47w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=AbBXosJZ; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id v22-20020a63b956000000b003816043eec9si16731803pgo.190.2022.04.06.11.26.51 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 06 Apr 2022 11:26:51 -0700 (PDT) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=AbBXosJZ; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 7D98C263645; Wed, 6 Apr 2022 11:09:57 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S240194AbiDFSLs (ORCPT + 99 others); Wed, 6 Apr 2022 14:11:48 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:52524 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S240254AbiDFSJc (ORCPT ); Wed, 6 Apr 2022 14:09:32 -0400 Received: from mail-wm1-x335.google.com (mail-wm1-x335.google.com [IPv6:2a00:1450:4864:20::335]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2A6111AF7FA for ; Wed, 6 Apr 2022 09:46:49 -0700 (PDT) Received: by mail-wm1-x335.google.com with SMTP id i6-20020a1c3b06000000b0038e710da2dcso4023537wma.1 for ; Wed, 06 Apr 2022 09:46:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=KUGFoNnUGScUomYsF7wtqgIA+7i5QVqibaScVacsjnU=; b=AbBXosJZdvj+44jV/t3tLeXNGDbrJB0DTPW335jYRx/iAWXKo3M2TJxCtQmHuoLYVi eQQr6SDYWWaPfQa2K3p9Ft8NRL+qPD66x/fVU/ZtHc6JxasrB4n12PwUWuhSDdznqDT4 pt9cg1B+Nxdobqqjq4tOLdkOzSaz65OTgy3x+zfILsG+HGNPltMuvQxThoVrexbhob/j jQUm/G693iC+2UB4JCW/M5Us7dd74lZ94oEi6kkcxl3wqriqS5/VzIWjKF8TLJvayZ4d WdUtgynNNirwZRCNHZ7JpeE33XQBUi/nBqtK9IjpCj0/1IDAO5otMttxuJ6VLHKD2ssc 0fkA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=KUGFoNnUGScUomYsF7wtqgIA+7i5QVqibaScVacsjnU=; b=FGKYirZGO0ZbY+xJsp42uKdLM418hqRsdRrrox1q1noapBYtzZSOwWxSRdJkpFHx6e VJjDiRailNVEtu+3ohaDAwIOu2rEt2tZ0jm2b6v8d304U6Ww01zdp3t6YqnUEk0OeJh2 MkcOBZpfa4VcIgsBxs4DaV4+C1YUuEIC8hbuDq7xadOD2DI8VMakuRaFa+tDlHwFHYXp ycvlmydFf8n5T8LXKvYG66LD+uA3ssHNryO8bsUr593xkaHFJ5ovNSAfzPAb9qpavBwB zUkHkh7nDsYUyJfqsCPzYsySIvylRGpYf4v7LXbZ4Q+umcNEJhpc+v+c6OUPvSyy34gl i45w== X-Gm-Message-State: AOAM532hOvLOCe5Jlb07jwUZPaRxlKH+TjipRQhb9xILpCwD4K3NhI5F h1G2Vv5LOWsz2JHAoDTyHEvdQLeaBAaQj3tnKq/ssw== X-Received: by 2002:a7b:c77a:0:b0:38c:2c33:d8f1 with SMTP id x26-20020a7bc77a000000b0038c2c33d8f1mr8292204wmk.115.1649263607436; Wed, 06 Apr 2022 09:46:47 -0700 (PDT) MIME-Version: 1.0 References: <9042bf7daf988e17e17e6acbf5d29590bde869cd.1642440724.git.alexey.v.bayduraev@linux.intel.com> In-Reply-To: From: Ian Rogers Date: Wed, 6 Apr 2022 09:46:34 -0700 Message-ID: Subject: Re: [PATCH v13 01/16] perf record: Introduce thread affinity and mmap masks To: "Bayduraev, Alexey V" Cc: Arnaldo Carvalho de Melo , Jiri Olsa , Namhyung Kim , Alexander Shishkin , Peter Zijlstra , Ingo Molnar , linux-kernel , Andi Kleen , Adrian Hunter , Alexander Antonov , Alexei Budankov , Riccardo Mancini Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-9.5 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE, USER_IN_DEF_DKIM_WL autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Apr 5, 2022 at 9:21 AM Bayduraev, Alexey V wrote: > > On 05.04.2022 1:25, Ian Rogers wrote: > > On Mon, Jan 17, 2022 at 10:38 AM Alexey Bayduraev > > wrote: > >> > >> Introduce affinity and mmap thread masks. Thread affinity mask > > > > > > > In per-thread mode it is possible that cpus is the dummy CPU map here. > > This means that the cpu below has the value -1 and setting bit -1 > > actually has the effect of setting bit 63. Here is a reproduction > > based on the acme/perf/core branch: > > > > ``` > > $ make STATIC=1 DEBUG=1 EXTRA_CFLAGS='-fno-omit-frame-pointer > > -fsanitize=undefined -fno-sanitize-recover' > > $ perf record -o /tmp/perf.data --per-thread true > > tools/include/asm-generic/bitops/atomic.h:10:36: runtime error: shift > > exponent -1 is negative > > $ UBSAN_OPTIONS=abort_on_error=1 gdb --args perf record -o > > /tmp/perf.data --per-thread true > > (gdb) r > > tools/include/asm-generic/bitops/atomic.h:10:36: runtime error: shift > > exponent -1 is negative > > (gdb) bt > > #0 __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:49 > > #1 0x00007ffff71d2546 in __GI_abort () at abort.c:79 > > #2 0x00007ffff640db9f in __sanitizer::Abort () at > > ../../../../src/libsanitizer/sanitizer_common/sanitizer_posix_libcdep.cpp:151 > > #3 0x00007ffff6418efc in __sanitizer::Die () at > > ../../../../src/libsanitizer/sanitizer_common/sanitizer_termination.cpp:58 > > #4 0x00007ffff63fd99e in > > __ubsan::__ubsan_handle_shift_out_of_bounds_abort (Data= > out>, LHS=, > > RHS=) at > > ../../../../src/libsanitizer/ubsan/ubsan_handlers.cpp:378 > > #5 0x0000555555c54405 in set_bit (nr=-1, addr=0x555556ecd0a0) > > at tools/include/asm-generic/bitops/atomic.h:10 > > #6 0x0000555555c6ddaf in record__mmap_cpu_mask_init > > (mask=0x555556ecd070, cpus=0x555556ecd050) at builtin-record.c:3333 > > #7 0x0000555555c7044c in record__init_thread_default_masks > > (rec=0x55555681b100 , cpus=0x555556ecd050) at > > builtin-record.c:3668 > > #8 0x0000555555c705b3 in record__init_thread_masks > > (rec=0x55555681b100 ) at builtin-record.c:3681 > > #9 0x0000555555c7297a in cmd_record (argc=1, argv=0x7fffffffdcc0) at > > builtin-record.c:3976 > > #10 0x0000555555e06d41 in run_builtin (p=0x555556827538 > > , argc=5, argv=0x7fffffffdcc0) at perf.c:313 > > #11 0x0000555555e07253 in handle_internal_command (argc=5, > > argv=0x7fffffffdcc0) at perf.c:365 > > #12 0x0000555555e07508 in run_argv (argcp=0x7fffffffdb0c, > > argv=0x7fffffffdb00) at perf.c:409 > > #13 0x0000555555e07b32 in main (argc=5, argv=0x7fffffffdcc0) at perf.c:539 > > ``` > > > > Not setting the mask->bits if the cpu map is dummy causes no data to > > be written. Setting mask->bits 0 causes a segv. Setting bit 63 works > > but feels like there are more invariants broken in the code. > > > > Here is a not good workaround patch: > > > > diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c > > index ba74fab02e62..62727b676f98 100644 > > --- a/tools/perf/builtin-record.c > > +++ b/tools/perf/builtin-record.c > > @@ -3329,6 +3329,11 @@ static void record__mmap_cpu_mask_init(struct > > mmap_cpu_mask *mask, struct perf_c > > { > > int c; > > > > + if (cpu_map__is_dummy(cpus)) { > > + set_bit(63, mask->bits); > > + return; > > + } > > + > > for (c = 0; c < cpus->nr; c++) > > set_bit(cpus->map[c].cpu, mask->bits); > > } > > > > Alexey, what should the expected behavior be with per-thread mmaps? > > > > Thanks, > > Ian > > Thanks a lot, > > In case of per-thread mmaps we should initialize thread_data[0]->maps[i] by > evlist->mmap[i]. Looks like this was missed by this patchset. > > Your patch works, because it triggers test_bit() in record__thread_data_init_maps() > and thread_data maps get correctly initialized. > > However, it's better to ignore thread_data->masks in record__mmap_cpu_mask_init() > and setup thread_data maps explicitly for per-thread case. > > Also, to prevent more runtime crashes, --per-thread and --threads > options should be mutually exclusive. > > I will prepare a fix for this issue soon. Hi Alexey, sorry to nag, I'm being nagged, is there an ETA on the fix? Is it pragmatic to roll back for now? Thanks, Ian > Regards, > Alexey > > > > >> +static void record__free_thread_masks(struct record *rec, int nr_threads) > >> +{ > >> + int t; > >> + > >> + if (rec->thread_masks) > >> + for (t = 0; t < nr_threads; t++) > >> + record__thread_mask_free(&rec->thread_masks[t]); > >> + > >> + zfree(&rec->thread_masks); > >> +} > >> + > >> +static int record__alloc_thread_masks(struct record *rec, int nr_threads, int nr_bits) > >> +{ > >> + int t, ret; > >> + > >> + rec->thread_masks = zalloc(nr_threads * sizeof(*(rec->thread_masks))); > >> + if (!rec->thread_masks) { > >> + pr_err("Failed to allocate thread masks\n"); > >> + return -ENOMEM; > >> + } > >> + > >> + for (t = 0; t < nr_threads; t++) { > >> + ret = record__thread_mask_alloc(&rec->thread_masks[t], nr_bits); > >> + if (ret) { > >> + pr_err("Failed to allocate thread masks[%d]\n", t); > >> + goto out_free; > >> + } > >> + } > >> + > >> + return 0; > >> + > >> +out_free: > >> + record__free_thread_masks(rec, nr_threads); > >> + > >> + return ret; > >> +} > >> + > >> +static int record__init_thread_default_masks(struct record *rec, struct perf_cpu_map *cpus) > >> +{ > >> + int ret; > >> + > >> + ret = record__alloc_thread_masks(rec, 1, cpu__max_cpu().cpu); > >> + if (ret) > >> + return ret; > >> + > >> + record__mmap_cpu_mask_init(&rec->thread_masks->maps, cpus); > >> + > >> + rec->nr_threads = 1; > >> + > >> + return 0; > >> +} > >> + > >> +static int record__init_thread_masks(struct record *rec) > >> +{ > >> + struct perf_cpu_map *cpus = rec->evlist->core.cpus; > >> + > >> + return record__init_thread_default_masks(rec, cpus); > >> +} > >> + > >> int cmd_record(int argc, const char **argv) > >> { > >> int err; > >> @@ -2948,6 +3063,12 @@ int cmd_record(int argc, const char **argv) > >> goto out; > >> } > >> > >> + err = record__init_thread_masks(rec); > >> + if (err) { > >> + pr_err("Failed to initialize parallel data streaming masks\n"); > >> + goto out; > >> + } > >> + > >> if (rec->opts.nr_cblocks > nr_cblocks_max) > >> rec->opts.nr_cblocks = nr_cblocks_max; > >> pr_debug("nr_cblocks: %d\n", rec->opts.nr_cblocks); > >> @@ -2966,6 +3087,8 @@ int cmd_record(int argc, const char **argv) > >> symbol__exit(); > >> auxtrace_record__free(rec->itr); > >> out_opts: > >> + record__free_thread_masks(rec, rec->nr_threads); > >> + rec->nr_threads = 0; > >> evlist__close_control(rec->opts.ctl_fd, rec->opts.ctl_fd_ack, &rec->opts.ctl_fd_close); > >> return err; > >> } > >> -- > >> 2.19.0 > >>