Received: by 2002:a05:6a10:1a4d:0:0:0:0 with SMTP id nk13csp2187662pxb; Wed, 9 Feb 2022 12:46:03 -0800 (PST) X-Google-Smtp-Source: ABdhPJzPmVsbC418WHxzdNI9FqITGingExeC/GzrCvoCAtNxmMLfujR+Zo5qTnGxLZxGZjU16HML X-Received: by 2002:a63:8849:: with SMTP id l70mr3235462pgd.437.1644439562895; Wed, 09 Feb 2022 12:46:02 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1644439562; cv=none; d=google.com; s=arc-20160816; b=N2oADzTchDf9dNhJGX0/fs4P6vOxaVkXmKXsPkAreaLreQJxJV20o/QUdBPEQh0pcH zXYNTh1Rg6SJgDvUYgmCH3iT+vh8rcxCADeDpPH+p95gRAdAv9KIIdvLS4jJZNCVgO7M isOsOWnJJ1Zy2ExhYXWNZP8z6ER83Mv18y+mWbrUucXAzWBbJXEdAkiwDZs6+71DZT8+ 4HpeFc4zeiAo6ROj2tXNgjppaiWiWYgie73BJYR/h6DvZR+yzpIO275p+5i3lpKNyRhz C3CLj2sSeXuBAo/n9cjQvORgHCZ1cqxT/W6jV1IwFxYqsDyipsBj9wn0PKlvOV1ZZ3ku AwQg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:thread-index:thread-topic :content-transfer-encoding:mime-version:subject:references :in-reply-to:message-id:cc:to:from:date:dkim-signature:dkim-filter; bh=yADqTyQkhJP9OM2muqi1wi9cZWwijNMCjs7GZNnU5nc=; b=B+5vT4eLlCwDtvXtAd6pg1YWKiAZLZ8+wff9MikXir824HhEK4nqJhenjSsac9qEIy /a82Icsds73PISFAliUVnizV3OK6af+RSeFun7b4yW5t9miOcSxMoEEtc/NNbCRKAs9d A40+YtKOZqbHNALDN6DXtO5FBurJexwdUTQVBKUNiFUTH8EuuY4RA+TyQR9FfViebXnm bt3UiuI+dliQzjt8E+2G55Tht9LYwLPGcJljTZIfO2xCYQmJGWVbGAonKr+yIHHvXKhP BMkTA/fMuVz8+bwAyABHmLpRuX9VLVmFmQlJtQzF8/2Qw+2ufojfAqripp3GTPEEZxHW StnA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@efficios.com header.s=default header.b=hjAFoufI; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=efficios.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id t14si613675pgr.63.2022.02.09.12.45.58 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 09 Feb 2022 12:46:02 -0800 (PST) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@efficios.com header.s=default header.b=hjAFoufI; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=efficios.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 2D7D5E0918F7; Wed, 9 Feb 2022 12:11:41 -0800 (PST) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236317AbiBITcP (ORCPT + 99 others); Wed, 9 Feb 2022 14:32:15 -0500 Received: from gmail-smtp-in.l.google.com ([23.128.96.19]:35252 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236001AbiBITbo (ORCPT ); Wed, 9 Feb 2022 14:31:44 -0500 Received: from mail.efficios.com (mail.efficios.com [167.114.26.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 36380DF28A5F; Wed, 9 Feb 2022 11:23:33 -0800 (PST) Received: from localhost (localhost [127.0.0.1]) by mail.efficios.com (Postfix) with ESMTP id E5C483B48DE; Wed, 9 Feb 2022 14:17:09 -0500 (EST) Received: from mail.efficios.com ([127.0.0.1]) by localhost (mail03.efficios.com [127.0.0.1]) (amavisd-new, port 10032) with ESMTP id p4pAwA43GFBW; Wed, 9 Feb 2022 14:17:09 -0500 (EST) Received: from localhost (localhost [127.0.0.1]) by mail.efficios.com (Postfix) with ESMTP id 477333B492B; Wed, 9 Feb 2022 14:17:09 -0500 (EST) DKIM-Filter: OpenDKIM Filter v2.10.3 mail.efficios.com 477333B492B DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=efficios.com; s=default; t=1644434229; bh=yADqTyQkhJP9OM2muqi1wi9cZWwijNMCjs7GZNnU5nc=; h=Date:From:To:Message-ID:MIME-Version; b=hjAFoufIR8rv34T8WsJNcQHG8/h0g9PvgdEhLOVfwKchQzI7I7kmaa8BmDDKxMIb8 qWMtRNQ9+HYHp5RDTPgJQyWNwtJShaSLCaeaWeVFtlSiRpb8mSI0xOOaYinY+Fpkkt 3o3JwteTLTuKrtXLK4G2qjxABKI2FQHxhvqVkuso+VgtU/YXYUEpEM75ArV9nCO66D msEjAjad96iZYXYzCEJmCdSbVqIUrQKcdJ4RHsEWlhVnJx0YsSZuQdgiRpNm7EF9IZ TqtlkNotNQEZholmuLFREIRKaa75WLNb0VPCsfQLH6JPerbhUJFETOJh59k9GszePF ibWtvtXK55TGA== X-Virus-Scanned: amavisd-new at efficios.com Received: from mail.efficios.com ([127.0.0.1]) by localhost (mail03.efficios.com [127.0.0.1]) (amavisd-new, port 10026) with ESMTP id 1xFgL6eLxLgz; Wed, 9 Feb 2022 14:17:09 -0500 (EST) Received: from mail03.efficios.com (mail03.efficios.com [167.114.26.124]) by mail.efficios.com (Postfix) with ESMTP id 313EF3B47F2; Wed, 9 Feb 2022 14:17:09 -0500 (EST) Date: Wed, 9 Feb 2022 14:17:09 -0500 (EST) From: Mathieu Desnoyers To: Waiman Long Cc: Peter Zijlstra , Namhyung Kim , Ingo Molnar , Will Deacon , Boqun Feng , linux-kernel , Thomas Gleixner , rostedt , Byungchul Park , "Paul E. McKenney" , Radoslaw Burny , Tejun Heo , rcu , cgroups , linux-btrfs , intel-gfx Message-ID: <593915946.50384.1644434229077.JavaMail.zimbra@efficios.com> In-Reply-To: <69e5f778-8715-4acf-c027-58b6ec4a9e77@redhat.com> References: <20220208184208.79303-1-namhyung@kernel.org> <20220209090908.GK23216@worktop.programming.kicks-ass.net> <24fe6a08-5931-8e8d-8d77-459388c4654e@redhat.com> <919214156.50301.1644431371345.JavaMail.zimbra@efficios.com> <69e5f778-8715-4acf-c027-58b6ec4a9e77@redhat.com> Subject: Re: [RFC 00/12] locking: Separate lock tracepoints from lockdep/lock_stat (v1) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Originating-IP: [167.114.26.124] X-Mailer: Zimbra 8.8.15_GA_4203 (ZimbraWebClient - FF96 (Linux)/8.8.15_GA_4203) Thread-Topic: locking: Separate lock tracepoints from lockdep/lock_stat (v1) Thread-Index: +C+VmEK8/wKvHxydXodKzSxj1FQLTw== X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org ----- On Feb 9, 2022, at 2:02 PM, Waiman Long longman@redhat.com wrote: > On 2/9/22 13:29, Mathieu Desnoyers wrote: >> ----- On Feb 9, 2022, at 1:19 PM, Waiman Long longman@redhat.com wrote: >> >>> On 2/9/22 04:09, Peter Zijlstra wrote: >>>> On Tue, Feb 08, 2022 at 10:41:56AM -0800, Namhyung Kim wrote: >>>> >>>>> Eventually I'm mostly interested in the contended locks only and I >>>>> want to reduce the overhead in the fast path. By moving that, it'd be >>>>> easy to track contended locks with timing by using two tracepoints. >>>> So why not put in two new tracepoints and call it a day? >>>> >>>> Why muck about with all that lockdep stuff just to preserve the name >>>> (and in the process continue to blow up data structures etc..). This >>>> leaves distros in a bind, will they enable this config and provide >>>> tracepoints while bloating the data structures and destroying things >>>> like lockref (which relies on sizeof(spinlock_t)), or not provide this >>>> at all. >>>> >>>> Yes, the name is convenient, but it's just not worth it IMO. It makes >>>> the whole proposition too much of a trade-off. >>>> >>>> Would it not be possible to reconstruct enough useful information from >>>> the lock callsite? >>>> >>> I second that as I don't want to see the size of a spinlock exceeds 4 >>> bytes in a production system. >>> >>> Instead of storing additional information (e.g. lock name) directly into >>> the lock itself. Maybe we can store it elsewhere and use the lock >>> address as the key to locate it in a hash table. We can certainly extend >>> the various lock init functions to do that. It will be trickier for >>> statically initialized locks, but we can probably find a way to do that too. >> If we go down that route, it would be nice if we can support a few different >> use-cases for various tracers out there. >> >> One use-case (a) requires the ability to query the lock name based on its >> address as key. >> For this a hash table is a good fit. This would allow tracers like ftrace to >> output lock names in its human-readable output which is formatted within the >> kernel. >> >> Another use-case (b) is to be able to "dump" the lock { name, address } tuples >> into the trace stream (we call this statedump events in lttng), and do the >> translation from address to name at post-processing. This simply requires >> that this information is available for iteration for both the core kernel >> and module locks, so the tracer can dump this information on trace start >> and module load. >> >> Use-case (b) is very similar to what is done for the kernel tracepoints. Based >> on this, implementing the init code that iterates on those sections and >> populates >> a hash table for use-case (a) should be easy enough. > > Yes, that are good use cases for this type of functionality. I do need > to think about how to do it for statically initialized lock first. Tracepoints already solved that problem. Look at the macro DEFINE_TRACE_FN() in include/linux/tracepoint.h. You will notice that it statically defines a struct tracepoint in a separate section and a tracepoint_ptr_t in a __tracepoints_ptrs section. Then the other parts of the picture are in kernel/tracepoint.c: extern tracepoint_ptr_t __start___tracepoints_ptrs[]; extern tracepoint_ptr_t __stop___tracepoints_ptrs[]; and kernel/module.c:find_module_sections() #ifdef CONFIG_TRACEPOINTS mod->tracepoints_ptrs = section_objs(info, "__tracepoints_ptrs", sizeof(*mod->tracepoints_ptrs), &mod->num_tracepoints); #endif and the iteration code over kernel and modules in kernel/tracepoint.c. All you need in addition is in include/asm-generic/vmlinux.lds.h, we add to the DATA_DATA define an entry such as: STRUCT_ALIGN(); \ *(__tracepoints) \ and in RO_DATA: . = ALIGN(8); \ __start___tracepoints_ptrs = .; \ KEEP(*(__tracepoints_ptrs)) /* Tracepoints: pointer array */ \ __stop___tracepoints_ptrs = .; AFAIU, if you do something similar for a structure that contains your relevant lock information, it should be straightforward to handle statically initialized locks. Thanks, Mathieu > > Thanks, > Longman -- Mathieu Desnoyers EfficiOS Inc. http://www.efficios.com