Received: by 2002:a25:868d:0:0:0:0:0 with SMTP id z13csp3804724ybk; Tue, 19 May 2020 13:22:33 -0700 (PDT) X-Google-Smtp-Source: ABdhPJweaiwn9iQzynANXvaN5s92PSu3n9wJJRdolbHzhDUY1s/4SnC9znj6R59eO3cHJ6xZADI1 X-Received: by 2002:aa7:d487:: with SMTP id b7mr527418edr.351.1589919753507; Tue, 19 May 2020 13:22:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1589919753; cv=none; d=google.com; s=arc-20160816; b=iGdw6o/IVOmyta9dGq9mfMOOziS1OKRJwIhV1t8278qPS+k3dLgs26XaYDJIJxXwWA v1LWBF4HJm9m/xwr5ETBiJT6m/EiN7dOCHoJ+udVrF/VZ88SxVYh7J0xRN8XneJhrrHQ KQ5eMGEV+3B8WdgH7//KHOI6y8ialtkq2LjD+vn73EohOOmFAGPEFhp8DLteevPR7QDH MGH3YVFl4B8e1gNSG7U6idfbs+AAQP/lrX7ZtqXFq/3D1fLNIXx8YIwmFEbRGsAQiDFe iJGrl3DRRvP4WKfkJIK9tUuyIRVmbBXBg5o+O/Q5bc7TqOb7/L6Fy/J0NIiHZI6/tg2R eNkA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=Hgl+Cetu9pfR8P+qMuolFqwnRrDRhF9hWZrf7zDqd3w=; b=gOrz862dVXgNUQqrVmP+SANPlJGtJ1iEuZ0pTFbQgsnI4/AS3EXg7MX9rHh+FZbFzm +oy03v6wbekUHQYuSM2Pa9Ul8eS8l/cbNlCFDUEN2WUSZdMSeUkXjL1FCW8NK50sxU/d 7F235n/J1IGGHyGIZgkVIXmMXAab5XdGZlhGSyzuAA87A46hsJTtsmEN5LfgU/4aGyCq YoJxKd3GoQ+pHTtdw1tFUdZYNXSIx73OCueMacwP7A6k4wYrE56p1vOyhHLjkN9Ywr0O MBSoqBNFDWhpUO3xM5lRX6k1SvLZzRU76yeB6WVurRR1Uu/jW+/z8SX07hx3HAF9JtzB GorQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id pk16si512455ejb.310.2020.05.19.13.22.10; Tue, 19 May 2020 13:22:33 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727938AbgESUUN (ORCPT + 99 others); Tue, 19 May 2020 16:20:13 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55464 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726369AbgESUUN (ORCPT ); Tue, 19 May 2020 16:20:13 -0400 Received: from Galois.linutronix.de (Galois.linutronix.de [IPv6:2a0a:51c0:0:12e:550::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 213DFC08C5C0 for ; Tue, 19 May 2020 13:20:13 -0700 (PDT) Received: from localhost ([127.0.0.1] helo=flow.W.breakpoint.cc) by Galois.linutronix.de with esmtp (Exim 4.80) (envelope-from ) id 1jb8ig-00012c-HJ; Tue, 19 May 2020 22:20:10 +0200 From: Sebastian Andrzej Siewior To: linux-kernel@vger.kernel.org Cc: Peter Zijlstra , Ingo Molnar , Steven Rostedt , Will Deacon , Thomas Gleixner , "Paul E . McKenney" , Linus Torvalds , Julia Cartwright , Phillip Lougher , Alexander Stein , Sebastian Andrzej Siewior Subject: [PATCH 5/8] squashfs: make use of local lock in multi_cpu decompressor Date: Tue, 19 May 2020 22:19:09 +0200 Message-Id: <20200519201912.1564477-6-bigeasy@linutronix.de> X-Mailer: git-send-email 2.26.2 In-Reply-To: <20200519201912.1564477-1-bigeasy@linutronix.de> References: <20200519201912.1564477-1-bigeasy@linutronix.de> MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Julia Cartwright The squashfs multi CPU decompressor makes use of get_cpu_ptr() to acquire a pointer to per-CPU data. get_cpu_ptr() implicitly disables preemption which serializes the access to the per-CPU data. But decompression can take quite some time depending on the size. The observed preempt disabled times in real world scenarios went up to 8ms, causing massive wakeup latencies. This happens on all CPUs as the decompression is fully parallelized. Replace the implicit preemption control with an explicit local lock. This allows RT kernels to substitute it with a real per CPU lock, which serializes the access but keeps the code section preemptible. On non RT kernels this maps to preempt_disable() as before, i.e. no functional change. [ bigeasy: Use local_lock(), patch description] Cc: Phillip Lougher Reported-by: Alexander Stein Signed-off-by: Julia Cartwright Signed-off-by: Sebastian Andrzej Siewior Tested-by: Alexander Stein --- fs/squashfs/decompressor_multi_percpu.c | 19 +++++++++++++------ 1 file changed, 13 insertions(+), 6 deletions(-) diff --git a/fs/squashfs/decompressor_multi_percpu.c b/fs/squashfs/decompre= ssor_multi_percpu.c index 2a2a2d106440e..8a77a2741c176 100644 --- a/fs/squashfs/decompressor_multi_percpu.c +++ b/fs/squashfs/decompressor_multi_percpu.c @@ -8,6 +8,7 @@ #include #include #include +#include =20 #include "squashfs_fs.h" #include "squashfs_fs_sb.h" @@ -23,6 +24,8 @@ struct squashfs_stream { void *stream; }; =20 +static DEFINE_LOCAL_LOCK(stream_lock); + void *squashfs_decompressor_create(struct squashfs_sb_info *msblk, void *comp_opts) { @@ -75,12 +78,16 @@ void squashfs_decompressor_destroy(struct squashfs_sb_i= nfo *msblk) int squashfs_decompress(struct squashfs_sb_info *msblk, struct buffer_head= **bh, int b, int offset, int length, struct squashfs_page_actor *output) { - struct squashfs_stream __percpu *percpu =3D - (struct squashfs_stream __percpu *) msblk->stream; - struct squashfs_stream *stream =3D get_cpu_ptr(percpu); - int res =3D msblk->decompressor->decompress(msblk, stream->stream, bh, b, - offset, length, output); - put_cpu_ptr(stream); + struct squashfs_stream *stream; + int res; + + local_lock(stream_lock); + stream =3D this_cpu_ptr(msblk->stream); + + res =3D msblk->decompressor->decompress(msblk, stream->stream, bh, b, + offset, length, output); + + local_unlock(stream_lock); =20 if (res < 0) ERROR("%s decompression failed, data probably corrupt\n", --=20 2.26.2