Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp2430931pxu; Sat, 28 Nov 2020 13:55:24 -0800 (PST) X-Google-Smtp-Source: ABdhPJyZRtJFInUr1x3l4JfYVdPpC4Xnm/8AhEYilQpoBfbpVe6e/BbdDEJ796pmK1T0oEhi8TuD X-Received: by 2002:a17:907:aaf:: with SMTP id bz15mr13787123ejc.199.1606600524533; Sat, 28 Nov 2020 13:55:24 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1606600524; cv=none; d=google.com; s=arc-20160816; b=km5u7kgcKrIJIpVPQKWdsrdqS76JSf2i6hmIf6GJQ0GzgRQIUaoauVtwfAgnY0CLHn k1KdjYcYxYGQlhI6Pm8ZTtNLuLLQxLu3sYd6Jk7Zbl7IFqcgIO14Rh/QQ5YQp2Ylpz9u Eby7UnFQMsL3VBXFMGZ7elt3YPNuz8BljTGjkonus/AQu1vwhJdrCexU5KVTM60gkQla mTtOPFGMEk3EaMNS1+ZnTc7/Pl/WQaUofEu1O5bSmmOPT6K8u47zdD7Dq/bBp/T6iB9S FI7H0u8IWMruRr6ORBZ4dls11iZ+LD8xZ7pciuy07AO1Gy3Xlsp33WS7rr+ieVm5I7LR AiMA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-disposition :mime-version:message-id:subject:cc:to:from:date:dkim-signature; bh=/Uw53imUcRXDFT7eHM+wU0zdaiBNyjT9GdOjcfo49Ss=; b=r2cgiLd59GYe30vtcj/wKUyLJf1YB+oLt5uEodv49xvO/8cK5JG5IVTc2MnkgARGNR J87h/knZovF+QYycNRRAVMVRn/z+uqU8+wlDMERVondfytRmPOVDwgYFzkaROCyoWHBj z4MNwIw/iOsOvoBGJkQzgjwnDcxvas9esyF2pdaHQJjHB6zwI0HEDQ7oPd2DOlX+q0wc hdDnjjeXqNhfcimiPN002pR844RLw9ijmgv1skFNfDD11m2zsDU9Wn+ehvCiB/8hnbF2 IfAIANNc/kR7NXzG8Zs2olXbN0oX/AwB9WwcsItTAqQpUNNxGyWHAblM/PYf1RVVvHc8 wHcQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@natalenko.name header.s=dkim-20170712 header.b=Q55QKRuH; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=natalenko.name Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id v4si4259994edj.57.2020.11.28.13.55.02; Sat, 28 Nov 2020 13:55:24 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@natalenko.name header.s=dkim-20170712 header.b=Q55QKRuH; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=natalenko.name Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2391175AbgK1VwI (ORCPT + 99 others); Sat, 28 Nov 2020 16:52:08 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43184 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1732099AbgK1S6F (ORCPT ); Sat, 28 Nov 2020 13:58:05 -0500 Received: from vulcan.natalenko.name (vulcan.natalenko.name [IPv6:2001:19f0:6c00:8846:5400:ff:fe0c:dfa0]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4713EC02A1A2 for ; Sat, 28 Nov 2020 06:05:31 -0800 (PST) Received: from localhost (home.natalenko.name [151.237.229.131]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by vulcan.natalenko.name (Postfix) with ESMTPSA id 38F318B1827; Sat, 28 Nov 2020 15:05:24 +0100 (CET) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=natalenko.name; s=dkim-20170712; t=1606572324; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding; bh=/Uw53imUcRXDFT7eHM+wU0zdaiBNyjT9GdOjcfo49Ss=; b=Q55QKRuHAV4109vIVy07d/RjUVWUOvU1YeyG0XG01k187zVu/BEh3v6OvdPzcntDJjSXYn /r76Ss+jOYZQsNyD6DQ0eQs6IacAshFwhajSF3poywBtVOpsPMzTt0mmnwPo9AXUA7RN8O 0T16+1Jfi7IkXLXAp+EGkSoEgNHxbIk= Date: Sat, 28 Nov 2020 15:05:23 +0100 From: Oleksandr Natalenko To: linux-kernel@vger.kernel.org Cc: linux-mm@kvack.org, Andrew Morton , Sebastian Andrzej Siewior , Steven Rostedt , Mike Galbraith , Thomas Gleixner Subject: scheduling while atomic in z3fold Message-ID: <20201128140523.ovmqon5fjetvpby4@spock.localdomain> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi. While running v5.10-rc5-rt11 I bumped into the following: ``` BUG: scheduling while atomic: git/18695/0x00000002 Preemption disabled at: [] z3fold_zpool_malloc+0x463/0x6e0 … Call Trace: dump_stack+0x6d/0x88 __schedule_bug.cold+0x88/0x96 __schedule+0x69e/0x8c0 preempt_schedule_lock+0x51/0x150 rt_spin_lock_slowlock_locked+0x117/0x2c0 rt_spin_lock_slowlock+0x58/0x80 rt_spin_lock+0x2a/0x40 z3fold_zpool_malloc+0x4c1/0x6e0 zswap_frontswap_store+0x39c/0x980 __frontswap_store+0x6e/0xf0 swap_writepage+0x39/0x70 shmem_writepage+0x31b/0x490 pageout+0xf4/0x350 shrink_page_list+0xa28/0xcc0 shrink_inactive_list+0x300/0x690 shrink_lruvec+0x59a/0x770 shrink_node+0x2d6/0x8d0 do_try_to_free_pages+0xda/0x530 try_to_free_pages+0xff/0x260 __alloc_pages_slowpath.constprop.0+0x3d5/0x1230 __alloc_pages_nodemask+0x2f6/0x350 allocate_slab+0x3da/0x660 ___slab_alloc+0x4ff/0x760 __slab_alloc.constprop.0+0x7a/0x100 kmem_cache_alloc+0x27b/0x2c0 __d_alloc+0x22/0x230 d_alloc_parallel+0x67/0x5e0 __lookup_slow+0x5c/0x150 path_lookupat+0x2ea/0x4d0 filename_lookup+0xbf/0x210 vfs_statx.constprop.0+0x4d/0x110 __do_sys_newlstat+0x3d/0x80 do_syscall_64+0x33/0x40 entry_SYSCALL_64_after_hwframe+0x44/0xa9 ``` The preemption seems to be disabled here: ``` $ scripts/faddr2line mm/z3fold.o z3fold_zpool_malloc+0x463 z3fold_zpool_malloc+0x463/0x6e0: add_to_unbuddied at mm/z3fold.c:645 (inlined by) z3fold_alloc at mm/z3fold.c:1195 (inlined by) z3fold_zpool_malloc at mm/z3fold.c:1737 ``` The call to the rt_spin_lock() seems to be here: ``` $ scripts/faddr2line mm/z3fold.o z3fold_zpool_malloc+0x4c1 z3fold_zpool_malloc+0x4c1/0x6e0: add_to_unbuddied at mm/z3fold.c:649 (inlined by) z3fold_alloc at mm/z3fold.c:1195 (inlined by) z3fold_zpool_malloc at mm/z3fold.c:1737 ``` Or, in source code: ``` 639 /* Add to the appropriate unbuddied list */ 640 static inline void add_to_unbuddied(struct z3fold_pool *pool, 641 struct z3fold_header *zhdr) 642 { 643 if (zhdr->first_chunks == 0 || zhdr->last_chunks == 0 || 644 zhdr->middle_chunks == 0) { 645 struct list_head *unbuddied = get_cpu_ptr(pool->unbuddied); 646 647 int freechunks = num_free_chunks(zhdr); 648 spin_lock(&pool->lock); 649 list_add(&zhdr->buddy, &unbuddied[freechunks]); 650 spin_unlock(&pool->lock); 651 zhdr->cpu = smp_processor_id(); 652 put_cpu_ptr(pool->unbuddied); 653 } 654 } ``` Shouldn't the list manipulation be protected with local_lock+this_cpu_ptr instead of get_cpu_ptr+spin_lock? Thanks. -- Oleksandr Natalenko (post-factum)