Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp164194pxu; Wed, 2 Dec 2020 18:23:47 -0800 (PST) X-Google-Smtp-Source: ABdhPJy4m2jdGqw8mScy9UThfZ/WH5yCfCDjjRojXIV42s7gaaczvAyOn4A36N21k0Zfu1cjURf9 X-Received: by 2002:a05:6402:d08:: with SMTP id eb8mr894543edb.271.1606962227520; Wed, 02 Dec 2020 18:23:47 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1606962227; cv=none; d=google.com; s=arc-20160816; b=cDY/zTo6S4HsBbeD+DXpp8ZIWe2MfkbpUCWYQv/PXD+yhLhtQcbLcP7RdaC2FmROdK PAemuJIc65r9dTuUXEy8t0TGpTnHQJY3+uAKIcx/vx2u3hzSksmneouUQuKVFr3mC8ba Ws1AhxaCB3UrKc/QyUo1OMHdCxjfUx4eMTc9LURarTSF8ijkuvuAsYDVZx8azrPH2sEG GCLGZ9y1ZTfWV19u0ttLbJkNeQXwLdX2QAizdwrVlPCdIo2YBlDb+s4n0YXa8iry4qAO sOLIL3oVFCtzwuK4zdHBonhFrG57w82Hhzl2FX7z6J/cWpjDL4fZdWT9CZQSA5tHlL08 68XQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:date:cc:to:from:subject :message-id:dkim-signature; bh=oqRc/wRB7ecoBdIY8/28uHNVcQaiZVdNQBHwCHQQlqg=; b=SmN5kMiM73vxfniFa490XlusneINjpHKu/tQV4jw4jAhUPpPXDNowsZWhi42AGmsMW jBPa2nj5HvOT9/hD/89oecbEwhISTQvPbyi1kelgJdQZVrnPs/yKmafzi4rV1XQEXd9r dpRBlX6uHOxhAZ965Wf53k922jInVyjE9eC0fDKB6L4AtiGROGckaybF+fvanKwhhjh8 WytITgSjzmVsPiPGhKLV+JHzXTYpFYzu72sS6WMXIkdKJSRPmwHnFt1CDjJCnJjDwztm 8QGmgOo4E7AgUBOOHCsWoxzx+nHv5JEXHZnc/jsyPkQgwdVpApmBpUODGR8Hx/taQKVL guxg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmx.net header.s=badeba3b8450 header.b=gogmm6l7; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id dc22si308605ejb.568.2020.12.02.18.23.24; Wed, 02 Dec 2020 18:23:47 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmx.net header.s=badeba3b8450 header.b=gogmm6l7; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728016AbgLCCSc (ORCPT + 99 others); Wed, 2 Dec 2020 21:18:32 -0500 Received: from mout.gmx.net ([212.227.15.15]:57811 "EHLO mout.gmx.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725859AbgLCCSc (ORCPT ); Wed, 2 Dec 2020 21:18:32 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1606961774; bh=2skB8aCmzq6dDMF6IK8Lo93lDruUxEaD4jrfymHSWYI=; h=X-UI-Sender-Class:Subject:From:To:Cc:Date:In-Reply-To:References; b=gogmm6l7roQjuQsJuE25Siqr8BvuY0N9sf6/PsU57Ngn+BLIGkDKvsvibqBnGFeo3 q5cTmxXq9jKRB9achhymSg4Xos6GGzRnUXmnCouzszqEmhTjDgkHKfxj9/rRWaEgvi U83JiQEnl0giIJ7dzK4HO5UIR+gAl+udJxV2wQDI= X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c Received: from homer.fritz.box ([185.221.149.242]) by mail.gmx.com (mrgmx004 [212.227.17.190]) with ESMTPSA (Nemesis) id 1M9o1v-1kqRat3AX7-005mkW; Thu, 03 Dec 2020 03:16:14 +0100 Message-ID: Subject: Re: scheduling while atomic in z3fold From: Mike Galbraith To: Sebastian Andrzej Siewior , Vitaly Wool Cc: Oleksandr Natalenko , linux-kernel@vger.kernel.org, linux-mm@kvack.org, Andrew Morton , Steven Rostedt , Thomas Gleixner , linux-rt-users@vger.kernel.org Date: Thu, 03 Dec 2020 03:16:12 +0100 In-Reply-To: <20201202220826.5chy56mbgvrwmg3d@linutronix.de> References: <20201129112922.db53kmtpu76xxukj@spock.localdomain> <90c4857c53b657147bfb71a281ece9839b0373c2.camel@gmx.de> <20201130132014.mlvxeyiub3fpwyw7@linutronix.de> <856b5cc2a3d4eb673743b52956bf1e60dcdf87a1.camel@gmx.de> <20201130145229.mhbkrfuvyctniaxi@linutronix.de> <05121515e73891ceb9e5caf64b6111fc8ff43fab.camel@gmx.de> <20201130160327.ov32m4rapk4h432a@linutronix.de> <20201202220826.5chy56mbgvrwmg3d@linutronix.de> Content-Type: text/plain; charset="UTF-8" User-Agent: Evolution 3.34.4 MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable X-Provags-ID: V03:K1:gzCigAg9onWa1Aqoc6zWJqyVUF3JkkH2Dk64Hun2h5p+wwryf7t wfKUwmppG/nyTXUQ9c9caw6za5jHhQHxJMjYTgOygZPwalxvCFjKbGWQmb6SR1h+TqDdS5c 8YHWZivhGYS1k8/oXJowLGn7zoVN28DGoifNt8lmQek18IzkPQwdkMMIrkhXAn4vcWh7ocu iHtyysyQJw8SdbdEyEqQQ== X-Spam-Flag: NO X-UI-Out-Filterresults: notjunk:1;V03:K0:6jyZYoie4Uk=:HUuO4gxdrQ0umfsbQ/Y48v ei4+2554XhixiiE1bFZhzZnzLdniSDVa6pVX1cQOv3leG+qtAb5YNbeYS+/1Ex/OtShB47sBz bqg3W0m0Y4XhLSVisFL1LGFWti2eEPj6E4vBCscXtdHPowQqjyzl9YMbBfB+BGi+LOrUkq/so FsUSPeyGBKYKOGQrXYVMN41/w26Tytj7VrjJAlbFM2rMx+TVWXBBkrKsgK/mPXHfgm/CYvxZk /9SqSDqEp9f1dn/19azrFJ+YcxQSjCEZ/6bOjq/ldYBcjLiQezDFCU74r39S4LU+G+Xs8OkrA 9Oiz3gTWiYc4DzhHuzClcW4KSbQGaPxJnzbz9rfyWX0rE3TmV4x1Sk9ebtLEYJy+SG0eBdro6 FvSSn5CT6CYhQrr86UcC6d5PgY4o9ocK+ewOO4e4+rCjdouecua463LSEnmmdlGBeJrABHhPm IZhuCZFTGR6aBSePL5Qj3182ZAi53RcrxcsLjWFQSdaRdhttCp2LR6NTQ8fL+oJzKtPRbeUEM vCEl9CqIkzfN8G/jZj18L6TdRrc6hpF2+6A1nwwrhxB2+96Yiqg8VaHO5QOUrrAVHQpxhMWtW dk7XmKzGb29MtFJ68MH2eQ78e06QUc4wyf/j/jZPRC28IWc+ITMSs+7hjgFfAzEsZpE1xB4II XxnvJoN/hYhWZb2uKmQq9hARfqH2hj3hScYr0EbChCzDhY8dKlgv35J7JXLbmvD5jLxv4xpGe GXV0MnqdTtSwLki8dI/blDRurwTTguSDvfwU2rPzbPeq/RURFvFGryqenZGRFwhzYF787odRg xzWG53Pa+/CW4bnEDr9+sK5uUd8er1UgUe3zO+tj5BIMqxNSi6Ug40H463lUp3vXjvW+C17G3 NRm+lx3r9mHUDHdOxaqQ== Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, 2020-12-02 at 23:08 +0100, Sebastian Andrzej Siewior wrote: > On 2020-12-02 03:30:27 [+0100], Mike Galbraith wrote: > > > What I'm seeing is the below. rt_mutex_has_waiters() says yup we have > > a waiter, rt_mutex_top_waiter() emits the missing cached leftmost, and > > rt_mutex_dequeue_pi() chokes on it. Lock is buggered. > > correct. So this: > > diff --git a/mm/z3fold.c b/mm/z3fold.c > --- a/mm/z3fold.c > +++ b/mm/z3fold.c > @@ -342,7 +342,7 @@ static inline void free_handle(unsigned long handle) > > if (is_free) { > struct z3fold_pool *pool =3D slots_to_pool(slots); > - > + memset(slots, 0xee, sizeof(struct z3fold_buddy_slots)); > kmem_cache_free(pool->c_handle, slots); > } > } > @@ -548,8 +549,10 @@ static void __release_z3fold_page(struct z3fold_hea= der *zhdr, bool locked) > set_bit(HANDLES_ORPHANED, &zhdr->slots->pool); > read_unlock(&zhdr->slots->lock); > > - if (is_free) > + if (is_free) { > + memset(zhdr->slots, 0xdd, sizeof(struct z3fold_buddy_slots)); > kmem_cache_free(pool->c_handle, zhdr->slots); > + } > > if (locked) > z3fold_page_unlock(zhdr); > > resulted in: > > |[ 377.200696] Out of memory: Killed process 284358 (oom01) total-vm:15= 780488kB, anon-rss:150624kB, file-rss:0kB, shmem-rss:0kB, UID:0 pgtables:1= 6760kB oom_score_adj:0 > |[ 377.205438] ------------[ cut here ]------------ > |[ 377.205441] pvqspinlock: lock 0xffff8880105c6828 has corrupted value= 0xdddddddd! > |[ 377.205448] WARNING: CPU: 6 PID: 72 at kernel/locking/qspinlock_para= virt.h:498 __pv_queued_spin_unlock_slowpath+0xb3/0xc0 > |[ 377.205455] Modules linked in: > |[ 377.205456] CPU: 6 PID: 72 Comm: oom_reaper Not tainted 5.10.0-rc6-r= t13-rt+ #103 > |[ 377.205458] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS= 1.14.0-1 04/01/2014 > |[ 377.205460] RIP: 0010:__pv_queued_spin_unlock_slowpath+0xb3/0xc0 > =E2=80=A6 > |[ 377.205475] Call Trace: > |[ 377.205477] __raw_callee_save___pv_queued_spin_unlock_slowpath+0x11= /0x20 > |[ 377.205481] .slowpath+0x9/0xe > |[ 377.205483] _raw_spin_unlock_irqrestore+0x5/0x50 > |[ 377.205486] rt_mutex_futex_unlock+0x9e/0xb0 > |[ 377.205488] z3fold_free+0x2b0/0x470 > |[ 377.205491] zswap_free_entry+0x7d/0xc0 > |[ 377.205493] zswap_frontswap_invalidate_page+0x87/0x90 > |[ 377.205495] __frontswap_invalidate_page+0x58/0x90 > |[ 377.205496] swap_range_free.constprop.0+0x99/0xb0 > |[ 377.205499] swapcache_free_entries+0x131/0x390 > |[ 377.205501] free_swap_slot+0x99/0xc0 > |[ 377.205502] __swap_entry_free+0x8a/0xa0 > |[ 377.205504] free_swap_and_cache+0x36/0xd0 > |[ 377.205506] zap_pte_range+0x16a/0x940 > |[ 377.205509] unmap_page_range+0x1d8/0x310 > |[ 377.205514] __oom_reap_task_mm+0xe7/0x190 > |[ 377.205520] oom_reap_task_mm+0x5a/0x280 > |[ 377.205521] oom_reaper+0x98/0x1c0 > |[ 377.205525] kthread+0x18c/0x1b0 > |[ 377.205528] ret_from_fork+0x22/0x30 > |[ 377.205531] ---[ end trace 0000000000000002 ]--- > > Then I reverted commit > 4a3ac9311dac3 ("mm/z3fold.c: add inter-page compaction") > > and it seems to work now. Any suggestions? It looks like use-after-free. Looks like... d8f117abb380 z3fold: fix use-after-free when freeing handles ...wasn't completely effective. write_unlock() in handle_free() is where I see explosions. Only trouble is that 4a3ac9311dac3 arrived in 5.5, and my 5.[5-7]-rt refuse to reproduce (d8f117abb380 applied), whereas 5.9 and 5.10 do so quite reliably. There is a heisenbug aspect though, one trace_printk() in handle_free() made bug go hide, so the heisen-fairies in earlier trees were probably just messing with me.. because they can, being the twisted little freaks they are ;-) -Mike