Received: by 2002:a05:6358:489b:b0:bb:da1:e618 with SMTP id x27csp99394rwn; Wed, 14 Sep 2022 16:48:06 -0700 (PDT) X-Google-Smtp-Source: AA6agR4ymmoW0J3nXhXSHjcz80UyySIqpMGXkCMNEHVW9ButUT8drGQI5MttYDYGhvxbGP7WqvhV X-Received: by 2002:a17:906:7094:b0:73d:c95a:e66e with SMTP id b20-20020a170906709400b0073dc95ae66emr26796475ejk.179.1663199286353; Wed, 14 Sep 2022 16:48:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1663199286; cv=none; d=google.com; s=arc-20160816; b=kl+NHfHCt4c5qlyl3smGcw4MCtI+gcKqOtb5tmpry3HvM9TYzdkxhVRzC9PrDlnTZY BKoChw+C/UvZJQ/3EJNe4qRidBqnU+P8upyMLDxm+awkAJVLP2styj4W4i796bX7yD+x 40c2P1fQfjVyCeyk+FfJQoJM9zIxM0hd4LdhQ/X4sCjhzlVVbZONV1o6f0dqt9sKaN/3 Dp27mCDeydCgrJFxB8QeH8B5aqUk5XSYKyDQhJGZCCAsB4FJLvYSDRIHq2fDoLdRLEix 9CMA9EN/DO31+4YofR5scL7zXdNXaj1li9WPYovGhgOFD5JHLUT7jSn/ifW+JwBOL8ZX +1fg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:date :dkim-signature; bh=3M6Rx47eVZFkw6BYdlIzCxzWQ8Movj35erGXuZD4gLE=; b=Q+xi4GpYirHIxok3Qk0iOkXqBSoQRk01PKa7lA/oBOUQ3N6HAEundn4TU1oANgF8/I WVNgqrizoLouSw9gzmGNyCu0fUNlb4RZSGFG/LybE6PBlFoYbEgreO+Z77TUR4gfVxD1 TqJkKDPu8Yh7eGjFisasDatq/rKRllvzRANintcKUqV1K6jBx73OYFWMr3wr4Y/eo9JE OaM4bJvaJUR7tSJO8ZZ89ROZU5fRRUBmLWaQ0OhvNsz7o9Zx4/blXU3MxLb6SXKsfaCv UINPd1dFcOWolAl92NwwF5jt9lT+2By+/Imzl/fBeaGT7s2UbKsiNLk2/G8PBApZvT2o wUJA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linux-foundation.org header.s=korg header.b=BuIdCB2D; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id q28-20020a50aa9c000000b0043ddc200046si12854723edc.454.2022.09.14.16.47.41; Wed, 14 Sep 2022 16:48:06 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@linux-foundation.org header.s=korg header.b=BuIdCB2D; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229719AbiINWfr (ORCPT + 99 others); Wed, 14 Sep 2022 18:35:47 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:44740 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229629AbiINWfq (ORCPT ); Wed, 14 Sep 2022 18:35:46 -0400 Received: from ams.source.kernel.org (ams.source.kernel.org [IPv6:2604:1380:4601:e00::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id AFE137EFE6 for ; Wed, 14 Sep 2022 15:35:45 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id 67B1BB81CE0 for ; Wed, 14 Sep 2022 22:35:44 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id D4F73C433C1; Wed, 14 Sep 2022 22:35:42 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linux-foundation.org; s=korg; t=1663194943; bh=V3UnjH8r8/HZCf/bUXEv6LLNwHtF7DQP1lCGuLGnSdc=; h=Date:From:To:Cc:Subject:In-Reply-To:References:From; b=BuIdCB2Dng/VZh/rE6kuH/hBzh9AoNuhbKvfiKDY76Tsl76mj9UVm+7dE7qLi0fqT X8WEcEkV5ma5YeYJP+ZkT+5+gAg7jodnZakuRY66tb/J2kAdVOSnvZafZJ1TKgy1qU 0wU9+nOA78D4kzV6UZL6EgRZszStVxVumn2JCTnQ= Date: Wed, 14 Sep 2022 15:35:42 -0700 From: Andrew Morton To: Shuai Xue Cc: naoya.horiguchi@nec.com, linmiaohe@huawei.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org, cuibixuan@linux.alibaba.com, baolin.wang@linux.alibaba.com, zhuo.song@linux.alibaba.com, Huang Ying Subject: Re: [PATCH] mm,hwpoison: check mm when killing accessing process Message-Id: <20220914153542.285f870f728c6129a479a69d@linux-foundation.org> In-Reply-To: <20220914064935.7851-1-xueshuai@linux.alibaba.com> References: <20220914064935.7851-1-xueshuai@linux.alibaba.com> X-Mailer: Sylpheed 3.7.0 (GTK+ 2.24.33; x86_64-redhat-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-8.7 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, 14 Sep 2022 14:49:35 +0800 Shuai Xue wrote: > The GHES code calls memory_failure_queue() from IRQ context to queue work > into workqueue and schedule it on the current CPU. Then the work is > processed in memory_failure_work_func() by kworker and calls > memory_failure(). > > When a page is already poisoned, commit a3f5d80ea401 ("mm,hwpoison: send > SIGBUS with error virutal address") make memory_failure() call > kill_accessing_process() that: > > - holds mmap locking of current->mm > - does pagetable walk to find the error virtual address > - and sends SIGBUS to the current process with error info. > > However, the mm of kworker is not valid. Therefore, check mm when killing > accessing process. Thanks. When fixing a bug, please always describe the user-visible effects of tha bug. I'm thinking "null pointer deref crashes the kernel". > Fixes: a3f5d80ea401 ("mm,hwpoison: send SIGBUS with error virutal address") > Signed-off-by: Shuai Xue I'll add cc:stable. > --- a/mm/memory-failure.c > +++ b/mm/memory-failure.c > @@ -743,6 +743,9 @@ static int kill_accessing_process(struct task_struct *p, unsigned long pfn, > }; > priv.tk.tsk = p; > > + if (!p->mm) > + return -EFAULT; > + > mmap_read_lock(p->mm); > ret = walk_page_range(p->mm, 0, TASK_SIZE, &hwp_walk_ops, > (void *)&priv); > @@ -751,6 +754,7 @@ static int kill_accessing_process(struct task_struct *p, unsigned long pfn, > else > ret = 0; > mmap_read_unlock(p->mm); > + > return ret > 0 ? -EHWPOISON : -EFAULT; > } This is an unrelated change which doesn't appear to match the style in memory-failure.c, so I'll drop this hunk.