Received: by 2002:a05:6a10:413:0:0:0:0 with SMTP id 19csp953666pxp; Wed, 16 Mar 2022 22:30:06 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwSa1tZ00sAZERWwSgbHGUnzruoXqDU82VOJWy6dgzfXnoA/O/lJ9jbw1KJjp2LAgAK3fJL X-Received: by 2002:a17:90a:4590:b0:1bc:4afa:1778 with SMTP id v16-20020a17090a459000b001bc4afa1778mr14019396pjg.14.1647495006422; Wed, 16 Mar 2022 22:30:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1647495006; cv=none; d=google.com; s=arc-20160816; b=lORr4q4sKtlscVttJ3jLGyhnchVHSFPLIFVYuyFpeZNe2Fpl4gHknBH/VCOE0vhzxu as3pZJPYH5XbzAdaTOwdU7A5K07pgVmT2A6HICx44v06uvJMmYZBdXzGLbcEFGibbiJe Kazjd2lyvmngiY9ncIT7zv1wlVtJ6+/X2c32hDKrTn1TuTSVJcbLlHJfzCa/neUoER83 WrEkrGAMDS1PKbJZlMfJtTE5uQ4jNNAiaMsjp5vPEUCFBsPw84LTDWJOdqmkYFS5CSN6 D8bIwtrKXUt3XiGLe0HUAYXkL4MDH22hAjhs3p1w3MHVXvY9OJ6yyMyzuto2KzvapZ9k 2ppg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=+VVGHpNw2aToH4nlv/qP4fsUY4xFNKKSta7CoxCOLqU=; b=1FtDfS0DdZPPFYEnZ33rFUXkodGHyknGnbwM7KKsYKXWW2hbyiLPNiFMXIPdTZOFRE 59DWXW1uMxisWB+SFGO4A/DYlq22OTonXTm541gmAuUVzy2dNUV1pdEhFxlz4BXeeYeo jGYBQjEdWj5XbrStIPREqX24WKBOrDcvNFfwjPKwlnZkZJ6fvOLbUI8FKLDpkTDEUNMO 6FlzwaF3oJyFzGA3/0z4oS4JFUbj3N7iV71FbOEpUHws0tEPVgG4o8zchgYKRpqrQjKW QT8Mz+0my7BwE5ogN05jMNN/JKOgJx2IAwWrhg3+yM2+/RXw3vOlGGh1d/2hL7VjZbtN Vgmg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@cloudflare.com header.s=google header.b=Im3UELGk; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=cloudflare.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id f16-20020a056a001ad000b004f7ba8b4428si3984564pfv.16.2022.03.16.22.30.06 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 16 Mar 2022 22:30:06 -0700 (PDT) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@cloudflare.com header.s=google header.b=Im3UELGk; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=cloudflare.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 501FE23C0EC; Wed, 16 Mar 2022 21:31:29 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1352375AbiCPS1s (ORCPT + 99 others); Wed, 16 Mar 2022 14:27:48 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55660 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236257AbiCPS1r (ORCPT ); Wed, 16 Mar 2022 14:27:47 -0400 Received: from mail-yb1-xb29.google.com (mail-yb1-xb29.google.com [IPv6:2607:f8b0:4864:20::b29]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 40CA86D960 for ; Wed, 16 Mar 2022 11:26:32 -0700 (PDT) Received: by mail-yb1-xb29.google.com with SMTP id v130so5915381ybe.13 for ; Wed, 16 Mar 2022 11:26:32 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cloudflare.com; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=+VVGHpNw2aToH4nlv/qP4fsUY4xFNKKSta7CoxCOLqU=; b=Im3UELGk/iBvvXDdwloE1LayQw78/8xAF5xIkzHUcYNoTmJQWTbW5FnZyqyYEgkf/2 u6yjX7f4cTiiB6Xl/VU6E5cj6YVeOpmIafJOgwchCrZVrCmBGaGiUTzK8ypCnXwxX13n /DVouNRz5uqNvN42UYjat3gLTPyDsihe7EGdA= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=+VVGHpNw2aToH4nlv/qP4fsUY4xFNKKSta7CoxCOLqU=; b=OZ10SiopvRowMmCSfgr75t04L7E626rOda1ynwFkQO/3Iy7MnYxNCwx3ei3+ER6LQ9 83sTCoqOHuMQxJ+6ohThBi0uNTQjzulqzxhK0SWQdWAQPJ8+kl3ab8JBtstTvSEx5ofD xBeBsL5PSr3MoFBoLvfgyUJwDEpOCo4a3SY5tBo72Z1pwF0TJknGFQJH83uKGihBGjwt iB7mHkiaUOQvu70E71S7XO3+k71T2dTFtv7aUurFNjx7UucoDIi6Jfo3WMmpks0sFMRe peabummB2pYxrb1Bi2ADse8A0232o4+80nOsPBcPfBjnXajOYcWpL33gMTPk363Vulf/ MnKA== X-Gm-Message-State: AOAM53026gewOaFp/frHuFLW1FceI0oHfuOdripin7NUylq6ylTHGvLe Wm95N8UiElFj5Y1NbTodWY3cBtycIuYpvM2fsKEmKQ== X-Received: by 2002:a25:918f:0:b0:633:6f7d:6d78 with SMTP id w15-20020a25918f000000b006336f7d6d78mr1343447ybl.134.1647455191504; Wed, 16 Mar 2022 11:26:31 -0700 (PDT) MIME-Version: 1.0 References: In-Reply-To: From: Ivan Babrou Date: Wed, 16 Mar 2022 11:26:20 -0700 Message-ID: Subject: Re: zram corruption due to uninitialized do_swap_page fault To: Minchan Kim Cc: Linux MM , linux-kernel , Andrew Morton , Nitin Gupta , Sergey Senozhatsky , Jens Axboe , linux-block@vger.kernel.org, kernel-team Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Mar 15, 2022 at 3:09 PM Minchan Kim wrote: > I think the problem with CLONE_VM is following race > > CPU A CPU B > > do_swap_page do_swap_page > SWP_SYNCHRONOUS_IO path SWP_SYNCHRONOUS_IO path > swap_readpage original data > swap_slot_free_notify > delete zram entry > swap_readpage zero data > pte_lock > map the *zero data* to userspace > pte_unlock > pte_lock > if (!pte_same) > goto out_nomap; > pte_unlock > return and next refault will > read zero data > > So, CPU A and B see zero data. With patchset below, it changes > > > CPU A CPU B > > do_swap_page do_swap_page > SWP_SYNCHRONOUS_IO path SWP_SYNCHRONOUS_IO path > swap_readpage original data > pte_lock > map the original data > swap_free > swap_range_free > bd_disk->fops->swap_slot_free_notify > swap_readpage read zero data > pte_unlock > pte_lock > if (!pte_same) > goto out_nomap; > pte_unlock > return and next refault will > read correct data again > > Here, CPU A could read zero data from zram but that's not a bug > (IOW, warning injected doesn't mean bug). > > The concern of the patch would increase memory size since it could > increase wasted memory with compressed form in zram and uncompressed > form in address space. However, most of cases of zram uses no > readahead and then, do_swap_page is followed by swap_free so it will > free the compressed from in zram quickly. > > Ivan, with this patch, you can see the warning you added in the zram > but it shouldn't trigger the userspace corruption as mentioned above > if I understand correctly. > > Could you test whether the patch prevent userspace broken? I'm making an internal build and will push it to some location to see how it behaves, but it might take a few days to get any sort of confidence in the results (unless it breaks immediately). I've also pushed my patch that disables SWP_SYNCHRONOUS_IO to a few locations yesterday to see how it fares.