Received: by 2002:ab2:7903:0:b0:1fb:b500:807b with SMTP id a3csp1084441lqj; Mon, 3 Jun 2024 09:37:59 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCUPKrqyrEvNqEmgz6IdEsSe3J4p7Lcl+6GrXQNxOtZqdI9QSw7VY8kOeE4eU4BO6iIck8uCExVJr+PuUV75D37fAHyq/qimyVwngOQRcQ== X-Google-Smtp-Source: AGHT+IEguEV44/lGgUL++3mKutWVmX47gUuVpkOXQApUcARcgkq0yvIaSJWumVM/KKDYfGOpK+Uj X-Received: by 2002:a05:6870:9a11:b0:24d:e85b:c9fd with SMTP id 586e51a60fabf-2508b9a36d6mr10655398fac.1.1717432679288; Mon, 03 Jun 2024 09:37:59 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1717432679; cv=pass; d=google.com; s=arc-20160816; b=V32ukv77jBnfmTVVKSwXZBkhG5WpfU2TfR8RMg4dLDess/sIQ/6H+nMVytT49333lG ExNqKy/xBskPhSM2pmP/VgHM0ENN2tqhEdPkTIQUvb2jk+OJsBKTm4vcHG0MbflCic6T 4dc9THdYAtgygpTae6XFY2nKQjIzq69I6FHPCyh9juv1AIdJUDF9gEul//M80HMRn6g1 9bSSBRJpfgajuMV8vJtkDWOTEOfMB8zA8t+QYjAqTEFdbM8BuRA2NyEdF1KMP+EAGDcH 7klNbdKY0yUGvrz//GgzyFMNCM89SMu0iZVnEoprcAE0Df/zz7wBc6Tm61H7XOjie8mr XRpw== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:autocrypt:content-language :from:references:cc:to:subject:user-agent:mime-version :list-unsubscribe:list-subscribe:list-id:precedence:date:message-id :dkim-signature; bh=8evk6BQflJEpzGYc9ly4En0otPd6lyV9fGYkC70B9Bo=; fh=fmDBhK1lTcM/pWrdYsZ1A9Tk7aq1No5kz5x3+3yzTAw=; b=uwAmjL13oL8D+fRrr2zOggsC89yKiJ1ZvtbB3+dfbQP+SJi6u+Yb3miTjOCrSeX6tk nLWIF4Wl9lQY+OyvThYbT8c0zUUrJO4KZLI0FPM2n+qjM52+zAokRjepOGoOn9TjOMXJ y3SYJFzseUJ7ip6kHysqqjD4nciD/fl4M7C2ySV+uOlvzsU0jpLlTS8Z+9yCtf6lISDW vJlWUWOl0RYNS3WJYHio5cGgL1Y1jL/sGyjgjuZfnFCC+n45eAwU0gbpDEl87WmhiGup uM3K78PapwVM28+nY1G1xg+n9QPHAqK9XP91+O+91Uq0qrFEnk23X8WZNnx1Vd+NJD9y PG0w==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=RFuhLKfu; arc=pass (i=1 spf=pass spfdomain=intel.com dkim=pass dkdomain=intel.com dmarc=pass fromdomain=intel.com); spf=pass (google.com: domain of linux-kernel+bounces-199453-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.199.223 as permitted sender) smtp.mailfrom="linux-kernel+bounces-199453-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from ny.mirrors.kernel.org (ny.mirrors.kernel.org. [147.75.199.223]) by mx.google.com with ESMTPS id af79cd13be357-794f2f04f65si8590585a.43.2024.06.03.09.37.59 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 03 Jun 2024 09:37:59 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-199453-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.199.223 as permitted sender) client-ip=147.75.199.223; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=RFuhLKfu; arc=pass (i=1 spf=pass spfdomain=intel.com dkim=pass dkdomain=intel.com dmarc=pass fromdomain=intel.com); spf=pass (google.com: domain of linux-kernel+bounces-199453-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.199.223 as permitted sender) smtp.mailfrom="linux-kernel+bounces-199453-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ny.mirrors.kernel.org (Postfix) with ESMTPS id 099351C21A5D for ; Mon, 3 Jun 2024 16:37:57 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id A15F81369A0; Mon, 3 Jun 2024 16:37:51 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=intel.com header.i=@intel.com header.b="RFuhLKfu" Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.12]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id D9C8A13541B for ; Mon, 3 Jun 2024 16:37:48 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=198.175.65.12 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1717432670; cv=none; b=oKgRX0xjIo9hMf6Ip6pLsWNbfDhvJpg16yr+i8qvZ/qtI1wJyGzZvDK/oZi9J9MEyjbPSF6EpQJWCoO9Kmf3nIBfla9Vba1cFFK03wvBKlAU4QdR/haGXcLgO8xbguP15ZW1ja/938yaGxR3FlRgPvHWssAL10DoSx7WXY8tAYY= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1717432670; c=relaxed/simple; bh=jfmJNoqS/aMtSuUlQYpT7Wgh9qVxuLiRembjxum2Y7c=; h=Message-ID:Date:MIME-Version:Subject:To:Cc:References:From: In-Reply-To:Content-Type; b=lrjUghvz403lZSvGgzw8toVAZ5L1PrR+Z4gp/huvIWJKWopb4u/ixGB6PvNHpTiNVEMYcl8eWjpvHEHGrgKh2Fmn9iyy01XyGB9sbPQsEtbTJgufnZiHE9TjfetWcU/CFatoCEO782p6x5zdmlfTSjl2nHTkN4rlBM3qdcNxJEo= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=intel.com; spf=pass smtp.mailfrom=intel.com; dkim=pass (2048-bit key) header.d=intel.com header.i=@intel.com header.b=RFuhLKfu; arc=none smtp.client-ip=198.175.65.12 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=intel.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=intel.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1717432669; x=1748968669; h=message-id:date:mime-version:subject:to:cc:references: from:in-reply-to:content-transfer-encoding; bh=jfmJNoqS/aMtSuUlQYpT7Wgh9qVxuLiRembjxum2Y7c=; b=RFuhLKfuJtMfF60OBadCAEWaQfCiWPibqt3tEMgeZ/tDzWR6ap+Y2ZQz EoUHR970iSzAUPeKXYuGQWc8vCB7Wp5tSgs3lfDwx+VvhHHtv70XePQfn VHA0VGrgTzJvyrr9yVzsZnin5+TNL6ugZO/sm1scDcVIwegOBLcL8ngqT Vvg68JnfTZsSYyFU4zesY6oDxpGl0T+cfzNA7GpOoFBlnzvaO7tiJtvY1 RRdbYxMWJMUltGnRJwLGdmEINvbDBg7eNW6w3AotLAIKa1YLQnv3VtWJL La+NjU9v6gErz9EKM1GuEmF9QLPF/SybzR4uaa+kiIbUX68YZWRTSvGxr g==; X-CSE-ConnectionGUID: 4FkBsTHSQPKuMXAQwsBNxA== X-CSE-MsgGUID: 9LNe9g2lTay4WAhC520M7Q== X-IronPort-AV: E=McAfee;i="6600,9927,11092"; a="25335724" X-IronPort-AV: E=Sophos;i="6.08,212,1712646000"; d="scan'208";a="25335724" Received: from fmviesa007.fm.intel.com ([10.60.135.147]) by orvoesa104.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 03 Jun 2024 09:37:48 -0700 X-CSE-ConnectionGUID: 6a/WILzaTtS/A6w4ImaBaw== X-CSE-MsgGUID: lsSP7SbuRiak3ysU/Y908g== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.08,212,1712646000"; d="scan'208";a="36827281" Received: from ibganev-desk.amr.corp.intel.com (HELO [10.125.108.143]) ([10.125.108.143]) by fmviesa007-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 03 Jun 2024 09:37:47 -0700 Message-ID: <196481bb-b86d-4959-b69b-21fda4daae77@intel.com> Date: Mon, 3 Jun 2024 09:37:46 -0700 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v11 09/12] mm: implement LUF(Lazy Unmap Flush) defering tlb flush when folios get unmapped To: David Hildenbrand , Byungchul Park Cc: Byungchul Park , linux-kernel@vger.kernel.org, linux-mm@kvack.org, kernel_team@skhynix.com, akpm@linux-foundation.org, ying.huang@intel.com, vernhao@tencent.com, mgorman@techsingularity.net, hughd@google.com, willy@infradead.org, peterz@infradead.org, luto@kernel.org, tglx@linutronix.de, mingo@redhat.com, bp@alien8.de, dave.hansen@linux.intel.com, rjgolo@gmail.com References: <20240531092001.30428-1-byungchul@sk.com> <20240531092001.30428-10-byungchul@sk.com> <26dc4594-430b-483c-a26c-7e68bade74b0@redhat.com> <20240603093505.GA12549@system.software.com> <35866f91-7d96-462a-aa0a-ac8a6b8cbcf8@redhat.com> From: Dave Hansen Content-Language: en-US Autocrypt: addr=dave.hansen@intel.com; keydata= xsFNBE6HMP0BEADIMA3XYkQfF3dwHlj58Yjsc4E5y5G67cfbt8dvaUq2fx1lR0K9h1bOI6fC oAiUXvGAOxPDsB/P6UEOISPpLl5IuYsSwAeZGkdQ5g6m1xq7AlDJQZddhr/1DC/nMVa/2BoY 2UnKuZuSBu7lgOE193+7Uks3416N2hTkyKUSNkduyoZ9F5twiBhxPJwPtn/wnch6n5RsoXsb ygOEDxLEsSk/7eyFycjE+btUtAWZtx+HseyaGfqkZK0Z9bT1lsaHecmB203xShwCPT49Blxz VOab8668QpaEOdLGhtvrVYVK7x4skyT3nGWcgDCl5/Vp3TWA4K+IofwvXzX2ON/Mj7aQwf5W iC+3nWC7q0uxKwwsddJ0Nu+dpA/UORQWa1NiAftEoSpk5+nUUi0WE+5DRm0H+TXKBWMGNCFn c6+EKg5zQaa8KqymHcOrSXNPmzJuXvDQ8uj2J8XuzCZfK4uy1+YdIr0yyEMI7mdh4KX50LO1 pmowEqDh7dLShTOif/7UtQYrzYq9cPnjU2ZW4qd5Qz2joSGTG9eCXLz5PRe5SqHxv6ljk8mb ApNuY7bOXO/A7T2j5RwXIlcmssqIjBcxsRRoIbpCwWWGjkYjzYCjgsNFL6rt4OL11OUF37wL QcTl7fbCGv53KfKPdYD5hcbguLKi/aCccJK18ZwNjFhqr4MliQARAQABzUVEYXZpZCBDaHJp c3RvcGhlciBIYW5zZW4gKEludGVsIFdvcmsgQWRkcmVzcykgPGRhdmUuaGFuc2VuQGludGVs LmNvbT7CwXgEEwECACIFAlQ+9J0CGwMGCwkIBwMCBhUIAgkKCwQWAgMBAh4BAheAAAoJEGg1 lTBwyZKwLZUP/0dnbhDc229u2u6WtK1s1cSd9WsflGXGagkR6liJ4um3XCfYWDHvIdkHYC1t MNcVHFBwmQkawxsYvgO8kXT3SaFZe4ISfB4K4CL2qp4JO+nJdlFUbZI7cz/Td9z8nHjMcWYF IQuTsWOLs/LBMTs+ANumibtw6UkiGVD3dfHJAOPNApjVr+M0P/lVmTeP8w0uVcd2syiaU5jB aht9CYATn+ytFGWZnBEEQFnqcibIaOrmoBLu2b3fKJEd8Jp7NHDSIdrvrMjYynmc6sZKUqH2 I1qOevaa8jUg7wlLJAWGfIqnu85kkqrVOkbNbk4TPub7VOqA6qG5GCNEIv6ZY7HLYd/vAkVY E8Plzq/NwLAuOWxvGrOl7OPuwVeR4hBDfcrNb990MFPpjGgACzAZyjdmYoMu8j3/MAEW4P0z F5+EYJAOZ+z212y1pchNNauehORXgjrNKsZwxwKpPY9qb84E3O9KYpwfATsqOoQ6tTgr+1BR CCwP712H+E9U5HJ0iibN/CDZFVPL1bRerHziuwuQuvE0qWg0+0SChFe9oq0KAwEkVs6ZDMB2 P16MieEEQ6StQRlvy2YBv80L1TMl3T90Bo1UUn6ARXEpcbFE0/aORH/jEXcRteb+vuik5UGY 5TsyLYdPur3TXm7XDBdmmyQVJjnJKYK9AQxj95KlXLVO38lczsFNBFRjzmoBEACyAxbvUEhd GDGNg0JhDdezyTdN8C9BFsdxyTLnSH31NRiyp1QtuxvcqGZjb2trDVuCbIzRrgMZLVgo3upr MIOx1CXEgmn23Zhh0EpdVHM8IKx9Z7V0r+rrpRWFE8/wQZngKYVi49PGoZj50ZEifEJ5qn/H Nsp2+Y+bTUjDdgWMATg9DiFMyv8fvoqgNsNyrrZTnSgoLzdxr89FGHZCoSoAK8gfgFHuO54B lI8QOfPDG9WDPJ66HCodjTlBEr/Cwq6GruxS5i2Y33YVqxvFvDa1tUtl+iJ2SWKS9kCai2DR 3BwVONJEYSDQaven/EHMlY1q8Vln3lGPsS11vSUK3QcNJjmrgYxH5KsVsf6PNRj9mp8Z1kIG qjRx08+nnyStWC0gZH6NrYyS9rpqH3j+hA2WcI7De51L4Rv9pFwzp161mvtc6eC/GxaiUGuH BNAVP0PY0fqvIC68p3rLIAW3f97uv4ce2RSQ7LbsPsimOeCo/5vgS6YQsj83E+AipPr09Caj 0hloj+hFoqiticNpmsxdWKoOsV0PftcQvBCCYuhKbZV9s5hjt9qn8CE86A5g5KqDf83Fxqm/ vXKgHNFHE5zgXGZnrmaf6resQzbvJHO0Fb0CcIohzrpPaL3YepcLDoCCgElGMGQjdCcSQ+Ci FCRl0Bvyj1YZUql+ZkptgGjikQARAQABwsFfBBgBAgAJBQJUY85qAhsMAAoJEGg1lTBwyZKw l4IQAIKHs/9po4spZDFyfDjunimEhVHqlUt7ggR1Hsl/tkvTSze8pI1P6dGp2XW6AnH1iayn yRcoyT0ZJ+Zmm4xAH1zqKjWplzqdb/dO28qk0bPso8+1oPO8oDhLm1+tY+cOvufXkBTm+whm +AyNTjaCRt6aSMnA/QHVGSJ8grrTJCoACVNhnXg/R0g90g8iV8Q+IBZyDkG0tBThaDdw1B2l asInUTeb9EiVfL/Zjdg5VWiF9LL7iS+9hTeVdR09vThQ/DhVbCNxVk+DtyBHsjOKifrVsYep WpRGBIAu3bK8eXtyvrw1igWTNs2wazJ71+0z2jMzbclKAyRHKU9JdN6Hkkgr2nPb561yjcB8 sIq1pFXKyO+nKy6SZYxOvHxCcjk2fkw6UmPU6/j/nQlj2lfOAgNVKuDLothIxzi8pndB8Jju KktE5HJqUUMXePkAYIxEQ0mMc8Po7tuXdejgPMwgP7x65xtfEqI0RuzbUioFltsp1jUaRwQZ MTsCeQDdjpgHsj+P2ZDeEKCbma4m6Ez/YWs4+zDm1X8uZDkZcfQlD9NldbKDJEXLIjYWo1PH hYepSffIWPyvBMBTW2W5FRjJ4vLRrJSUoEfJuPQ3vW9Y73foyo/qFoURHO48AinGPZ7PC7TF vUaNOTjKedrqHkaOcqB185ahG2had0xnFsDPlx5y In-Reply-To: <35866f91-7d96-462a-aa0a-ac8a6b8cbcf8@redhat.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit On 6/3/24 09:05, David Hildenbrand wrote: ... >>    2. A LUF'd page's position in the page cache can't be replaced until >>       after a luf_flush() > > That's the most tricky bit. I think these are the VFS concerns like > > 1) Page migration/reclaim ends up freeing the old page. TLB not flushed. > 2) write() to the new page / write from other process to the new page > 3) CPU reads stale content from old page > > PTE markers can't handle that. Yeah, we'd need some equivalent of a PTE marker, but for the page cache. Presumably some xa_value() that means a reader has to go do a luf_flush() before going any farther. That would actually have a chance at fixing two issues: One where a new page cache insertion is attempted. The other where someone goes to look in the page cache and takes some action _because_ it is empty (I think NFS is doing some of this for file locks). LUF is also pretty fundamentally built on the idea that files can't change without LUF being aware. That model seems to work decently for normal old filesystems on normal old local block devices. I'm worried about NFS, and I don't know how seriously folks take FUSE, but it obviously can't work well for FUSE.