Received: by 2002:a05:6a10:c604:0:0:0:0 with SMTP id y4csp1191065pxt; Sat, 7 Aug 2021 03:32:55 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwC0L67WV7nu0xzUKvnoaSLcjXwjWwsay3anJFgfRLY7jVg1o5AJX0NUDEvpGhdo/07XqMg X-Received: by 2002:aa7:ccc1:: with SMTP id y1mr17998455edt.321.1628332375605; Sat, 07 Aug 2021 03:32:55 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1628332375; cv=none; d=google.com; s=arc-20160816; b=Ng0070YvpBJ1DgE+pJLP1VWoIYEJOBl4NTbSya0A4q0VbaZ2r0fTIlMsiH0ptnYNYz s8ErzlltAhkRtYSKKHGF4PEJoWpSQ6t17iAfrueAiO4Q8+4l5wWRFBLc0JkGL5z3AkcV qzEP/JlEmLwWEccNApek3lutlHTGyOHA8mWjT6uwg2oAvZz8K63eLD0j9ke2gYF9JnTA 7XE1TeEgJPQKmRC5sixlBUUtnOKQr+bfUfMm2vcgxABDsHFZm4VY0gVrqC3lbIDby0ip wXlmes1AHrjc6zvw8IUd3LrC1Kwu92Z0P3zrNr2JxA/jrza2o6l/fqpmhRjqXDulMmOQ zLaA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:subject:from :references:cc:to:dkim-signature; bh=Xz2POERoj03wWLZEE/kDhO0wJ4yd+zaF3oDDbK7OnvA=; b=ufxw1jTRXX1aC6UHZnCCJcZcOlLWShnJ5RZKduP4VBolc6RHNmrKDAhxsD1wbIaScN M1Wzb/+wBxKMG2svMrBe9dQSxA4pHQISxPm9sX6NFy9JD6I4nkxoQv1ZHiHHEHVTIavc lTVrxJtwnERp8rZNlL+91ExNcmSHO/6cRTR8S/2UySBB5EtjMREcNoy9daccwWGIobDJ s9S5U1vVLlHHHltyPwRxtR/iqiqrtE71WbNgIph7jYoY6GALCG7zTPBwskK6l/T8cE7e EAD9TKCsLPywwMys4+U1AxtksyLnVah7t/SKCClom4FeZt/AmIMe43SW5cZIxvX0kTrk xAvQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=H2XHvma4; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id e9si10172991edz.204.2021.08.07.03.32.32; Sat, 07 Aug 2021 03:32:55 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=H2XHvma4; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231839AbhHGKbh (ORCPT + 99 others); Sat, 7 Aug 2021 06:31:37 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:40992 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231687AbhHGKbg (ORCPT ); Sat, 7 Aug 2021 06:31:36 -0400 Received: from mail-wr1-x429.google.com (mail-wr1-x429.google.com [IPv6:2a00:1450:4864:20::429]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 41C9FC0613CF; Sat, 7 Aug 2021 03:31:18 -0700 (PDT) Received: by mail-wr1-x429.google.com with SMTP id n12so4366957wrr.2; Sat, 07 Aug 2021 03:31:18 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=to:cc:references:from:subject:message-id:date:user-agent :mime-version:in-reply-to:content-language:content-transfer-encoding; bh=Xz2POERoj03wWLZEE/kDhO0wJ4yd+zaF3oDDbK7OnvA=; b=H2XHvma4LuuSVfBN8ILh6Jp9Pdbm1fXiqibs3HZAbEsSl2t9PmxV++zAVQc3H20S1J o7IRX3aQFf4Hu51bxL2+rQoDOmUElyxC+/cv9kq6OOirBn306UgSQNktK80MTV/7ewUD flytjI+yKhudqqDhG2ZblB/OFBenWkx8apuISOGUkeLkgSL9Dk+ZBZ57FLYTNxlkdHQ/ wCoYiIZCpQj7SN/bW7GPLxUa5EaB5nnfsqBeo3DxPgM8nut6+YKOzpcH1Yy95NBXBwRa r9NXjx4m/j24gIPdtchs4FlDitXBdMrojfpcAUPa0E8SECt6Y5zx3h872RH2NCDAHhdk zDDw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:to:cc:references:from:subject:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=Xz2POERoj03wWLZEE/kDhO0wJ4yd+zaF3oDDbK7OnvA=; b=p5CE2Jzdhcw5ZaTEdsdxaVB4yeDgSj0+1D81NFNFe8cgvCbf48UIv0F+sVg+Mnuusi /BV5rVEh3ib5rlCMsj8iLum+ivOjuNQL7JxO2tAfmcqt7fbQQTjzBD2TsHItN1hW2oHd 2tSofvG9gfOi8rm/r7vNaQzd+JvQosw4b6pN1ThpgBLHOBvDfC2FzDEs+7DPrNpHW459 9muJue30eOwg4WalpCPAZnooTuON6GeQ9OGeE+/AhmNTcI2jBOB6Q7mBAF2g8s7tD3gQ nx9ZCAqKp93/D7VUMmtLwETkaUdc3wOloy1OhRYLuLZmTdrjFxxW4twrfsSAp+wMuFo+ L8Cg== X-Gm-Message-State: AOAM5322jGOM5+lU3T3aEE1LA2743uyUdudjTI+O0QDEE0RzBA16Egjv daB8ywfT40rcN9IPePCi/4kQzpVA45Y= X-Received: by 2002:adf:f383:: with SMTP id m3mr14747988wro.81.1628332276733; Sat, 07 Aug 2021 03:31:16 -0700 (PDT) Received: from [192.168.8.197] ([85.255.237.206]) by smtp.gmail.com with ESMTPSA id x15sm15023813wmc.13.2021.08.07.03.31.15 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Sat, 07 Aug 2021 03:31:16 -0700 (PDT) To: Al Viro Cc: Andrew Morton , linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, Jens Axboe , linux-kernel@vger.kernel.org References: <07bd408d6cad95166b776911823b40044160b434.1628248975.git.asml.silence@gmail.com> From: Pavel Begunkov Subject: Re: [RFC] mm: optimise generic_file_read_iter Message-ID: Date: Sat, 7 Aug 2021 11:30:48 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.12.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 8/6/21 2:48 PM, Al Viro wrote: > On Fri, Aug 06, 2021 at 12:42:43PM +0100, Pavel Begunkov wrote: >> Unless direct I/O path of generic_file_read_iter() ended up with an >> error or a short read, it doesn't use inode. So, load inode and size >> later, only when they're needed. This cuts two memory reads and also >> imrpoves code generation, e.g. loads from stack. > > ... and the same question here. > >> NOTE: as a side effect, it reads inode->i_size after ->direct_IO(), and >> I'm not sure whether that's valid, so would be great to get feedback >> from someone who knows better. > > Ought to be safe, I think, but again, how much effect have you observed > from the patch? Answering for both patches -- I haven't benchmarked it and don't expect to find anything just from this one, considering variance between runs. I took a loot at the assembly (gcc 11.1), it removes 2 reads to get i_size, write+read that i_size from stack, because it stashed it on the stack. For example, we've squeezed several percents of throughput before on the io_uring side just by cutting sheer number of not too expensive individually instructions. IMHO, it's easier to do when you spotted something by the way, than rediscovering the same during a performance safari. -- Pavel Begunkov