Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id B1537C636CC for ; Sat, 11 Feb 2023 15:06:41 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229581AbjBKPGk (ORCPT ); Sat, 11 Feb 2023 10:06:40 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:54782 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229460AbjBKPGj (ORCPT ); Sat, 11 Feb 2023 10:06:39 -0500 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 7EC7926848 for ; Sat, 11 Feb 2023 07:05:53 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1676127952; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=RAA6MLVbCYRbaNJcYcJSydJa+JkBz3cL6cg9ack6zDU=; b=WTS2YZECw0bYCs0fanCIVI8doOyJwUKvRXpdLmZYBc5f1NJbe7jM6aUusvi/9tv0ugiQau nqoCjo7xaEqz0NsxUayq9774O3g2Ncx2PNJT8p3KJBm/cEKker3bJYEDiC/wEgOSmAQHSx bbr9vbdqniuNje+YqTtY/0f/v4ZQ8ds= Received: from mimecast-mx02.redhat.com (mimecast-mx02.redhat.com [66.187.233.88]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-497-Du0Nyxm1PCWBRLvFA9Z8fg-1; Sat, 11 Feb 2023 10:05:49 -0500 X-MC-Unique: Du0Nyxm1PCWBRLvFA9Z8fg-1 Received: from smtp.corp.redhat.com (int-mx08.intmail.prod.int.rdu2.redhat.com [10.11.54.8]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx02.redhat.com (Postfix) with ESMTPS id D6791101A521; Sat, 11 Feb 2023 15:05:48 +0000 (UTC) Received: from T590 (ovpn-8-18.pek2.redhat.com [10.72.8.18]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 94B83C16022; Sat, 11 Feb 2023 15:05:40 +0000 (UTC) Date: Sat, 11 Feb 2023 23:05:35 +0800 From: Ming Lei To: Jens Axboe Cc: Linus Torvalds , Andy Lutomirski , Dave Chinner , Matthew Wilcox , Stefan Metzmacher , linux-fsdevel , Linux API Mailing List , io-uring , "linux-kernel@vger.kernel.org" , Al Viro , Samba Technical , ming.lei@redhat.com Subject: Re: copy on write for splice() from file to pipe? Message-ID: References: <7a2e5b7f-c213-09ff-ef35-d6c2967b31a7@kernel.dk> <2bb12591-9d24-6b26-178f-05e939bf3251@kernel.dk> <55eaac9e-0d77-1fa2-df27-4d64e123177e@kernel.dk> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <55eaac9e-0d77-1fa2-df27-4d64e123177e@kernel.dk> X-Scanned-By: MIMEDefang 3.1 on 10.11.54.8 Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, Feb 11, 2023 at 07:13:44AM -0700, Jens Axboe wrote: > On 2/10/23 8:18?PM, Ming Lei wrote: > > On Fri, Feb 10, 2023 at 02:08:35PM -0800, Linus Torvalds wrote: > >> On Fri, Feb 10, 2023 at 1:51 PM Jens Axboe wrote: > >>> > >>> Speaking of splice/io_uring, Ming posted this today: > >>> > >>> https://lore.kernel.org/io-uring/20230210153212.733006-1-ming.lei@redhat.com/ > >> > >> Ugh. Some of that is really ugly. Both 'ignore_sig' and > >> 'ack_page_consuming' just look wrong. Pure random special cases. > >> > >> And that 'ignore_sig' is particularly ugly, since the only thing that > >> sets it also sets SPLICE_F_NONBLOCK. > >> > >> And the *only* thing that actually then checks that field is > >> 'splice_from_pipe_next()', where there are exactly two > >> signal_pending() checks that it adds to, and > >> > >> (a) the first one is to protect from endless loops > >> > >> (b) the second one is irrelevant when SPLICE_F_NONBLOCK is set > >> > >> So honestly, just NAK on that series. > >> > >> I think that instead of 'ignore_sig' (which shouldn't exist), that > >> first 'signal_pending()' check in splice_from_pipe_next() should just > >> be changed into a 'fatal_signal_pending()'. > > > > Good point, here the signal is often from task_work_add() called by > > io_uring. > > Usually you'd use task_sigpending() to distinguis the two, but > fatal_signal_pending() as Linus suggests would also work. The only > concern here is that since you'll be potentially blocking on waiting for > the pipe to be readable - if task does indeed have task_work pending and > that very task_work is the one that will ensure that the pipe is now > readable, then you're waiting condition will never be satisfied. The 2nd signal_pending() will break the loop to get task_work handled, so it is safe to only change the 1st one to fatal_signal_pending(). Thanks, Ming