Date: Thu, 23 Jan 2020 19:02:43 -0500
From: Rich Felker
To: linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, linux-api@vger.kernel.org
Cc: Alexander Viro
Subject: Proposal to fix pwrite with O_APPEND via pwritev2 flag
Message-ID: <20200124000243.GA12112@brightrain.aerifal.cx>

There's a longstanding unfixable (due to API stability) bug in the
pwrite syscall:

http://man7.org/linux/man-pages/man2/pwrite.2.html#BUGS

whereby it wrongly honors O_APPEND if set, ignoring the caller-passed
offset.
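
For anyone who wants to see the behavior, here is a minimal sketch of a
reproducer. The helper name pwrite_append_demo and the scratch path
under /tmp are made up for illustration; only pwrite, fcntl and
O_APPEND come from the discussion itself.

```c
#include <fcntl.h>
#include <stdlib.h>
#include <sys/stat.h>
#include <unistd.h>

/*
 * Illustrative helper (hypothetical name): write "AAAA" to a scratch
 * file, switch the descriptor to O_APPEND, then pwrite "BB" at offset 0.
 * Returns the resulting file size, or -1 if any step fails.
 */
long pwrite_append_demo(void)
{
	char path[] = "/tmp/pwrite-append-XXXXXX";
	struct stat st;
	long size = -1;
	int fd = mkstemp(path);

	if (fd < 0)
		return -1;
	unlink(path);	/* the file stays alive until fd is closed */

	if (write(fd, "AAAA", 4) == 4 &&
	    fcntl(fd, F_SETFL, fcntl(fd, F_GETFL) | O_APPEND) == 0 &&
	    pwrite(fd, "BB", 2, 0) == 2 &&
	    fstat(fd, &st) == 0)
		size = (long)st.st_size;

	close(fd);
	return size;
}
```

On Linux this is expected to return 6, since the two bytes get appended
("AAAABB"). If the offset were honored as POSIX specifies, the file
would stay at 4 bytes ("BBAA").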

Now that there's a pwritev2 syscall that takes a flags argument, it's
possible to fix this without breaking stability by adding a new
RWF_NOAPPEND flag, which callers that want the fixed behavior can then
pass.

I have a completely untested patch to add such a flag, but would like
to get a feel for whether the concept is acceptable before putting
time into testing it. If so, I'll submit this as a proper patch with
detailed commit message etc. Draft is below.

Rich


diff --git a/include/linux/fs.h b/include/linux/fs.h
index e0d909d35763..3a769a972f79 100644
--- a/include/linux/fs.h
+++ b/include/linux/fs.h
@@ -3397,6 +3397,8 @@ static inline int kiocb_set_rw_flags(struct kiocb *ki, rwf_t flags)
 {
 	if (unlikely(flags & ~RWF_SUPPORTED))
 		return -EOPNOTSUPP;
+	if (unlikely((flags & RWF_APPEND) && (flags & RWF_NOAPPEND)))
+		return -EINVAL;
 
 	if (flags & RWF_NOWAIT) {
 		if (!(ki->ki_filp->f_mode & FMODE_NOWAIT))
@@ -3411,6 +3413,8 @@ static inline int kiocb_set_rw_flags(struct kiocb *ki, rwf_t flags)
 		ki->ki_flags |= (IOCB_DSYNC | IOCB_SYNC);
 	if (flags & RWF_APPEND)
 		ki->ki_flags |= IOCB_APPEND;
+	if (flags & RWF_NOAPPEND)
+		ki->ki_flags &= ~IOCB_APPEND;
 	return 0;
 }
 
diff --git a/include/uapi/linux/fs.h b/include/uapi/linux/fs.h
index 379a612f8f1d..591357d9b3c9 100644
--- a/include/uapi/linux/fs.h
+++ b/include/uapi/linux/fs.h
@@ -299,8 +299,11 @@ typedef int __bitwise __kernel_rwf_t;
 /* per-IO O_APPEND */
 #define RWF_APPEND	((__force __kernel_rwf_t)0x00000010)
 
+/* per-IO negation of O_APPEND */
+#define RWF_NOAPPEND	((__force __kernel_rwf_t)0x00000020)
+
 /* mask of flags supported by the kernel */
 #define RWF_SUPPORTED	(RWF_HIPRI | RWF_DSYNC | RWF_SYNC | RWF_NOWAIT |\
-			 RWF_APPEND)
+			 RWF_APPEND | RWF_NOAPPEND)
 
 #endif /* _UAPI_LINUX_FS_H */
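
To give an idea of the intended userspace side, here is a rough sketch
of how a caller might use the flag. It assumes the value 0x00000020
from the draft above; the helper names pwrite_noappend and
noappend_probe are hypothetical. It goes through the raw syscall so it
does not depend on the libc providing a pwritev2 wrapper. On a kernel
without the flag, the existing RWF_SUPPORTED check should reject the
call with EOPNOTSUPP, so callers can detect that and fall back.

```c
#include <errno.h>
#include <fcntl.h>
#include <stdlib.h>
#include <sys/stat.h>
#include <sys/syscall.h>
#include <sys/types.h>
#include <sys/uio.h>
#include <unistd.h>

/* Assumed value, taken from the draft patch; not necessarily in headers. */
#ifndef RWF_NOAPPEND
#define RWF_NOAPPEND 0x00000020
#endif

/*
 * Hypothetical helper: positional write that asks the kernel to ignore
 * O_APPEND. Returns bytes written, or -1 with errno set.
 */
ssize_t pwrite_noappend(int fd, const void *buf, size_t len, off_t off)
{
	struct iovec iov = { .iov_base = (void *)buf, .iov_len = len };
	/* The raw syscall takes the offset as two longs: low half, high half. */
	unsigned long long pos = (unsigned long long)off;
	unsigned long lo = (unsigned long)pos;
	unsigned long hi =
		(unsigned long)((pos >> (sizeof(long) * 4)) >> (sizeof(long) * 4));

	return syscall(SYS_pwritev2, fd, &iov, 1, lo, hi, RWF_NOAPPEND);
}

/*
 * Hypothetical probe: returns 0 if the write landed at offset 0 despite
 * O_APPEND, 1 if the kernel rejected the flag or the syscall, -1 otherwise.
 */
int noappend_probe(void)
{
	char path[] = "/tmp/noappend-XXXXXX";
	struct stat st;
	int result = -1;
	int fd = mkstemp(path);

	if (fd < 0)
		return -1;
	unlink(path);	/* the file stays alive until fd is closed */

	if (write(fd, "AAAA", 4) == 4 &&
	    fcntl(fd, F_SETFL, fcntl(fd, F_GETFL) | O_APPEND) == 0) {
		ssize_t n = pwrite_noappend(fd, "BB", 2, 0);

		if (n == 2 && fstat(fd, &st) == 0)
			result = (st.st_size == 4) ? 0 : -1;	/* size unchanged: overwrote in place */
		else if (n < 0 && (errno == EOPNOTSUPP || errno == ENOSYS))
			result = 1;	/* flag or syscall not supported here */
	}

	close(fd);
	return result;
}
```

A caller that gets the "not supported" result has no way to do a true
positional write on an O_APPEND descriptor and would have to clear
O_APPEND with fcntl or reopen the file instead.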