Message-Id: <8a4f07e6ec47b681a32c6df5d463857e67bfc965.1605690824.git.mkubecek@suse.cz>
From: Michal Kubecek
Subject: [PATCH] eventfd: convert to ->write_iter()
To: Alexander Viro
Cc: linux-fsdevel@vger.kernel.org, Jens Axboe, linux-kernel@vger.kernel.org
Date: Wed, 18 Nov 2020 10:19:17 +0100 (CET)
X-Mailing-List: linux-kernel@vger.kernel.org

While the eventfd ->read() callback was recently replaced by ->read_iter(),
eventfd still provides ->write() for writes. Since commit 4d03e3cc5982
("fs: don't allow kernel reads and writes without iter ops"), this prevents
kernel_write() from being used on an eventfd, and with set_fs() removed,
->write() cannot easily be called directly with a kernel buffer.

According to eventfd(2), eventfd descriptors are also meant to be used by
the kernel to notify userspace applications of events. This now requires
the ->write_iter() op to be available (and ->write() not to be). Therefore
convert eventfd_write() to ->write_iter() semantics.

This patch also cleans up the code, much as commit 12aceb89b0bc ("eventfd:
convert to f_op->read_iter()") did for ->read_iter().

Signed-off-by: Michal Kubecek
---
 fs/eventfd.c | 43 +++++++++++++++++++++----------------------
 1 file changed, 21 insertions(+), 22 deletions(-)

diff --git a/fs/eventfd.c b/fs/eventfd.c
index df466ef81ddd..35973d216847 100644
--- a/fs/eventfd.c
+++ b/fs/eventfd.c
@@ -261,35 +261,36 @@ static ssize_t eventfd_read(struct kiocb *iocb, struct iov_iter *to)
 	return sizeof(ucnt);
 }
 
-static ssize_t eventfd_write(struct file *file, const char __user *buf, size_t count,
-			     loff_t *ppos)
+static ssize_t eventfd_write(struct kiocb *iocb, struct iov_iter *from)
 {
+	struct file *file = iocb->ki_filp;
 	struct eventfd_ctx *ctx = file->private_data;
-	ssize_t res;
 	__u64 ucnt;
 	DECLARE_WAITQUEUE(wait, current);
 
-	if (count < sizeof(ucnt))
+	if (iov_iter_count(from) < sizeof(ucnt))
 		return -EINVAL;
-	if (copy_from_user(&ucnt, buf, sizeof(ucnt)))
+	if (unlikely(!copy_from_iter_full(&ucnt, sizeof(ucnt), from)))
 		return -EFAULT;
 	if (ucnt == ULLONG_MAX)
 		return -EINVAL;
 	spin_lock_irq(&ctx->wqh.lock);
-	res = -EAGAIN;
-	if (ULLONG_MAX - ctx->count > ucnt)
-		res = sizeof(ucnt);
-	else if (!(file->f_flags & O_NONBLOCK)) {
+	if (ULLONG_MAX - ctx->count <= ucnt) {
+		if ((file->f_flags & O_NONBLOCK) ||
+		    (iocb->ki_flags & IOCB_NOWAIT)) {
+			spin_unlock_irq(&ctx->wqh.lock);
+			return -EAGAIN;
+		}
 		__add_wait_queue(&ctx->wqh, &wait);
-		for (res = 0;;) {
+		for (;;) {
 			set_current_state(TASK_INTERRUPTIBLE);
-			if (ULLONG_MAX - ctx->count > ucnt) {
-				res = sizeof(ucnt);
+			if (ULLONG_MAX - ctx->count > ucnt)
 				break;
-			}
 			if (signal_pending(current)) {
-				res = -ERESTARTSYS;
-				break;
+				__remove_wait_queue(&ctx->wqh, &wait);
+				__set_current_state(TASK_RUNNING);
+				spin_unlock_irq(&ctx->wqh.lock);
+				return -ERESTARTSYS;
 			}
 			spin_unlock_irq(&ctx->wqh.lock);
 			schedule();
@@ -298,14 +299,12 @@ static ssize_t eventfd_write(struct file *file, const char __user *buf, size_t c
 		__remove_wait_queue(&ctx->wqh, &wait);
 		__set_current_state(TASK_RUNNING);
 	}
-	if (likely(res > 0)) {
-		ctx->count += ucnt;
-		if (waitqueue_active(&ctx->wqh))
-			wake_up_locked_poll(&ctx->wqh, EPOLLIN);
-	}
+	ctx->count += ucnt;
+	if (waitqueue_active(&ctx->wqh))
+		wake_up_locked_poll(&ctx->wqh, EPOLLIN);
 	spin_unlock_irq(&ctx->wqh.lock);
 
-	return res;
+	return sizeof(ucnt);
 }
 
 #ifdef CONFIG_PROC_FS
@@ -328,7 +327,7 @@ static const struct file_operations eventfd_fops = {
 	.release	= eventfd_release,
 	.poll		= eventfd_poll,
 	.read_iter	= eventfd_read,
-	.write		= eventfd_write,
+	.write_iter	= eventfd_write,
 	.llseek		= noop_llseek,
 };
 
-- 
2.29.2
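
For anyone who wants to confirm that the userspace-visible behaviour of
write(2) on an eventfd is unchanged by the conversion, the following is a
minimal sketch of a self-check. It is not part of the patch: try_write()
and eventfd_write_checks() are hypothetical helper names, and the expected
errno values are taken from the checks in eventfd_write() above and from
eventfd(2).

```c
#include <errno.h>
#include <stdint.h>
#include <sys/eventfd.h>
#include <sys/types.h>
#include <unistd.h>

/* Write the first `len` bytes of `val` to `fd`; return 0 or the errno. */
static int try_write(int fd, uint64_t val, size_t len)
{
	return write(fd, &val, len) < 0 ? errno : 0;
}

/*
 * Hypothetical self-check (not part of the patch): returns the number of
 * expectations that failed, or -1 if the eventfd could not be created.
 */
static int eventfd_write_checks(void)
{
	uint64_t cnt = 0;
	int failed = 0;
	int fd = eventfd(0, EFD_NONBLOCK);

	if (fd < 0)
		return -1;

	/* Fewer than 8 bytes: rejected with EINVAL. */
	failed += try_write(fd, 1, 4) != EINVAL;
	/* ULLONG_MAX itself is never a valid increment. */
	failed += try_write(fd, UINT64_MAX, sizeof(cnt)) != EINVAL;
	/* Largest increment that still fits into an empty counter. */
	failed += try_write(fd, UINT64_MAX - 1, sizeof(cnt)) != 0;
	/* Counter is full: a nonblocking writer gets EAGAIN, not a sleep. */
	failed += try_write(fd, 1, sizeof(cnt)) != EAGAIN;
	/* Reading returns the accumulated count and resets it to zero. */
	if (read(fd, &cnt, sizeof(cnt)) != (ssize_t)sizeof(cnt) ||
	    cnt != UINT64_MAX - 1)
		failed++;
	/* With the counter drained, the same write now succeeds. */
	failed += try_write(fd, 1, sizeof(cnt)) != 0;

	close(fd);
	return failed;
}
```

Calling eventfd_write_checks() from a small main() before and after
applying the patch should return 0 in both cases. The blocking path and
the IOCB_NOWAIT case are deliberately left out, since exercising them
needs a second thread or an io_uring submission.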