Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753103AbZKCSwQ (ORCPT ); Tue, 3 Nov 2009 13:52:16 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752401AbZKCSwQ (ORCPT ); Tue, 3 Nov 2009 13:52:16 -0500 Received: from gw1.cosmosbay.com ([212.99.114.194]:57324 "EHLO gw1.cosmosbay.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751523AbZKCSwP (ORCPT ); Tue, 3 Nov 2009 13:52:15 -0500 Message-ID: <4AF07BB7.1020802@gmail.com> Date: Tue, 03 Nov 2009 19:51:35 +0100 From: Eric Dumazet User-Agent: Thunderbird 2.0.0.23 (Windows/20090812) MIME-Version: 1.0 To: Gregory Haskins CC: "Michael S. Tsirkin" , netdev@vger.kernel.org, virtualization@lists.linux-foundation.org, kvm@vger.kernel.org, linux-kernel@vger.kernel.org, mingo@elte.hu, linux-mm@kvack.org, akpm@linux-foundation.org, hpa@zytor.com, Rusty Russell , s.hetze@linux-ag.com, "Paul E. McKenney" Subject: Re: [PATCHv7 3/3] vhost_net: a kernel-level virtio server References: <20091103172422.GD5591@redhat.com> <4AF0708B.4020406@gmail.com> <4AF07199.2020601@gmail.com> <4AF072EE.9020202@gmail.com> In-Reply-To: <4AF072EE.9020202@gmail.com> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8bit X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-1.6 (gw1.cosmosbay.com [0.0.0.0]); Tue, 03 Nov 2009 19:51:46 +0100 (CET) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2377 Lines: 76 Gregory Haskins a ?crit : > Gregory Haskins wrote: >> Eric Dumazet wrote: >>> Michael S. Tsirkin a ?crit : >>>> +static void handle_tx(struct vhost_net *net) >>>> +{ >>>> + struct vhost_virtqueue *vq = &net->dev.vqs[VHOST_NET_VQ_TX]; >>>> + unsigned head, out, in, s; >>>> + struct msghdr msg = { >>>> + .msg_name = NULL, >>>> + .msg_namelen = 0, >>>> + .msg_control = NULL, >>>> + .msg_controllen = 0, >>>> + .msg_iov = vq->iov, >>>> + .msg_flags = MSG_DONTWAIT, >>>> + }; >>>> + size_t len, total_len = 0; >>>> + int err, wmem; >>>> + size_t hdr_size; >>>> + struct socket *sock = rcu_dereference(vq->private_data); >>>> + if (!sock) >>>> + return; >>>> + >>>> + wmem = atomic_read(&sock->sk->sk_wmem_alloc); >>>> + if (wmem >= sock->sk->sk_sndbuf) >>>> + return; >>>> + >>>> + use_mm(net->dev.mm); >>>> + mutex_lock(&vq->mutex); >>>> + vhost_no_notify(vq); >>>> + >>> using rcu_dereference() and mutex_lock() at the same time seems wrong, I suspect >>> that your use of RCU is not correct. >>> >>> 1) rcu_dereference() should be done inside a read_rcu_lock() section, and >>> we are not allowed to sleep in such a section. >>> (Quoting Documentation/RCU/whatisRCU.txt : >>> It is illegal to block while in an RCU read-side critical section, ) >>> >>> 2) mutex_lock() can sleep (ie block) >>> >> >> Michael, >> I warned you that this needed better documentation ;) >> >> Eric, >> I think I flagged this once before, but Michael convinced me that it >> was indeed "ok", if but perhaps a bit unconventional. I will try to >> find the thread. >> >> Kind Regards, >> -Greg >> > > Here it is: > > http://lkml.org/lkml/2009/8/12/173 > Yes, this doesnt convince me at all, and could be a precedent for a wrong RCU use. People wanting to use RCU do a grep on kernel sources to find how to correctly use RCU. Michael, please use existing locking/barrier mechanisms, and not pretend to use RCU. Some automatic tools might barf later. For example, we could add a debugging facility to check that rcu_dereference() is used in an appropriate context, ie conflict with existing mutex_lock() debugging facility. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/