Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752275AbaJKWZX (ORCPT ); Sat, 11 Oct 2014 18:25:23 -0400 Received: from casper.infradead.org ([85.118.1.10]:54229 "EHLO casper.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752154AbaJKWZS (ORCPT ); Sat, 11 Oct 2014 18:25:18 -0400 Date: Sat, 11 Oct 2014 23:25:14 +0100 From: Thomas Graf To: Eric Dumazet Cc: Heiko Carstens , Sasha Levin , paulmck@linux.vnet.ibm.com, Nikolay Aleksandrov , "David S. Miller" , netdev@vger.kernel.org, linux-kernel@vger.kernel.org, Ursula Braun Subject: Re: [bisected] e341694e3eb5 netlink_lookup() rcu conversion causes latencies Message-ID: <20141011222514.GA14186@casper.infradead.org> References: <20141011083627.GB5074@osiris> <1413055964.9362.50.camel@edumazet-glaptop2.roam.corp.google.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1413055964.9362.50.camel@edumazet-glaptop2.roam.corp.google.com> User-Agent: Mutt/1.5.23 (2014-03-12) X-SRS-Rewrite: SMTP reverse-path rewritten from by casper.infradead.org See http://www.infradead.org/rpr.html Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 10/11/14 at 12:32pm, Eric Dumazet wrote: > On Sat, 2014-10-11 at 10:36 +0200, Heiko Carstens wrote: > > Hi all, > > > > it just came to my attention that commit e341694e3eb5 > > "netlink: Convert netlink_lookup() to use RCU protected hash table" > > causes network latencies for me on s390. > > > > The testcase is quite simple and 100% reproducible on s390: > > > > Simply login via ssh to a remote system which has the above mentioned > > patch applied. Any action like pressing return now has significant > > latencies. Or in other words, working via such a connection becomes > > a pain ;) > > > > I haven't debugged it, however I assume the problem is that a) the > > commit introduces a synchronize_net() call und b) s390 kernels > > usually get compiled with CONFIG_HZ_100 while most other architectures > > use CONFIG_HZ_1000. > > If I change the kernel config to CONFIG_HZ_1000 the problem goes away, > > however I don't consider this a fix... > > > > Another reason why this hasn't been observed on x86 may or may not be > > that we haven't implemented CONFIG_HAVE_CONTEXT_TRACKING on s390 (yet). > > But that's just guessing... > > CC Paul and Sasha I think the issue here is obvious and a fix is on the way to move the insertion and removal to a worker to no longer require the synchronize_rcu(). What bothers me is that the synchronize_rcu() should only occur on expand/shrink and not for every table update. The default table size is 64. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/