Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754407AbaFLAQG (ORCPT ); Wed, 11 Jun 2014 20:16:06 -0400 Received: from out03.mta.xmission.com ([166.70.13.233]:49489 "EHLO out03.mta.xmission.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753042AbaFLAQE (ORCPT ); Wed, 11 Jun 2014 20:16:04 -0400 From: ebiederm@xmission.com (Eric W. Biederman) To: paulmck@linux.vnet.ibm.com Cc: chiluk@canonical.com, Rafael Tinoco , linux-kernel@vger.kernel.org, davem@davemloft.net, Christopher Arges , Jay Vosburgh References: <20140611133919.GZ4581@linux.vnet.ibm.com> <539879B8.4010204@canonical.com> <20140611161857.GC4581@linux.vnet.ibm.com> <53989F7B.6000004@canonical.com> <874mzr41kf.fsf@x220.int.ebiederm.org> <20140611225228.GO4581@linux.vnet.ibm.com> <87ioo7vy5s.fsf@x220.int.ebiederm.org> <20140611234902.GQ4581@linux.vnet.ibm.com> Date: Wed, 11 Jun 2014 17:14:55 -0700 In-Reply-To: <20140611234902.GQ4581@linux.vnet.ibm.com> (Paul E. McKenney's message of "Wed, 11 Jun 2014 16:49:02 -0700") Message-ID: <87bntzt24g.fsf@x220.int.ebiederm.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.3 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-XM-AID: U2FsdGVkX1+cllWzteljBBkZZOKS1cCE2I+1L46oCqI= X-SA-Exim-Connect-IP: 98.234.51.111 X-SA-Exim-Mail-From: ebiederm@xmission.com X-Spam-Report: * -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP * 0.7 XMSubLong Long Subject * 0.0 T_TM2_M_HEADER_IN_MSG BODY: T_TM2_M_HEADER_IN_MSG * 0.8 BAYES_50 BODY: Bayes spam probability is 40 to 60% * [score: 0.4736] * -0.0 DCC_CHECK_NEGATIVE Not listed in DCC * [sa07 1397; Body=1 Fuz1=1 Fuz2=1] * 1.0 T_XMDrugObfuBody_08 obfuscated drug references X-Spam-DCC: XMission; sa07 1397; Body=1 Fuz1=1 Fuz2=1 X-Spam-Combo: *;paulmck@linux.vnet.ibm.com X-Spam-Relay-Country: Subject: Re: Possible netns creation and execution performance/scalability regression since v3.8 due to rcu callbacks being offloaded to multiple cpus X-Spam-Flag: No X-SA-Exim-Version: 4.2.1 (built Wed, 14 Nov 2012 13:58:17 -0700) X-SA-Exim-Scanned: Yes (on in02.mta.xmission.com) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org "Paul E. McKenney" writes: > On Wed, Jun 11, 2014 at 04:12:15PM -0700, Eric W. Biederman wrote: >> "Paul E. McKenney" writes: >> >> > On Wed, Jun 11, 2014 at 01:46:08PM -0700, Eric W. Biederman wrote: >> >> On the chance it is dropping the old nsproxy which calls syncrhonize_rcu >> >> in switch_task_namespaces that is causing you problems I have attached >> >> a patch that changes from rcu_read_lock to task_lock for code that >> >> calls task_nsproxy from a different task. The code should be safe >> >> and it should be an unquestions performance improvement but I have only >> >> compile tested it. >> >> >> >> If you can try the patch it will tell is if the problem is the rcu >> >> access in switch_task_namespaces (the only one I am aware of network >> >> namespace creation) or if the problem rcu case is somewhere else. >> >> >> >> If nothing else knowing which rcu accesses are causing the slow down >> >> seem important at the end of the day. >> >> >> >> Eric >> >> >> > >> > If this is the culprit, another approach would be to use workqueues from >> > RCU callbacks. The following (untested, probably does not even build) >> > patch illustrates one such approach. >> >> For reference the only reason we are using rcu_lock today for nsproxy is >> an old lock ordering problem that does not exist anymore. >> >> I can say that in some workloads setns is a bit heavy today because of >> the synchronize_rcu and setns is more important that I had previously >> thought because pthreads break the classic unix ability to do things in >> your process after fork() (sigh). >> >> Today daemonize is gone, and notify the parent process with a signal >> relies on task_active_pid_ns which does not use nsproxy. So the old >> lock ordering problem/race is gone. >> >> The description of what was happening when the code switched from >> task_lock to rcu_read_lock to protect nsproxy. > > OK, never mind, then! ;-) I appreciate you posting your approach. I just figured I should do my homework, and verify my fuzzy memory. Who knows there might be different performance problems with my approach. But I am hoping this is one of those happy instances where we can just make everything simpler. Eric -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/