Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751362Ab1DFEqT (ORCPT ); Wed, 6 Apr 2011 00:46:19 -0400 Received: from e35.co.us.ibm.com ([32.97.110.153]:37035 "EHLO e35.co.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750738Ab1DFEqR convert rfc822-to-8bit (ORCPT ); Wed, 6 Apr 2011 00:46:17 -0400 From: Jay Vosburgh To: Weiping Pan cc: "Andy Gospodarek (supporter:BONDING DRIVER)" , netdev@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH] bonding-tlb: better balance when choosing slaves In-reply-to: <4D9BD1A9.1040402@gmail.com> References: <1301753395-1205-1-git-send-email-panweiping3@gmail.com> <20427.1301768703@death> <4D9BD1A9.1040402@gmail.com> Comments: In-reply-to Weiping Pan message dated "Wed, 06 Apr 2011 10:36:25 +0800." X-Mailer: MH-E 8.2; nmh 1.3; GNU Emacs 23.2.1 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8BIT Date: Tue, 05 Apr 2011 21:46:12 -0700 Message-ID: <3084.1302065172@death> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3892 Lines: 109 Weiping Pan wrote: >On 04/03/2011 02:25 AM, Jay Vosburgh wrote: >>> tlb_get_least_loaded_slave() always chooses slave from >>> bonding->first_slave, that gives the beginnig slaves more chances to be used. >>> >>> Let tlb_get_least_loaded_slave() chooses slave from a random positon in the >>> slave list, make all slaves transmit packets more balanced. >> If outgoing traffic is not being starved (i.e., connections are >> being balanced such that they are stacking up on one slave but >> under-utilizing another), then I don't understand what benefit this has. >> >> There is already some degree of randomness, as peers will be >> assigned in the order that packets are transmitted to them after each >> rebalance. The busiest peers will tend to be on the earlier slaves, and >> vice versa, but I'm not sure this is a bad thing. >> >> Does this have any real gain other than making the rx/tx >> statistics for the slaves more equal over time? >> >> I haven't measured it, but I would expect that for small numbers >> of peers, having them tend to stay on the same slaves over time is >> probably a good thing. >modprobe bonding mode=balance-tlb miimon=100 >ifconfig bond0 192.168.1.2 netmask 255.255.255.0 up >ifenslave bond0 eth0 >ifenslave bond0 eth1 >ifenslave bond0 eth2 >ping 192.168.1.100 -A -s 10240 > >I find that bonding will always use eth0 and eth1, it never uses eth2, >because tlb_get_least_loaded_slave() always chooses slave from >bonding->first_slave, that gives the beginnig slaves more chances to be >used. > >Do you think this is a problem ? Not for this test case, no. On the other hand, if you run three pings concurrently to three different destinations and it still never uses eth2, then that might be something to look into. >Does it has conflicts with the meaning of balance and reblance? Not really; with only one active flow, there isn't really any advantage to moving it around. The balance and rebalance activity becomes more interesting when the traffic volume and number of destinations is larger. -J >>> Signed-off-by: Weiping Pan(潘卫平) >>> --- >>> drivers/net/bonding/bond_alb.c | 17 +++++++++++++++-- >>> 1 files changed, 15 insertions(+), 2 deletions(-) >>> >>> diff --git a/drivers/net/bonding/bond_alb.c b/drivers/net/bonding/bond_alb.c >>> index 9bc5de3..9fa64b0 100644 >>> --- a/drivers/net/bonding/bond_alb.c >>> +++ b/drivers/net/bonding/bond_alb.c >>> @@ -36,6 +36,7 @@ >>> #include >>> #include >>> #include >>> +#include >>> #include >>> #include >>> #include >>> @@ -206,15 +207,27 @@ static long long compute_gap(struct slave *slave) >>> /* Caller must hold bond lock for read */ >>> static struct slave *tlb_get_least_loaded_slave(struct bonding *bond) >>> { >>> - struct slave *slave, *least_loaded; >>> + struct slave *slave, *least_loaded, *start_slave; >>> long long max_gap; >>> int i; >>> + u8 n; >>> >>> least_loaded = NULL; >>> + start_slave = bond->first_slave; >>> max_gap = LLONG_MIN; >>> + >>> + get_random_bytes(&n, 1); >>> + >>> + if (bond->slave_cnt == 0) >>> + return NULL; >>> + n = n % bond->slave_cnt; >>> + >>> + for (i=0; i>> + start_slave = start_slave->next; >>> + } >>> >>> /* Find the slave with the largest gap */ >>> - bond_for_each_slave(bond, slave, i) { >>> + bond_for_each_slave_from(bond, slave, i, start_slave) { >>> if (SLAVE_IS_OK(slave)) { >>> long long gap = compute_gap(slave); >>> >>> -- >>> 1.7.4 --- -Jay Vosburgh, IBM Linux Technology Center, fubar@us.ibm.com -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/