Received: by 10.223.176.5 with SMTP id f5csp178446wra; Thu, 8 Feb 2018 19:12:09 -0800 (PST) X-Google-Smtp-Source: AH8x224kUMaSJiRZX0bn1zX3AgQntteXgEiE5y4E1RjEPkq4xb6EHfGGBTD44xvlgjwwNeAdw2rd X-Received: by 10.99.102.1 with SMTP id a1mr1121133pgc.452.1518145929065; Thu, 08 Feb 2018 19:12:09 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1518145929; cv=none; d=google.com; s=arc-20160816; b=Q4r6dvILWIxGttZfz7Jm2IpuKdbcbPVjVNOZcUVKv4brv1KAgRB9Z/0Iiosh682f7E Du9Lsqj1SQP69Fh8113E9XG1TP0O2LpztFRdvp5de1DnVHTO5tOjy790RMBSndRYHGR8 njLeHujHeeF+Fc08y5eQxf/egFoC5mTMs0b7xwlugRwS/Dxj1RfiR8srDn5LNLbo6XuG 381ZsUtk9fk+Z3LncX+ZIOIukBnYFOH4U5E87PGanYvAaAsiv1W4M9TMou4NaR3acpk1 pK0ov0q/bKbY4QtYl4L6AHAfOwsTyl45HdmC0rTDzEHNib4QUbF55lQ6wX7RJH0K6J+e JQlA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:message-id:references :in-reply-to:subject:cc:date:to:from:arc-authentication-results; bh=UKihrqMG4/HfYNe3Om2z0f4HEcWxWFJ4NofdJ+5GWJY=; b=ey1B/0u6EoqA9LLRb8yAGSGYmNR9L/urDDrfFtzHWlxMz7UX4ykZQl3h0tVNRV8hnM vS7AJMwi/ecBcVMEnKubqj86NjPoYi1mJNXzlTO+j3cxIu9ve92XopICO48Z3/T8/w6s RiwhgZrhXIRxlFfuU6RDF+uwSB9eC27pfDXGyH1nx50Z/ZA4inWFfO+R0v2HaPC/VMju A7LI6LW76JTDEV9Fwu3rKGFUPhIg0agDJDMX2TjP+yPoD11H8iRzl0QawSdlXRO1GuKw Lx657btKmbTcg3cQWGH3Sqd1jz1Kgbqr1Dsky/SseMkVkyVuH7EviPEziPlgCzej79Ux ADjg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id c11-v6si959146plm.556.2018.02.08.19.11.55; Thu, 08 Feb 2018 19:12:09 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752464AbeBIDK5 (ORCPT + 99 others); Thu, 8 Feb 2018 22:10:57 -0500 Received: from mx2.suse.de ([195.135.220.15]:48935 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752246AbeBIDK4 (ORCPT ); Thu, 8 Feb 2018 22:10:56 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay1.suse.de (charybdis-ext.suse.de [195.135.220.254]) by mx2.suse.de (Postfix) with ESMTP id 29237AD4B; Fri, 9 Feb 2018 03:10:55 +0000 (UTC) From: NeilBrown To: Oleg Drokin Date: Fri, 09 Feb 2018 14:10:44 +1100 Cc: James Simmons , Greg Kroah-Hartman , devel@driverdev.osuosl.org, Andreas Dilger , Linux Kernel Mailing List , Lustre Development List , wang di Subject: Re: [PATCH 41/80] staging: lustre: lmv: separate master object with master stripe In-Reply-To: <09CE6CEC-52BC-4B29-B609-40DE68A64A33@intel.com> References: <1471378773-24590-1-git-send-email-jsimmons@infradead.org> <1471378773-24590-42-git-send-email-jsimmons@infradead.org> <87r2pvkkl5.fsf@notabene.neil.brown.name> <09CE6CEC-52BC-4B29-B609-40DE68A64A33@intel.com> Message-ID: <87o9kylux7.fsf@notabene.neil.brown.name> MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha256; protocol="application/pgp-signature" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org --=-=-= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable On Thu, Feb 08 2018, Oleg Drokin wrote: >> On Feb 8, 2018, at 8:39 PM, NeilBrown wrote: >>=20 >> On Tue, Aug 16 2016, James Simmons wrote: > > my that=E2=80=99s an old patch > >>=20 ... >>=20 >> Whoever converted it to "!strcmp()" inverted the condition. This is a >> perfect example of why I absolutely *loathe* the "!strcmp()" construct!! >>=20 >> This causes many tests in the 'sanity' test suite to return >> -ENOMEM (that had me puzzled for a while!!). > > huh? I am not seeing anything of the sort and I was running sanity > all the time until a recent pause (but going to resume). That does surprised me - I reproduce it every time. I have two VMs running a SLE12-SP2 kernel with patches from lustre-release applied. These are servers. They have 2 3G virtual disks each. I have two over VMs running current mainline. These are clients. I guess your 'recent pause' included between v4.15-rc1 (8e55b6fd0660) and v4.15-rc6 (a93639090a27) - a full month when lustre wouldn't work at all :-( > >> This seems to suggest that no-one has been testing the mainline linux >> lustre. >> It also seems to suggest that there is a good chance that there >> are other bugs that have crept in while no-one has really been caring. >> Given that the sanity test suite doesn't complete for me, but just >> hangs (in test_27z I think), that seems particularly likely. > > Works for me, here=E2=80=99s a run from earlier today on 4.15.0: Well that's encouraging .. I haven't looked into this one yet - I'm not even sure where to start. > >> So my real question - to anyone interested in lustre for mainline linux >> - is: can we actually trust this code at all? > > Absolutely. Seems that you just stumbled upon a corner case that was not > being hit by people that do the testing, so you have something unique abo= ut > your setup, I guess. > >> I'm seriously tempted to suggest that we just >> rm -r drivers/staging/lustre >>=20 >> drivers/staging is great for letting the community work on code that has >> been "thrown over the wall" and is not openly developed elsewhere, but >> that is not the case for lustre. lustre has (or seems to have) an open >> development process. Having on-going development happen both there and >> in drivers/staging seems a waste of resources. > > It is a bit of a waste of resources, but there are some other things here. > E.g. we cannot have any APIs with no users in the kernel. > Also some people like to have in-kernel modules coming with their distros > (there were some users that used staging client on ubuntu as their > setup). > > Instead the plan was to clean up the staging client into acceptable state, > move it out of staging, bring in all the missing features and then > drop the client (more or less) from the lustre-release. That sounds like a great plan. Any idea why it didn't happen? It seems there is a lot of upstream work mixed in with the clean up, and I don't think that really helps anyone. Is it at all realistic that the client might be removed from lustre-release? That might be a good goal to work towards. > >> Might it make sense to instead start cleaning up the code in >> lustre-release so as to make it meet the upstream kernel standards. >> Then when the time is right, the kernel code can be moved *out* of >> lustre-release and *in* to linux. Then development can continue in >> Linux (just like it does with other Linux filesystems). > > While we can be cleaning lustre in lustre-release, there are some things > we cannot do as easily, e.g. decoupling Lustre client from the server. > Also it would not attract any reviews from all the janitor or > (more importantly) Al Viro and other people with a sharp eyes. > >> An added bonus of this is that there is an obvious path to getting >> server support in mainline Linux. The current situation of client-only >> support seems weird given how interdependent the two are. > > Given the pushback Lustre client was given I have no hope Lustre server > will get into mainline in my lifetime. Even if it is horrible it would be nice to have it in staging... I guess the changes required to ext4 prohibit that... I don't suppose it can be made to work with mainline ext4 in a reduced-functionality-and-performance way?? I think it would be a lot easier to motivate forward progress if there were a credible end goal of everything being in mainline. > >> What do others think? Is there any chance that the current lustre in >> Linux will ever be more than a poor second-cousin to the external >> lustre-release. If there isn't, should we just discard it now and move >> on? > > > I think many useful cleanups and fixes came from the staging tree at > the very least. > The biggest problem with it all is that we are in staging tree so > we cannot bring it to parity much. And we are in staging tree because > there=E2=80=99s a whole bunch of =E2=80=9Ccleanups=E2=80=9D requested tha= t take a lot of effort > (in both implementing them and then in finding other ways of achieving > things that were done in old ways before). Do you have a list of requested cleanups? I would find that to be useful. > I understand that beggars cannot be choosers and while there are people > that are grandfathered with their atrocities in current kernel tree, > we must adhere to the shining standards first before having our chance, > but the standards are not easy to adhere to in an established sizeable > codebase. > > Realistically speaking I suspect if we drop Lustre from staging, > it=E2=80=99s unlikely there would remain any steam behind the cleanup eff= orts > at all. Thanks for your thoughts, NeilBrown --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQIzBAEBCAAdFiEEG8Yp69OQ2HB7X0l6Oeye3VZigbkFAlp9ETQACgkQOeye3VZi gbm/6RAAuU07NkyPTPW9RPH6hvGMB8Nfq7Xyq4zsFJLlsGzzFK5basR7jV5e3s1r vRKx1+ee/j9L9BEzLhq6rJ3ZW685jEGltWzjOcS2bAaUST8FIW9zMJKflajD6PUL jkaQF4wwf2MSR7AuWX72A5kHDwbJP6mB7gSy1236SPthy2FkKCwUZOR6o5Wkij0P ectAZznRFUwjqxwvuu1whiz43g30+cGMuTy/6riHU84NxkUjvcCunQF0e30OBELU /Ehz4bNDDZxNBBgBoy37k4WHtEseTyg3KyY3eLADb0quM7T1qgtn7KUXE6pvRS0t sD2v2FSejw+fMS/WL3CX5d+D8AqfSBuZulIzbDEYzuyMBs92TYUv0YS1CpXGHILK DULk65UgCZz/b8NlYvtObAiKaE0FrQGbEQMkKrbsIL+++LNHwnExn8gusobmv0oS diO1liRcNPO67whTiO6ZWYlyvKUgFSTms+I+tERaOCK8KtIvxilnmpIvHvTubeuI p6u8jOH91wMPlm8qv/6n89TFOyigfmyARftx3G8BN0lP+5MFh+AuO0zE3EJx3Xq4 rT3oAsVJ/5nGJBXbOuq2QdvBZ2nTbWyD9U4yxV4wsAHOveMiuod2ANiP4eIpsLBU bNWn/kYXpPtiLBRN6STYoTX058AGgcIZDkYNyCec4FRaWcNN4MU= =pHap -----END PGP SIGNATURE----- --=-=-=--