Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752216AbbKKXkW (ORCPT ); Wed, 11 Nov 2015 18:40:22 -0500 Received: from mail-pa0-f49.google.com ([209.85.220.49]:34616 "EHLO mail-pa0-f49.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751350AbbKKXkU (ORCPT ); Wed, 11 Nov 2015 18:40:20 -0500 Date: Wed, 11 Nov 2015 15:40:15 -0800 From: Alexei Starovoitov To: Peter Zijlstra Cc: David Miller , will.deacon@arm.com, daniel@iogearbox.net, arnd@arndb.de, yang.shi@linaro.org, linaro-kernel@lists.linaro.org, eric.dumazet@gmail.com, zlim.lnx@gmail.com, ast@kernel.org, linux-kernel@vger.kernel.org, netdev@vger.kernel.org, xi.wang@gmail.com, catalin.marinas@arm.com, linux-arm-kernel@lists.infradead.org, yhs@plumgrid.com, bblanco@plumgrid.com Subject: Re: [PATCH 2/2] arm64: bpf: add BPF XADD instruction Message-ID: <20151111234014.GA17014@ast-mbp.thefacebook.com> References: <20151111162341.GN9562@arm.com> <20151111172659.GA86334@ast-mbp.thefacebook.com> <20151111.123548.1039494689070388545.davem@davemloft.net> <20151111175741.GR17308@twins.programming.kicks-ass.net> <20151111181132.GA90947@ast-mbp.thefacebook.com> <20151111183128.GS17308@twins.programming.kicks-ass.net> <20151111184427.GH11639@twins.programming.kicks-ass.net> <20151111185415.GI11639@twins.programming.kicks-ass.net> <20151111195558.GA4173@ast-mbp.thefacebook.com> <20151111222135.GU17308@twins.programming.kicks-ass.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20151111222135.GU17308@twins.programming.kicks-ass.net> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2212 Lines: 46 On Wed, Nov 11, 2015 at 11:21:35PM +0100, Peter Zijlstra wrote: > On Wed, Nov 11, 2015 at 11:55:59AM -0800, Alexei Starovoitov wrote: > > Therefore things like memory barriers, full set of atomics are not applicable > > in bpf world. > > There are still plenty of wait-free constructs one can make using them. yes, but all such lock-free algos are typically based on cmpxchg8b and tight loop, so it would be very hard for verifier to proof termination of such loops. I think when we'd need to add something like this, we'll add new bpf insn that will be membarrier+cmpxhg8b+check+loop as a single insn, so it cannot be misused. I don't know of any concrete use case yet. All possible though. > Say a barrier/rendezvous construct for knowing when an event has > happened on all CPUs. > > But if you really do not want any of that, I suppose that is a valid > choice. I do want it :) and I think in the future we'll add a bunch of interesting stuff. May be including things like above. I just don't want to rush things in just because x86 has such insn or because gcc has a builtin for it. Like we discussed adding popcnt insn. It can be useful in some cases, but doesn't seem to worth the pain of adding it to interpreter, JITs and llvm backends... as of today... May be tomorrow it will be must have. > Is even privileged (e)BPF not allowed things like this? I was thinking > the strict no loops stuff was for unpriv (e)BPF only. the only difference between unpriv and priv is the ability to send all values (including kernel addresses) to user space (like tracing needs to see all registers). The rest is the same. root should never crash the kernel as well. If we relax even little bit for root then the whole bpf stuff is no better than kernel module. btw, support for mini loops was requested many times in the past. I guess we'd have to add something like this, but it's tricky. Mainly because control flow graph analysis becomes much more complicated. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/