From: Jann Horn
Date: Tue, 24 Nov 2020 18:06:38 +0100
Subject: Re: [PATCH] syscalls: Document OCI seccomp filter interactions & workaround
To: Christoph Hellwig, Kees Cook, Andy Lutomirski, Will Drewry
Cc: Mark Wielaard, Florian Weimer, Christian Brauner, Linux API,
    "open list:DOCUMENTATION", kernel list, dev@opencontainers.org,
    Jonathan Corbet, "Carlos O'Donell"
In-Reply-To: <20201124164546.GA14094@infradead.org>
References: <87lfer2c0b.fsf@oldenburg2.str.redhat.com>
    <20201124122639.x4zqtxwlpnvw7ycx@wittgenstein>
    <878saq3ofx.fsf@oldenburg2.str.redhat.com>
    <20201124164546.GA14094@infradead.org>
X-Mailing-List: linux-kernel@vger.kernel.org

+seccomp maintainers/reviewers
[thread context is at
https://lore.kernel.org/linux-api/87lfer2c0b.fsf@oldenburg2.str.redhat.com/
]

On Tue, Nov 24, 2020 at 5:49 PM Christoph Hellwig wrote:
> On Tue, Nov 24, 2020 at 03:08:05PM +0100, Mark Wielaard wrote:
> > For valgrind the issue is statx which we try to use before falling back
> > to stat64, fstatat or stat (depending on architecture, not all define
> > all of these). The problem with these fallbacks is that under some
> > containers (libseccomp versions) they might return EPERM instead of
> > ENOSYS. This causes really obscure errors that are really hard to
> > diagnose.
>
> So find a way to detect these completely broken container run times
> and refuse to run under them at all.  After all they've decided to
> deliberately break the syscall ABI.  (and yes, we gave the the rope
> to do that with seccomp :().

FWIW, if the consensus is that seccomp filters that return -EPERM by
default are categorically wrong, I think it should be fairly easy to
add a check to the seccomp core that detects whether the installed
filter returns EPERM for some fixed unused syscall number and, if so,
prints a warning to dmesg or something along those lines...
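
To make the idea concrete, here is a rough userspace analogue of that
probe, not the in-kernel check itself: call a syscall number that is
assumed to be unallocated and look at the errno that comes back. This is
only a sketch. The names PROBE_SYSCALL_NR, classify_probe_errno() and
probe_unused_syscall() are hypothetical, and the number 1000 is an
assumption (unallocated on x86-64 and arm64 as far as I know); any real
check would need a number reserved for this purpose on every architecture.

```python
import ctypes
import errno

# Hypothetical probe number: ASSUMED to be unallocated on the running
# architecture. Nothing reserves it, so treat it as a placeholder.
PROBE_SYSCALL_NR = 1000


def classify_probe_errno(err):
    """Interpret the errno seen after calling an unallocated syscall."""
    if err == errno.ENOSYS:
        # What an unfiltered kernel reports for a syscall it lacks.
        return "ok"
    if err == errno.EPERM:
        # A filter is answering EPERM where ENOSYS was expected.
        return "eperm-filter"
    # Anything else (including success) tells us nothing useful.
    return "unknown"


def probe_unused_syscall(nr=PROBE_SYSCALL_NR):
    """Invoke syscall `nr` through libc; return errno, or 0 on success."""
    libc = ctypes.CDLL(None, use_errno=True)
    ctypes.set_errno(0)
    ret = libc.syscall(ctypes.c_long(nr))
    return ctypes.get_errno() if ret == -1 else 0
```

A program could run classify_probe_errno(probe_unused_syscall()) once at
startup and warn (or refuse to run) on "eperm-filter". Note that a filter
whose default action kills the process would terminate the probe instead
of returning, so this only distinguishes the EPERM and ENOSYS cases.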