Received: by 2002:ac0:a5a6:0:0:0:0:0 with SMTP id m35-v6csp207956imm; Thu, 30 Aug 2018 21:05:24 -0700 (PDT) X-Google-Smtp-Source: ANB0VdbIdOolCrsU9ew35oigTAhnyebn90UpKzHCERiHbMnWgLb+IDbpoPARHMZl2gcEcE9soLkW X-Received: by 2002:a62:c8d2:: with SMTP id i79-v6mr13660943pfk.35.1535688324597; Thu, 30 Aug 2018 21:05:24 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1535688324; cv=none; d=google.com; s=arc-20160816; b=O5U9tPNnlbUuDG5GGlgRwsym+IW8PIavxPH4USkEym2XseR9aBT8iNCSNYa9WJVqS7 h4SSb5NYAw0qHhPKIqSFcqOpz42RZGRQAKz1T6Mxi3DZBzUBUb+hSdal9Fcu6XyZUs6f eGGJPiJwAHXWki8WNw6PDbcOXHU0EK//ditMVoNCmfvGvNXnTC93czq3bPujSTsm3w47 IH8bthz+8ufx3QQM+wJH4u6kDWb1V4/XelCe0zdzuUe2cnSiU3+JfExUk3dpTKtIoIFs B+WtnDh/zB92QVEB7gAETpDlegg2UguVA5p4poxJUOaTi63yGQ8lPc9temaN17h40LQ/ RhLg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=6QxZWGXGFPMvOSO/ufxJ67V4A8a+YB4lxBa1WvgF7YU=; b=I1+qXF+vGfaS7f7v4LqpJxHWFqkFqVZ5oYiByYKkKd76NHCjuqLvQVd6RDjEbNz+YJ IdnAH6iGaIVYXluk367v5R/6drcHIyx98Fl29rOMH/iyktTXt+C1wPQgganNadcaU1Xo cGu2/zS5MI9jcgNtSC665HncN77Ke2B7GhveW8X7S2f2E0CG/r4LVWki55i2XPVCTGPC w9vsh/2HFhI/YMyEA71WSO8d6F7RTD9ur2+ivNqY0YDic3mJ79/uihVBF1kMsE/HvGj2 2Lq//vKhXo8VtiaESsxNn+v03kn8/RCYdeVdZ7qQQDAlVfrczx18SwIPXDFC0I7l4yCm 6vyA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id t127-v6si9248769pfc.118.2018.08.30.21.05.08; Thu, 30 Aug 2018 21:05:24 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727272AbeHaIJE (ORCPT + 99 others); Fri, 31 Aug 2018 04:09:04 -0400 Received: from zeniv.linux.org.uk ([195.92.253.2]:48162 "EHLO ZenIV.linux.org.uk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726974AbeHaIJD (ORCPT ); Fri, 31 Aug 2018 04:09:03 -0400 Received: from viro by ZenIV.linux.org.uk with local (Exim 4.87 #1 (Red Hat Linux)) id 1fvaej-0005c5-FW; Fri, 31 Aug 2018 04:03:33 +0000 Date: Fri, 31 Aug 2018 05:03:33 +0100 From: Al Viro To: Cong Wang Cc: Jamal Hadi Salim , Kees Cook , LKML , Jiri Pirko , David Miller , Linux Kernel Network Developers Subject: Re: [PATCH] net: sched: Fix memory exposure from short TCA_U32_SEL Message-ID: <20180831040333.GA20509@ZenIV.linux.org.uk> References: <20180826055801.GA42063@beast> <20180826061534.GT6515@ZenIV.linux.org.uk> <20180826173236.GU6515@ZenIV.linux.org.uk> <20180826225749.GY6515@ZenIV.linux.org.uk> <20180828000310.GE6515@ZenIV.linux.org.uk> <20180828155938.GF6515@ZenIV.linux.org.uk> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180828155938.GF6515@ZenIV.linux.org.uk> User-Agent: Mutt/1.9.1 (2017-09-22) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Aug 28, 2018 at 04:59:38PM +0100, Al Viro wrote: > On Tue, Aug 28, 2018 at 01:03:10AM +0100, Al Viro wrote: > > if (tcf_exts_get_net(&n->exts)) > > tcf_queue_work(&n->rwork, u32_delete_key_freepf_work); > > else > > u32_destroy_key(n->tp, n, true); > > ... and we hit u32_destroy_key(, , true), which does > > Speaking of which, we'd better never hit that branch for other reasons - there's > no RCU delay between removal of knode from the hash chain and its kfree(). > tcf_queue_work() does guarantee such delay (by use of queue_rcu_work()), direct > call doesn't... > > Anyway, whichever branch is taken, the memory corruption problem remains - the > comments below are accurate, AFAICS. > > > Incidentally, if we hit > > tcf_queue_work(&n->rwork, u32_delete_key_freepf_work); > > instead of u32_destroy_key(), the things don't seem to be any better - we > > won't do anything to until rtnl is dropped, so u32_destroy() won't > > break on the second pass through the loop - it'll free there and > > return. Setting us up for trouble, since when u32_delete_key_freepf_work() > > finally gets to u32_destroy_key() we'll have ->ht_down pointing > > to freed memory and decrementing its contents... Build the kernel with slab poisoning and try this: tc qdisc add dev eth0 ingress tc filter add dev eth0 parent ffff: protocol ip prio 100 handle 1: u32 divisor 1 tc filter add dev eth0 parent ffff: protocol ip prio 200 handle 2: u32 divisor 1 tc filter add dev eth0 parent ffff: protocol ip prio 100 handle 1:0:11 u32 ht 1: link 801: offset at 0 mask 0f00 shift 6 plus 0 eat match ip protocol 6 ff tc filter delete dev eth0 parent ffff: protocol ip prio 200 tc filter change dev eth0 parent ffff: protocol ip prio 100 handle 1:0:11 u32 ht 1: link 0: offset at 0 mask 0f00 shift 6 plus 0 eat match ip protocol 6 ff tc filter delete dev eth0 parent ffff: protocol ip prio 100 Then watch it oops in u32_lookup_ht() from u32_get() from tc_del_tfilter() Oopsing insn: cmp %ebp,0x8(%rbx). RBX: 6b6b6b6b6b6b6b6b, i.e. slab poison... What happens is that ht 801: (created when we'd added tcf_proto for prio 200) gets pinned down by link 801: in the third tc filter add. Then removal of prio 200 triggers u32_destroy(), dropping refcount on 801: and doing nothing else to it. Then filter change drops the last reference to 801:, freeing it. And we have a freed struct tc_u_hnode stuck in the middle of the list...