Received: by 10.223.176.5 with SMTP id f5csp813280wra; Wed, 7 Feb 2018 07:58:38 -0800 (PST) X-Google-Smtp-Source: AH8x226uDuWUVDSTQgaZKQfDqnJdRPyqCi9kwBDyouMYI8bCD3+CKh6Po+fFVgacO1BMtkKKOzqj X-Received: by 10.99.97.23 with SMTP id v23mr996167pgb.250.1518019118875; Wed, 07 Feb 2018 07:58:38 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1518019118; cv=none; d=google.com; s=arc-20160816; b=A4kMaNehTpcEQ5iAwWRaw0nmy9l5RRj6c91DcU/w0u19i5S3YxT+MX/lYAZuyXSgkv UP5go13RgA7+E/yn99eqj2LYAa45JKFFoovWLRLasoNen4xdpTU4wT+Y9ilhxIKN6LpE qUaZM3Xi/YRcnyBDz7ZnhlKrR+V+uw+cLS/wPb2gFxfrua99YsHEhuezmtmJUUccJGRJ +BYujYadEHuFD9aWLYF+rK3CN0NonudqnnHSiutU3xAg2NAgcuLg6YPZXh+XXr18FbwR kFm6MkatcYSELVlKgbSl8qu3dhZoNRvQWJ970RO0mexWwBA3JOoy/0xIbwFvab3cx51a uouw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:date:cc:to:from:subject:message-id :arc-authentication-results; bh=GDPNmYg9lHRTo4HsYFrn5N1mbHzjQykWLSZy3hKYjgI=; b=XNHV+nKrPNnbX4mLVDHx0DtPScQ1Qn26/k86NSjw3FlORq0GEussKEXCNiq/mEC260 zcoATeD8yKuG3ORF6jPu5vUsWjp2jo9oO/sYexX49IqaV+s5A4nC6g/ZPQhbTbZiaLyD uB0gFVzEleEJPZr7lD4vHgAvcp2Jvu+TjOISuHNf062tcxPI2OvT5hXx7zLLqXQOE398 ISo8GxT/MawqvSlK2zQ04r4wDjEPPbXbZqFE2Fpep4wD6ZwQwoc1ChO5xjgOj2//5pDm M/Yar0jKoNMtrY4y8Fl5N4UmOtv2tGdlP4VdrWRFqAaST09MhPqTcmeNNsIlJuEwI42r lV+w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id v5-v6si1230142plg.729.2018.02.07.07.58.24; Wed, 07 Feb 2018 07:58:38 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754667AbeBGP5M (ORCPT + 99 others); Wed, 7 Feb 2018 10:57:12 -0500 Received: from mx3-rdu2.redhat.com ([66.187.233.73]:43460 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1754254AbeBGP5L (ORCPT ); Wed, 7 Feb 2018 10:57:11 -0500 Received: from smtp.corp.redhat.com (int-mx05.intmail.prod.int.rdu2.redhat.com [10.11.54.5]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 6B5707C6BB; Wed, 7 Feb 2018 15:57:10 +0000 (UTC) Received: from localhost.localdomain (unknown [10.32.181.99]) by smtp.corp.redhat.com (Postfix) with ESMTP id 4C7ACD1BBD; Wed, 7 Feb 2018 15:57:08 +0000 (UTC) Message-ID: <1518019027.15509.21.camel@redhat.com> Subject: Re: WARNING: proc registration bug in clusterip_tg_check From: Paolo Abeni To: Cong Wang Cc: coreteam@netfilter.org, David Miller , Florian Westphal , Jozsef Kadlecsik , Alexey Kuznetsov , LKML , Linux Kernel Network Developers , netfilter-devel@vger.kernel.org, Pablo Neira Ayuso , syzkaller-bugs@googlegroups.com, Hideaki YOSHIFUJI , syzbot Date: Wed, 07 Feb 2018 16:57:07 +0100 In-Reply-To: <1517992986.2526.13.camel@redhat.com> References: <089e082d13a0cf25ed05648bf8b3@google.com> <1517992986.2526.13.camel@redhat.com> Content-Type: text/plain; charset="UTF-8" Mime-Version: 1.0 Content-Transfer-Encoding: 7bit X-Scanned-By: MIMEDefang 2.79 on 10.11.54.5 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.2]); Wed, 07 Feb 2018 15:57:10 +0000 (UTC) X-Greylist: inspected by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.2]); Wed, 07 Feb 2018 15:57:10 +0000 (UTC) for IP:'10.11.54.5' DOMAIN:'int-mx05.intmail.prod.int.rdu2.redhat.com' HELO:'smtp.corp.redhat.com' FROM:'pabeni@redhat.com' RCPT:'' Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, 2018-02-07 at 09:43 +0100, Paolo Abeni wrote: > On Tue, 2018-02-06 at 22:42 -0800, Cong Wang wrote: > > On Tue, Feb 6, 2018 at 6:27 AM, syzbot > > wrote: > > > Hello, > > > > > > syzbot hit the following crash on net-next commit > > > 617aebe6a97efa539cc4b8a52adccd89596e6be0 (Sun Feb 4 00:25:42 2018 +0000) > > > Merge tag 'usercopy-v4.16-rc1' of > > > git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux > > > > > > So far this crash happened 5 times on net-next, upstream. > > > C reproducer is attached. > > > syzkaller reproducer is attached. > > > Raw console output is attached. > > > compiler: gcc (GCC) 7.1.1 20170620 > > > .config is attached. > > > > > > IMPORTANT: if you fix the bug, please add the following tag to the commit: > > > Reported-by: syzbot+03218bcdba6aa76441a3@syzkaller.appspotmail.com > > > It will help syzbot understand when the bug is fixed. See footer for > > > details. > > > If you forward the report, please keep this part and the footer. > > > > > > x_tables: ip_tables: osf match: only valid for protocol 6 > > > x_tables: ip_tables: osf match: only valid for protocol 6 > > > x_tables: ip_tables: osf match: only valid for protocol 6 > > > ------------[ cut here ]------------ > > > proc_dir_entry 'ipt_CLUSTERIP/172.20.0.170' already registered > > > WARNING: CPU: 1 PID: 4152 at fs/proc/generic.c:330 proc_register+0x2a4/0x370 > > > fs/proc/generic.c:329 > > > Kernel panic - not syncing: panic_on_warn set ... > > > > > > CPU: 1 PID: 4152 Comm: syzkaller851476 Not tainted 4.15.0+ #221 > > > Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS > > > Google 01/01/2011 > > > Call Trace: > > > __dump_stack lib/dump_stack.c:17 [inline] > > > dump_stack+0x194/0x257 lib/dump_stack.c:53 > > > panic+0x1e4/0x41c kernel/panic.c:183 > > > __warn+0x1dc/0x200 kernel/panic.c:547 > > > report_bug+0x211/0x2d0 lib/bug.c:184 > > > fixup_bug.part.11+0x37/0x80 arch/x86/kernel/traps.c:178 > > > fixup_bug arch/x86/kernel/traps.c:247 [inline] > > > do_error_trap+0x2d7/0x3e0 arch/x86/kernel/traps.c:296 > > > do_invalid_op+0x1b/0x20 arch/x86/kernel/traps.c:315 > > > invalid_op+0x22/0x40 arch/x86/entry/entry_64.S:1097 > > > RIP: 0010:proc_register+0x2a4/0x370 fs/proc/generic.c:329 > > > RSP: 0018:ffff8801cbd6ee20 EFLAGS: 00010286 > > > RAX: dffffc0000000008 RBX: ffff8801d2181038 RCX: ffffffff815a57ae > > > RDX: 0000000000000000 RSI: 1ffff100397add74 RDI: 1ffff100397add49 > > > RBP: ffff8801cbd6ee70 R08: 1ffff100397add0b R09: 0000000000000000 > > > R10: ffff8801cbd6ecd8 R11: 0000000000000000 R12: ffff8801b2bb1cc0 > > > R13: dffffc0000000000 R14: ffff8801b0d8dbc8 R15: ffff8801b2bb1d81 > > > proc_create_data+0xf8/0x180 fs/proc/generic.c:494 > > > clusterip_config_init net/ipv4/netfilter/ipt_CLUSTERIP.c:250 [inline] > > > > I think there is probably a race condition between clusterip_config_entry_put() > > and clusterip_config_init(), after we release the spinlock, a new proc > > with the same IP could be created therefore triggers this warning.... > > > > I am not sure if it is enough to just move the proc_remove() under > > spinlock... > > I *think* we should change the order on proc fs entry creation, > because clusterip_config_init() can race with itself, > clusterip_config_init() returns NULL if the clusterip_config_init has > no pte, and currently such entry is inserted into the list with NULL > pte and the list lock itself is released before creating the PTE. I was wrong. My suggested fix does not work at all. I tried your code and it fixes the issue here. Feel free to submit with: Tested-by: Paolo Abeni Thank you, Paolo