Received: by 2002:ac0:b08d:0:0:0:0:0 with SMTP id l13csp4958272imc; Mon, 25 Feb 2019 14:28:08 -0800 (PST) X-Google-Smtp-Source: AHgI3IZPex76L9IRWHodWOFVhAiGBTWPjB1HPdPNUNRa5RiIfx9bIf/E4JFU3Vshm5k1LUSr4/4E X-Received: by 2002:a62:be02:: with SMTP id l2mr22805697pff.55.1551133688836; Mon, 25 Feb 2019 14:28:08 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1551133688; cv=none; d=google.com; s=arc-20160816; b=E1PWm5OoGfYYqFt65oGYpJNLLzQATaGvVGSxP+RZoFHU7p/jaHGrEjzeeMqdofg6kq UVz/O0WONVB44zQdXOkYc4rDCPT0YA+gtO6DCXM2kqoBOZZ8woPNAWBWhUKF2Hoj7rHc M4na8LrNOobmDxdGptDADfM1n+N7vnOl4NYTtQPbgvCVDs+lmfA8Qa2JtTcfStr9Q/vt KJACGRhBU5xt54AlxoIoLbDH5VzCHWFK5VM/e0Lkc6jfAza1cGKNh2GoCbfgHPMAV2fy Xb1gtzKLmrV9BzFXm+kBJpWyzFIIPc4Y/2ckzcarf9cs+FvP9iw4ZKvZet0ZACcadsyL Qk/g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:dkim-signature:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:to:subject; bh=I7MyxE+9hg3tFLUS914t6N0EdyIAW91f6NRmxSCzpU8=; b=EcYzVKUVkpKtQHN7U6mwf8PwZzfDJ4OK2VKYQ+hp5LwZIxyNRexIeo7X5D0yOxN7Y9 vZpxutTipqEfkdCKYB9WR727oC9qm9gBV5g6VJvXipmSwCkqzv9dErzg4lI2Iy+j3sXI kpSvA33CBywMCCvmM+oi9qbZc2f2T/jvuAOuSYOgl3e6hGesubeLvsJyVgQsQFHCM4X1 7LqyvFiy5OCDHOfSYgW1Ts9B6HppUqAH0huxaWOCZ9rsW4EFmryUhz9arAxVfA5tnpVJ 65vv5ktZSJoZQLToq/D1pELiEKmVFrprRu3uXC1Fus18UQCGp6v9WpfmOzmd539CI8w+ xhHQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@nvidia.com header.s=n1 header.b="dba/+Xl7"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=nvidia.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id k29si4243751pgb.267.2019.02.25.14.27.52; Mon, 25 Feb 2019 14:28:08 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@nvidia.com header.s=n1 header.b="dba/+Xl7"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=nvidia.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727567AbfBYW1a (ORCPT + 99 others); Mon, 25 Feb 2019 17:27:30 -0500 Received: from hqemgate14.nvidia.com ([216.228.121.143]:17030 "EHLO hqemgate14.nvidia.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726575AbfBYW13 (ORCPT ); Mon, 25 Feb 2019 17:27:29 -0500 Received: from hqpgpgate101.nvidia.com (Not Verified[216.228.121.13]) by hqemgate14.nvidia.com (using TLS: TLSv1.2, DES-CBC3-SHA) id ; Mon, 25 Feb 2019 14:27:36 -0800 Received: from hqmail.nvidia.com ([172.20.161.6]) by hqpgpgate101.nvidia.com (PGP Universal service); Mon, 25 Feb 2019 14:27:27 -0800 X-PGP-Universal: processed; by hqpgpgate101.nvidia.com on Mon, 25 Feb 2019 14:27:27 -0800 Received: from [10.26.11.173] (10.124.1.5) by HQMAIL101.nvidia.com (172.20.187.10) with Microsoft SMTP Server (TLS) id 15.0.1395.4; Mon, 25 Feb 2019 22:27:25 +0000 Subject: Re: Regression: SUNRPC: Use poll() to fix up the socket requeue races To: Trond Myklebust , "linux-kernel@vger.kernel.org" , "linux-nfs@vger.kernel.org" , "linux-tegra@vger.kernel.org" References: <95702994-6762-e42b-06ee-ae945c2137bd@nvidia.com> From: Jon Hunter Message-ID: <88677895-7f89-ae1d-3060-819f4b04f530@nvidia.com> Date: Mon, 25 Feb 2019 22:27:22 +0000 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.4.0 MIME-Version: 1.0 In-Reply-To: X-Originating-IP: [10.124.1.5] X-ClientProxiedBy: HQMAIL108.nvidia.com (172.18.146.13) To HQMAIL101.nvidia.com (172.20.187.10) Content-Type: text/plain; charset="utf-8" Content-Language: en-US Content-Transfer-Encoding: 7bit DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nvidia.com; s=n1; t=1551133656; bh=I7MyxE+9hg3tFLUS914t6N0EdyIAW91f6NRmxSCzpU8=; h=X-PGP-Universal:Subject:To:References:From:Message-ID:Date: User-Agent:MIME-Version:In-Reply-To:X-Originating-IP: X-ClientProxiedBy:Content-Type:Content-Language: Content-Transfer-Encoding; b=dba/+Xl7tXsZYcIIa441F1nsvYM4ihG3CE+LVZ22tREmcuMjP0omgcNR/me37cYDS 7Iuz2imibBmCPYwbq/TI7auedWVITIiFxA1U4GKW4sXPEk0okA6yfjgQ72Iiv63d4G pkzt132lNcVBdTYoALE/pnsf8XnnDaonECDsY1fgCgIVGkdBi8uimMGQqRsWVrQmJw KKSaLE8LlZq5ghWvHusTW5xjJiSUQi+UkqJdr5EYwENjKsr2J/locHCCFE+OgSNKVt mAsoeo9qbF3cLZotM254DjUpr+gLpBr5jO3Cqr6nHAxMg8K+baZelhCPif59jkqXqq g7WDvegzwOleg== Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 25/02/2019 21:03, Trond Myklebust wrote: > On Mon, 2019-02-25 at 20:25 +0000, Jon Hunter wrote: >> Hi Trond, >> >> Starting in next-20190222 I have observed a regression with NFS >> causing >> some of our boards to fail to boot. Bisect points to your commit ... >> >> commit 0ffe86f48026b7f34db22d1004bc9992f0db8b33 >> Author: Trond Myklebust >> Date: Wed Jan 30 14:51:26 2019 -0500 >> >> SUNRPC: Use poll() to fix up the socket requeue races >> >> >> After reverting this on top of -next I no longer see the problem. I >> have >> not had chance to look any closer, but wanted to see if you had any >> ideas what might be the problem. >> >> Cheers >> Jon > > What kind of boot is this? UDP or TCP? nfsroot? NFSv3 or NFSv4? This is nfsroot. I don't specify any particular NFS version from the kernel cmdline, but this is seen with ARM kernel configs tegra_defconfig and multi_v7_defconfig. Looking at the logs I am seeing the following crash which appears to point to UDP ... [ 8.032956] Unable to handle kernel NULL pointer dereference at virtual address 00000024 [ 8.041137] pgd = (ptrval) [ 8.043858] [00000024] *pgd=00000000 [ 8.047437] Internal error: Oops: 5 [#1] SMP ARM [ 8.052049] Modules linked in: [ 8.055104] CPU: 1 PID: 100 Comm: kworker/u9:2 Not tainted 5.0.0-rc7-next-20190222-g94a4752 #1 [ 8.063699] Hardware name: NVIDIA Tegra SoC (Flattened Device Tree) [ 8.069960] Workqueue: xprtiod xs_udp_data_receive_workfn [ 8.075353] PC is at udp_poll+0x30/0x64 [ 8.079178] LR is at udp_poll+0x10/0x64 [ 8.083006] pc : [] lr : [] psr: 20000113 [ 8.089257] sp : e8963ea8 ip : 00000000 fp : 00001064 [ 8.094471] r10: 00000000 r9 : e8963ef4 r8 : e8963efc [ 8.099687] r7 : e940db34 r6 : 00000000 r5 : e9d1c540 r4 : 00000345 [ 8.106203] r3 : c0d3bdd8 r2 : e8908cc0 r1 : e8908cc0 r0 : e89089c0 [ 8.112718] Flags: nzCv IRQs on FIQs on Mode SVC_32 ISA ARM Segment none [ 8.119843] Control: 10c5387d Table: 8020406a DAC: 00000051 [ 8.125579] Process kworker/u9:2 (pid: 100, stack limit = 0x(ptrval)) [ 8.132008] Stack: (0xe8963ea8 to 0xe8964000) [ 8.136352] 3ea0: e940d800 e940db04 e89089c0 c0dcac04 fffffff5 00000000 [ 8.144515] 3ec0: e88f4400 c0dcae80 e8963efc e8963ef4 e940da58 e940d800 c187c680 e8941b58 [ 8.152681] 3ee0: 000010f0 e940da50 e940db44 00000000 e8963f44 fffffff5 00000000 00000000 [ 8.160838] 3f00: e894247c ea203180 e940db34 ea3f5c00 e940db38 ea35e500 00000000 00000000 [ 8.169001] 3f20: c188ef50 c035f21c ea3f5c00 ea3f5c00 e8962018 ea203180 ea3f5c00 ea3f5c18 [ 8.177164] 3f40: e8962018 c188e91c ea203194 c1703d00 00000088 c035f548 e890d05c e8962000 [ 8.185327] 3f60: ea200480 c1703d00 e890d05c e890d040 ea200480 00000000 e890d05c ea203180 [ 8.193492] 3f80: c035f508 ea433ebc 00000000 c0364810 ea200480 c03646ec 00000000 00000000 [ 8.201657] 3fa0: 00000000 00000000 00000000 c03010e8 00000000 00000000 00000000 00000000 [ 8.209815] 3fc0: 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000 [ 8.217979] 3fe0: 00000000 00000000 00000000 00000000 00000013 00000000 00000000 00000000 [ 8.226154] [] (udp_poll) from [] (xs_poll_check_readable+0x30/0x70) [ 8.234241] [] (xs_poll_check_readable) from [] (xs_udp_data_receive_workfn+0x23c/0x2a0) [ 8.244062] [] (xs_udp_data_receive_workfn) from [] (process_one_work+0x164/0x450) [ 8.253359] [] (process_one_work) from [] (worker_thread+0x40/0x524) [ 8.261444] [] (worker_thread) from [] (kthread+0x124/0x154) [ 8.268834] [] (kthread) from [] (ret_from_fork+0x14/0x2c) [ 8.276044] Exception stack(0xe8963fb0 to 0xe8963ff8) [ 8.281084] 3fa0: 00000000 00000000 00000000 00000000 [ 8.289248] 3fc0: 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000 [ 8.297411] 3fe0: 00000000 00000000 00000000 00000000 00000013 00000000 [ 8.304017] Code: e1510002 13844041 e3140040 0a000005 (e5962024) [ 8.310129] ---[ end trace 52d5b8bca1dce72c ]--- [ 19.321470] nfs: server 192.168.99.1 not responding, still trying [ 55.001474] nfs: server 192.168.99.1 not responding, still trying Cheers Jon -- nvpublic