Received: by 2002:a25:8b12:0:0:0:0:0 with SMTP id i18csp139897ybl; Mon, 12 Aug 2019 13:14:32 -0700 (PDT) X-Google-Smtp-Source: APXvYqyn1Auhs6NWnYla5ApaGt93tLWdmA8NWIwQd8m+PMGD0G6Inw4xlpX3+GIAHvLtcmJdxDqY X-Received: by 2002:aa7:9682:: with SMTP id f2mr16828194pfk.256.1565640872034; Mon, 12 Aug 2019 13:14:32 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1565640872; cv=none; d=google.com; s=arc-20160816; b=QZ/vnPYG4c/WMhFBe6kwh7hEQEDu4KPRMrTPM3Csnu8Mo+i87eEcUzIJ0N7C9sDlTc pJnikTB//in5MzWrcpryaeZ+jD7GP1swT24ZC2zQxL/YMdJMPe4+v9QBIK0IsjWKmAhe P9cl/LmGnDFzKKUDYIkAVH5vfkHXRQIViivfu0cVktH2XR+HLGWWUxNq8QrCzfaQjLXo NsrCA05ONSg/jAR9f7hY4BDpwF8P+bI2u882vr103B84RpFcFIwnNtylGVdkUJ6WZxFP V+Dq42qZoZrrf5XhSrXp/1u3X1az71MZ77kDDYi8VMss/8au8nU1zyeNHw9WJwDQif2q lvkQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:to:subject:dkim-signature; bh=9bz98gX/vj4hZ9b018mnpxRR+6pAkprICagFvUhCrgM=; b=lfY+PM7JezfXz6iRQmKdqBHoPfAo+m8mZd/IEbLnVYZ0+uFiSbvCi0G4K/2YEfyHka b7VemDYNi0j91eF2UfckIFSW2AO8Zlr4sg5M+h5fRovfomnmcX+hjcXV57R1Fzk3+JI6 yQm3q+bmF9htKjvcIcfvS6gQwQSTbDQt4oYMmamLOPwsGaaZuahHMEDOCFg32pL8MHxA e8EE7uKkF9rcwvkYWhcxlaHpvRcyBT3ftTIWUPCu1BOZug4920EoGF31J8rvF5ptvlge YAo9UlGNstvicONRtuc+5CPsJHl/2I0vWvh58wzk2YT/ZavpfinWIaJJ5bIYACwsNGEp Hr7Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@eikelenboom.it header.s=20180706 header.b=ijGBeVBy; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=eikelenboom.it Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id m5si19677473pgp.507.2019.08.12.13.14.15; Mon, 12 Aug 2019 13:14:32 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=fail header.i=@eikelenboom.it header.s=20180706 header.b=ijGBeVBy; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=eikelenboom.it Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727440AbfHLUNl (ORCPT + 99 others); Mon, 12 Aug 2019 16:13:41 -0400 Received: from server.eikelenboom.it ([91.121.65.215]:52184 "EHLO server.eikelenboom.it" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727193AbfHLUNl (ORCPT ); Mon, 12 Aug 2019 16:13:41 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=eikelenboom.it; s=20180706; h=Content-Transfer-Encoding:Content-Type: In-Reply-To:MIME-Version:Date:Message-ID:From:References:To:Subject:Sender: Reply-To:Cc:Content-ID:Content-Description:Resent-Date:Resent-From: Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help: List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=9bz98gX/vj4hZ9b018mnpxRR+6pAkprICagFvUhCrgM=; b=ijGBeVByqn7yrub+/0Bq/bmJ3f vh8IFW5/ehKNhUkDdQaTE4zGJuNOmsLqFX006P97bXIq4UOO9KAD704fpKtz3P941bGa5Snjgl9wA pFjr3SGzhTvWfD4MHzZk/aDPxingu1mZhFzxlSHdc81q/ipgNQ1j/Ay0n8RVk4+SRmjE=; Received: from ip4da85049.direct-adsl.nl ([77.168.80.73]:46986 helo=[172.16.1.50]) by server.eikelenboom.it with esmtpsa (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1hxGhO-0006Pt-Pn; Mon, 12 Aug 2019 22:13:46 +0200 Subject: Re: 5.3-rc3-ish VM crash: RIP: 0010:tcp_trim_head+0x20/0xe0 To: Eric Dumazet , netdev , linux-kernel References: <27aebb57-0ca9-fba3-092f-39131ad2b648@eikelenboom.it> <4d803565-b716-42ab-1db8-3dcade91e939@gmail.com> From: Sander Eikelenboom Message-ID: Date: Mon, 12 Aug 2019 22:17:40 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.7.2 MIME-Version: 1.0 In-Reply-To: <4d803565-b716-42ab-1db8-3dcade91e939@gmail.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 12/08/2019 19:56, Eric Dumazet wrote: > > > On 8/12/19 2:50 PM, Sander Eikelenboom wrote: >> L.S., >> >> While testing a somewhere-after-5.3-rc3 kernel (which included the latest net merge (33920f1ec5bf47c5c0a1d2113989bdd9dfb3fae9), >> one of my Xen VM's (which gets quite some network load) crashed. >> See below for the stacktrace. >> >> Unfortunately I haven't got a clear trigger, so bisection doesn't seem to be an option at the moment. >> I haven't encountered this on 5.2, so it seems to be an regression against 5.2. >> >> Any ideas ? >> >> -- >> Sander >> >> >> [16930.653595] general protection fault: 0000 [#1] SMP NOPTI >> [16930.653624] CPU: 0 PID: 3275 Comm: rsync Not tainted 5.3.0-rc3-20190809-doflr+ #1 >> [16930.653657] RIP: 0010:tcp_trim_head+0x20/0xe0 >> [16930.653677] Code: 2e 0f 1f 84 00 00 00 00 00 90 41 54 41 89 d4 55 48 89 fd 53 48 89 f3 f6 46 7e 01 74 2f 8b 86 bc 00 00 00 48 03 86 c0 00 00 00 <8b> 40 20 66 83 f8 01 74 19 31 d2 31 f6 b9 20 0a 00 00 48 89 df e8 >> [16930.653741] RSP: 0000:ffffc90000003ad8 EFLAGS: 00010286 >> [16930.653762] RAX: fffe888005bf62c0 RBX: ffff8880115fb800 RCX: 000000008010000b > > crash in " mov 0x20(%rax),%eax" and RAX=fffe888005bf62c0 (not a valid kernel address) > > Look like one bit corruption maybe. > > Nothing comes to mind really between 5.2 and 53 that could explain this. Hi Eric, Hmm could be it's a rare coincidence, sp that it just never occurred on pre 5.3 by chance. Let's wait and see if it reoccurs, will report back if it does. Thanks for your explanation. -- Sander >> [16930.653791] RDX: 00000000000005a0 RSI: ffff8880115fb800 RDI: ffff888016b00880 >> [16930.653819] RBP: ffff888016b00880 R08: 0000000000000001 R09: 0000000000000000 >> [16930.653848] R10: ffff88800ae00800 R11: 00000000bfe632e6 R12: 00000000000005a0 >> [16930.653875] R13: 0000000000000001 R14: 00000000bfe62d46 R15: 0000000000000004 >> [16930.653913] FS: 00007fe71fe2cb80(0000) GS:ffff88801f200000(0000) knlGS:0000000000000000 >> [16930.653943] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 >> [16930.653965] CR2: 000055de0f3e7000 CR3: 0000000011f32000 CR4: 00000000000006f0 >> [16930.653993] Call Trace: >> [16930.654005] >> [16930.654018] tcp_ack+0xbb0/0x1230 >> [16930.654033] tcp_rcv_established+0x2e8/0x630 >> [16930.654053] tcp_v4_do_rcv+0x129/0x1d0 >> [16930.654070] tcp_v4_rcv+0xac9/0xcb0 >> [16930.654088] ip_protocol_deliver_rcu+0x27/0x1b0 >> [16930.654109] ip_local_deliver_finish+0x3f/0x50 >> [16930.654128] ip_local_deliver+0x4d/0xe0 >> [16930.654145] ? ip_protocol_deliver_rcu+0x1b0/0x1b0 >> [16930.654163] ip_rcv+0x4c/0xd0 >> [16930.654179] __netif_receive_skb_one_core+0x79/0x90 >> [16930.654200] netif_receive_skb_internal+0x2a/0xa0 >> [16930.654219] napi_gro_receive+0xe7/0x140 >> [16930.654237] xennet_poll+0x9be/0xae0 >> [16930.654254] net_rx_action+0x136/0x340 >> [16930.654271] __do_softirq+0xdd/0x2cf >> [16930.654287] irq_exit+0x7a/0xa0 >> [16930.654304] xen_evtchn_do_upcall+0x27/0x40 >> [16930.654320] xen_hvm_callback_vector+0xf/0x20 >> [16930.654339] >> [16930.654349] RIP: 0033:0x55de0d87db99 >> [16930.654364] Code: 00 00 48 89 7c 24 f8 45 39 fe 45 0f 42 fe 44 89 7c 24 f4 eb 09 0f 1f 40 00 83 e9 01 74 3e 89 f2 48 63 f8 4c 01 d2 44 38 1c 3a <75> 25 44 38 6c 3a ff 75 1e 41 0f b6 3c 24 40 38 3a 75 14 41 0f b6 >> [16930.654432] RSP: 002b:00007ffd5531eec8 EFLAGS: 00000a87 ORIG_RAX: ffffffffffffff0c >> [16930.655004] RAX: 0000000000000002 RBX: 000055de0f3e8e50 RCX: 000000000000007f >> [16930.655034] RDX: 000055de0f3dc2d2 RSI: 0000000000003492 RDI: 0000000000000002 >> [16930.655062] RBP: 0000000000007fff R08: 00000000000080ea R09: 00000000000001f0 >> [16930.655089] R10: 000055de0f3d8e40 R11: 0000000000000094 R12: 000055de0f3e0f2a >> [16930.655116] R13: 0000000000000010 R14: 0000000000007f16 R15: 0000000000000080 >> [16930.655144] Modules linked in: >> [16930.655200] ---[ end trace 533367c95501b645 ]--- >> [16930.655223] RIP: 0010:tcp_trim_head+0x20/0xe0 >> [16930.655243] Code: 2e 0f 1f 84 00 00 00 00 00 90 41 54 41 89 d4 55 48 89 fd 53 48 89 f3 f6 46 7e 01 74 2f 8b 86 bc 00 00 00 48 03 86 c0 00 00 00 <8b> 40 20 66 83 f8 01 74 19 31 d2 31 f6 b9 20 0a 00 00 48 89 df e8 >> [16930.655312] RSP: 0000:ffffc90000003ad8 EFLAGS: 00010286 >> [16930.655331] RAX: fffe888005bf62c0 RBX: ffff8880115fb800 RCX: 000000008010000b >> [16930.655360] RDX: 00000000000005a0 RSI: ffff8880115fb800 RDI: ffff888016b00880 >> [16930.655387] RBP: ffff888016b00880 R08: 0000000000000001 R09: 0000000000000000 >> [16930.655414] R10: ffff88800ae00800 R11: 00000000bfe632e6 R12: 00000000000005a0 >> [16930.655441] R13: 0000000000000001 R14: 00000000bfe62d46 R15: 0000000000000004 >> [16930.655475] FS: 00007fe71fe2cb80(0000) GS:ffff88801f200000(0000) knlGS:0000000000000000 >> [16930.655502] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 >> [16930.655525] CR2: 000055de0f3e7000 CR3: 0000000011f32000 CR4: 00000000000006f0 >> [16930.655553] Kernel panic - not syncing: Fatal exception in interrupt >> [16930.655789] Kernel Offset: disabled >>