Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752113Ab3HZTIg (ORCPT ); Mon, 26 Aug 2013 15:08:36 -0400 Received: from mx1.redhat.com ([209.132.183.28]:31913 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751725Ab3HZTIf (ORCPT ); Mon, 26 Aug 2013 15:08:35 -0400 Date: Mon, 26 Aug 2013 15:08:22 -0400 From: Dave Jones To: Hillf Danton Cc: Linux-MM , Linux Kernel , Linus Torvalds Subject: Re: unused swap offset / bad page map. Message-ID: <20130826190757.GB27768@redhat.com> Mail-Followup-To: Dave Jones , Hillf Danton , Linux-MM , Linux Kernel , Linus Torvalds References: <20130807153030.GA25515@redhat.com> <20130819231836.GD14369@redhat.com> <20130821204901.GA19802@redhat.com> <20130823032127.GA5098@redhat.com> <20130823035344.GB5098@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2840 Lines: 49 On Mon, Aug 26, 2013 at 11:45:53AM +0800, Hillf Danton wrote: > On Fri, Aug 23, 2013 at 11:53 AM, Dave Jones wrote: > > > > It actually seems worse, seems I can trigger it even easier now, as if > > there's a leak. > > > Can you please try the new fix for TLB flush? > > commit 2b047252d087be7f2ba > Fix TLB gather virtual address range invalidation corner cases No luck. [ 4588.541886] swap_free: Unused swap offset entry 00002d15 [ 4588.541952] BUG: Bad page map in process trinity-kid12 pte:005a2a80 pmd:22c01f067 [ 4588.541979] addr:00007f0e95fa8000 vm_flags:00100073 anon_vma:ffff880217665550 mapping: (null) index:1a42 [ 4588.542011] Modules linked in: snd_seq_dummy fuse hidp bnep scsi_transport_iscsi rfcomm ipt_ULOG can_bcm can_raw nfnetlink nfc caif_socket caif af_802154 phonet af_rxrpc bluetooth rfkill can llc2 pppoe pppox ppp_generic slhc irda crc_ccitt rds af_key rose x25 atm netrom appletalk ipx p8023 psnap p8022 llc ax25 xfs libcrc32c snd_hda_codec_realtek snd_hda_intel e1000e snd_hda_codec snd_hwdep ptp snd_seq snd_seq_device snd_pcm usb_debug pps_core pcspkr snd_page_alloc snd_timer snd soundcore [ 4588.542245] CPU: 2 PID: 25390 Comm: trinity-kid12 Not tainted 3.11.0-rc7+ #13 [ 4588.542321] 0000000000000000 ffff88021ba33c98 ffffffff816f9ddf 00007f0e95fa8000 [ 4588.542354] ffff88021ba33ce0 ffffffff81177047 00000000005a2a80 0000000000001a42 [ 4588.542386] 00007f0e96000000 ffff88022c01fd40 00000000005a2a80 ffff88021ba33e00 [ 4588.542418] Call Trace: [ 4588.542435] [] dump_stack+0x54/0x74 [ 4588.542457] [] print_bad_pte+0x187/0x220 [ 4588.542478] [] unmap_single_vma+0x524/0x850 [ 4588.542500] [] unmap_vmas+0x49/0x90 [ 4588.542521] [] exit_mmap+0xc5/0x170 [ 4588.542542] [] mmput+0x77/0x100 [ 4588.542562] [] do_exit+0x28d/0xcd0 [ 4588.542583] [] ? trace_hardirqs_on_caller+0x115/0x1e0 [ 4588.542607] [] ? trace_hardirqs_on+0xd/0x10 [ 4588.542629] [] do_group_exit+0x4c/0xc0 [ 4588.543534] [] SyS_exit_group+0x14/0x20 [ 4588.544438] [] tracesys+0xdd/0xe2 I can reproduce this pretty quickly by driving the system into swapping using a few instances of 'trinity -C64' (this creates 64 threads) I'm not sure how far back this bug goes, so I'll try some older kernels and see if I can bisect it, because we don't seem to be getting closer to figuring out what's actually happening.. Dave -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/