Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp6116309ybv; Tue, 18 Feb 2020 10:12:33 -0800 (PST) X-Google-Smtp-Source: APXvYqykcWFjjhIUhBM2O6TVmQuCczDUjpbqK560EqMgdz/TWHImJAMZxWIbiULpuhrV/uISvNsU X-Received: by 2002:a05:6808:181:: with SMTP id w1mr2131268oic.176.1582049553459; Tue, 18 Feb 2020 10:12:33 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1582049553; cv=none; d=google.com; s=arc-20160816; b=lcaTp/c4KAly2mgIibu8zSXjNhJ28VrtweUByNdu6TtDdRZKxrThpA72XWQZjiXvYN 2sKE4gjp8ExIaep/Xk/76mFr4fJCOUfRHAt5/fKBnSKHulIJs1lFb1aTItz1puQzZou2 KDWGTE8VGO0cwF7tRDzoqznpqrnnkb9YaUZLl0BcPOnJ/vkiBEZG6p1Kvh+GgRh/PLoQ C6uyPyaih2zKZ4iRzCEQQ/hz8QpUKZpG+GJSFLT2qRzAlY6QWdcI0wU7El395Emnw7L2 LQkBV5HREFUHWkwrrZ8tM8Q5hU6b5XJbNijM5Wmf6TYfxx9VaNYx4KyuTynbOiN6ShI9 P1iA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:date:subject:user-agent:message-id :references:cc:in-reply-to:from:to:content-transfer-encoding :mime-version; bh=X+25b4R7UYIFepwsImi65TpbVhyLAc5CUzZVQu3cPns=; b=a1vPF3DMT5VA7xU5tGNUfo7dhrshjuykFD6mYG20g8WmeLig9l4KV8dyHIZIIbDXNi dyh2/6Lw4KlLasLNK7o4ROPaPvAxx4N6xnn4nfFlTg5E2oZxmXwu5iFJqJlp/TFOSt8N jbBY6bO2Vtk8yT+kL7/KHm+/0Zy3pIvCBq8ih+Tj3pcXFWYnMb0XyS5jwzREkdSHCmt/ LfXYnlrAO/GUB6rM4Amo3aGVeEtLFiaKWUHRDkQFMlrbkWfsJ2tiIEV2ScOBT7UsFhlH diKMR6iu/d0K3WurXjXWG4xK+6AIVB8I/dBTKy4j5Xx3RLf/Jtt2fsxaOq8/2VmVh538 gDhA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id z8si2092883otp.79.2020.02.18.10.12.20; Tue, 18 Feb 2020 10:12:33 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726496AbgBRSLA convert rfc822-to-8bit (ORCPT + 99 others); Tue, 18 Feb 2020 13:11:00 -0500 Received: from mail.fireflyinternet.com ([77.68.26.236]:58033 "EHLO fireflyinternet.com" rhost-flags-OK-FAIL-OK-FAIL) by vger.kernel.org with ESMTP id S1726174AbgBRSK7 (ORCPT ); Tue, 18 Feb 2020 13:10:59 -0500 X-Default-Received-SPF: pass (skip=forwardok (res=PASS)) x-ip-name=78.156.65.138; Received: from localhost (unverified [78.156.65.138]) by fireflyinternet.com (Firefly Internet (M1)) with ESMTP (TLS) id 20267249-1500050 for multiple; Tue, 18 Feb 2020 18:10:12 +0000 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8BIT To: Hillf Danton , Toralf Foerster From: Chris Wilson In-Reply-To: <20200217023013.5696-1-hdanton@sina.com> Cc: Hillf Danton , intel-gfx , Linux Kernel References: <20200216032625.11452-1-hdanton@sina.com> <20200216145518.9728-1-hdanton@sina.com> <20200217023013.5696-1-hdanton@sina.com> Message-ID: <158204940993.8112.7051733810171511907@skylake-alporthouse-com> User-Agent: alot/0.6 Subject: Re: [Intel-gfx] kernel 5.5.4: BUG: kernel NULL pointer dereference, address: 000000000000000 Date: Tue, 18 Feb 2020 18:10:09 +0000 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Quoting Hillf Danton (2020-02-17 02:30:13) > > On Sun, 16 Feb 2020 22:17:59 +0100 Toralf Foerster wrote: > > > > This is similar to the behaviour before, the BUG occurres after few minutes/hours. > > It brought now: > > > > > > Feb 16 22:09:01 t44 CROND[8918]: (root) CMD (/usr/lib/sa/sa1 30 2 -S XALL) > > Feb 16 22:10:01 t44 CROND[8980]: (root) CMD (/usr/lib/sa/sa1 30 2 -S XALL) > > Feb 16 22:10:37 t44 kernel: BUG: kernel NULL pointer dereference, address: 0000000000000000 > > Feb 16 22:10:37 t44 kernel: #PF: supervisor instruction fetch in kernel mode > > Feb 16 22:10:37 t44 kernel: #PF: error_code(0x0010) - not-present page > > Feb 16 22:10:37 t44 kernel: PGD 0 P4D 0 > > Feb 16 22:10:37 t44 kernel: Oops: 0010 [#1] SMP PTI > > Feb 16 22:10:37 t44 kernel: CPU: 1 PID: 3403 Comm: X Tainted: G T 5.5.4 #3 > > Feb 16 22:10:37 t44 kernel: Hardware name: LENOVO 20AQCTO1WW/20AQCTO1WW, BIOS GJET92WW (2.42 ) 03/03/2017 > > Feb 16 22:10:37 t44 kernel: RIP: 0010:0x0 > > Feb 16 22:10:37 t44 kernel: Code: Bad RIP value. > > Feb 16 22:10:37 t44 kernel: RSP: 0018:ffffad37009eba20 EFLAGS: 00010087 > > Feb 16 22:10:37 t44 kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: 00000000000e68b0 > > Feb 16 22:10:37 t44 kernel: RDX: 0000000000000000 RSI: ffff8b35598cba88 RDI: ffff8b362d9146c0 > > Feb 16 22:10:37 t44 kernel: RBP: ffff8b362d9146c0 R08: 0000000000000000 R09: ffff8b35598cbe00 > > Feb 16 22:10:37 t44 kernel: R10: 0000000000000002 R11: 0000000000000005 R12: ffffad37009eba28 > > Feb 16 22:10:37 t44 kernel: R13: 0000000000000000 R14: ffff8b36a40fa200 R15: ffff8b369bf99600 > > Feb 16 22:10:37 t44 kernel: FS: 00007f2b751398c0(0000) GS:ffff8b36b2680000(0000) knlGS:0000000000000000 > > Feb 16 22:10:37 t44 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > > Feb 16 22:10:37 t44 kernel: CR2: ffffffffffffffd6 CR3: 0000000323292001 CR4: 00000000001606e0 > > Feb 16 22:10:37 t44 kernel: Call Trace: > > Feb 16 22:10:37 t44 kernel: dma_fence_signal_locked+0x85/0xc0 > > Feb 16 22:10:37 t44 kernel: dma_fence_signal+0x1f/0x40 > > Feb 16 22:10:37 t44 kernel: i915_request_retire+0x9a/0x290 [i915] > > Feb 16 22:10:37 t44 kernel: i915_request_create+0x3f/0xc0 [i915] > > Feb 16 22:10:37 t44 kernel: i915_gem_do_execbuffer+0x973/0x17d0 [i915] > > Feb 16 22:10:37 t44 kernel: i915_gem_execbuffer2_ioctl+0xe9/0x3a0 [i915] > > Feb 16 22:10:37 t44 kernel: ? i915_gem_execbuffer_ioctl+0x2c0/0x2c0 [i915] > > Feb 16 22:10:37 t44 kernel: drm_ioctl_kernel+0xae/0x100 [drm] > > Feb 16 22:10:37 t44 kernel: drm_ioctl+0x223/0x400 [drm] > > Feb 16 22:10:37 t44 kernel: ? i915_gem_execbuffer_ioctl+0x2c0/0x2c0 [i915] > > Feb 16 22:10:37 t44 kernel: do_vfs_ioctl+0x4d4/0x760 > > Feb 16 22:10:37 t44 kernel: ksys_ioctl+0x5b/0x90 > > Feb 16 22:10:37 t44 kernel: __x64_sys_ioctl+0x15/0x20 > > Feb 16 22:10:37 t44 kernel: do_syscall_64+0x46/0x100 > > Feb 16 22:10:37 t44 kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9 > > Feb 16 22:10:37 t44 kernel: RIP: 0033:0x7f2b75372137 > > Feb 16 22:10:37 t44 kernel: Code: 00 00 00 75 0c 48 c7 c0 ff ff ff ff 48 83 c4 18 c3 e8 2d d4 01 00 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 19 ed 0c 00 f7 d8 64 89 01 48 > > Feb 16 22:10:37 t44 kernel: RSP: 002b:00007ffebe2b4c38 EFLAGS: 00000246 ORIG_RAX: 0000000000000010 > > Feb 16 22:10:37 t44 kernel: RAX: ffffffffffffffda RBX: 00007ffebe2b4c80 RCX: 00007f2b75372137 > > Feb 16 22:10:37 t44 kernel: RDX: 00007ffebe2b4c80 RSI: 0000000040406469 RDI: 000000000000000d > > Feb 16 22:10:37 t44 kernel: RBP: 0000000040406469 R08: 0000561477eb8670 R09: 0000000000000202 > > Feb 16 22:10:37 t44 kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 0000561477e7b0b0 > > Feb 16 22:10:37 t44 kernel: R13: 000000000000000d R14: 00007f2b74b51c48 R15: 0000000000000000 > > Feb 16 22:10:37 t44 kernel: Modules linked in: af_packet bridge stp llc ip6table_filter ip6_tables xt_MASQUERADE iptable_nat nf_nat nf_log_ipv4 nf_log_common xt_LOG xt_limit xt_recent xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter ip_tables uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videodev videobuf2_common btusb btrtl btbcm btintel bluetooth ecdh_generic ecc rmi_smbus rmi_core mousedev x86_pkg_temp_thermal coretemp kvm_intel kvm i915 irqbypass intel_gtt i2c_algo_bit drm_kms_helper cfbfillrect syscopyarea input_leds snd_hda_codec_realtek snd_hda_codec_generic cfbimgblt sysfillrect sysimgblt fb_sys_fops cfbcopyarea wmi_bmof snd_hda_intel snd_intel_dspcfg drm snd_hda_codec tpm_tis psmouse aesni_intel snd_hda_core glue_helper crypto_simd iwlmvm cryptd snd_pcm thinkpad_acpi ledtrig_audio tpm_tis_core iwlwifi pcspkr drm_panel_orientation_quirks ehci_pci atkbd e1000e i2c_i801 ehci_hcd tpm thermal snd_timer ac snd soundcore battery rng_core agpgart > i2c_core wmi evdev > > Feb 16 22:10:37 t44 kernel: CR2: 0000000000000000 > > Feb 16 22:10:37 t44 kernel: ---[ end trace 7df1d4246cb74d36 ]--- > > Feb 16 22:10:37 t44 kernel: RIP: 0010:0x0 > > Feb 16 22:10:37 t44 kernel: Code: Bad RIP value. > > Feb 16 22:10:37 t44 kernel: RSP: 0018:ffffad37009eba20 EFLAGS: 00010087 > > Feb 16 22:10:37 t44 kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: 00000000000e68b0 > > Feb 16 22:10:37 t44 kernel: RDX: 0000000000000000 RSI: ffff8b35598cba88 RDI: ffff8b362d9146c0 > > Feb 16 22:10:37 t44 kernel: RBP: ffff8b362d9146c0 R08: 0000000000000000 R09: ffff8b35598cbe00 > > Feb 16 22:10:37 t44 kernel: R10: 0000000000000002 R11: 0000000000000005 R12: ffffad37009eba28 > > Feb 16 22:10:37 t44 kernel: R13: 0000000000000000 R14: ffff8b36a40fa200 R15: ffff8b369bf99600 > > Feb 16 22:10:37 t44 kernel: FS: 00007f2b751398c0(0000) GS:ffff8b36b2680000(0000) knlGS:0000000000000000 > > Feb 16 22:10:37 t44 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > > Feb 16 22:10:37 t44 kernel: CR2: ffffffffffffffd6 CR3: 0000000323292001 CR4: 00000000001606e0 > > Feb 16 22:11:01 t44 CROND[8996]: (root) CMD (/usr/lib/sa/sa1 30 2 -S XALL) > > Feb 16 22:12:49 t44 syslog-ng[1912]: syslog-ng starting up; version='3.22.1' > > Feb 16 22:12:49 t44 acpid[1943]: starting up with netlink and the input layer > > Then it's not lock that counts for the NULL dereference. > And try the debug patch after reverting the last patch. > > --- a/drivers/dma-buf/dma-fence.c > +++ b/drivers/dma-buf/dma-fence.c > @@ -129,7 +129,6 @@ EXPORT_SYMBOL(dma_fence_context_alloc); > int dma_fence_signal_locked(struct dma_fence *fence) > { > struct dma_fence_cb *cur, *tmp; > - struct list_head cb_list; > > lockdep_assert_held(fence->lock); > > @@ -137,15 +136,12 @@ int dma_fence_signal_locked(struct dma_f > &fence->flags))) > return -EINVAL; > > - /* Stash the cb_list before replacing it with the timestamp */ > - list_replace(&fence->cb_list, &cb_list); > - > fence->timestamp = ktime_get(); > set_bit(DMA_FENCE_FLAG_TIMESTAMP_BIT, &fence->flags); > trace_dma_fence_signaled(fence); > > - list_for_each_entry_safe(cur, tmp, &cb_list, node) { > - INIT_LIST_HEAD(&cur->node); > + list_for_each_entry_safe(cur, tmp, &fence->cb_list, node) { > + list_del_init(&cur->node); > cur->func(fence, cur); > } > > > _______________________________________________ > Intel-gfx mailing list > Intel-gfx@lists.freedesktop.org > https://lists.freedesktop.org/mailman/listinfo/intel-gfx >