From: "Eric W. Biederman"
To: Dmitry Osipenko
Cc: linux-kernel@vger.kernel.org, linux-arch@vger.kernel.org, linux-api@vger.kernel.org, Linus Torvalds, Alexey Gladkov, Kyle Huey, Oleg Nesterov, Kees Cook, Al Viro
Date: Tue, 18 Jan 2022 11:52:56 -0600
In-Reply-To: <99353796-eea7-b765-f355-46e50f1b8773@gmail.com> (Dmitry Osipenko's message of "Tue, 18 Jan 2022 20:30:53 +0300")
Message-ID: <87o8496idj.fsf@email.froward.int.ebiederm.org>
Subject: Re: [PATCH 1/8] signal: Make SIGKILL during coredumps an explicit special case
X-Mailing-List: linux-kernel@vger.kernel.org

Dmitry Osipenko writes:

> 11.01.2022 20:20, Eric W. Biederman wrote:
>> Dmitry Osipenko writes:
>>
>>> 08.01.2022 21:13, Eric W. Biederman wrote:
>>>> Dmitry Osipenko writes:
>>>>
>>>>> 05.01.2022 22:58, Eric W. Biederman wrote:
>>>>>>
>>>>>> I have not yet been able to figure out how to run gst-plugin-scanner in
>>>>>> a way that triggers this. In truth I can't figure out how to
>>>>>> run gst-plugin-scanner in a useful way.
>>>>>>
>>>>>> I am going to set up some unit tests and see if I can reproduce your
>>>>>> hang another way, but if you could give me some more information on what
>>>>>> you are doing to trigger this I would appreciate it.
>>>>>
>>>>> Thanks, Eric. The distro is Arch Linux, but it's a development
>>>>> environment where I'm running latest GStreamer from git master.
>>>>> I'll try to figure out the reproduction steps and get back to you.
>>>>
>>>> Thank you.
>>>>
>>>> Until I can figure out why this is causing problems I have dropped the
>>>> following two patches from my queue:
>>>>     signal: Make SIGKILL during coredumps an explicit special case
>>>>     signal: Drop signals received after a fatal signal has been processed
>>>>
>>>> I have replaced them with the following two patches that just do what
>>>> is needed for the rest of the code in the series:
>>>>     signal: Have prepare_signal detect coredumps using
>>>>     signal: Make coredump handling explicit in complete_signal
>>>>
>>>> Perversely my failure to change the SIGKILL handling when coredumps are
>>>> happening proves to me that I need to change the SIGKILL handling when
>>>> coredumps are happening to make the code more maintainable.
>>>
>>> Eric, thank you again. I started to look at the reproduction steps and
>>> haven't completed it yet. It turned out the problem affects only the older
>>> NVIDIA Tegra2 Cortex-A9 CPU, which lacks support for the ARM NEON
>>> instruction set, hence the problem isn't visible on x86 and other CPUs out
>>> of the box. I'll need to check whether the problem could be simulated on all
>>> arches or maybe it's specific to VFP exception handling of ARM32.
>>
>> It sounds like the gstreamer plugins only fail on certain hardware on
>> arm32, and things don't hang in coredumps unless the plugins fail.
>> That does make things tricky to minimize.
>>
>> I have just verified that the known problematic code is not
>> in linux-next for Jan 11 2022.
>>
>> If folks, as they have time, can double check linux-next and verify all is
>> well, I would appreciate it. I don't expect that there are problems but
>> sometimes one problem hides another.
>
> Hello Eric,
>
> I reproduced the trouble on x86_64.
>
> Here are the reproduction steps, using Arch Linux and linux-next-20211224:
>
> ```
> sudo pacman -S base-devel git mesa glu meson wget
> git clone https://github.com/grate-driver/gstreamer.git
> cd gstreamer
> git checkout sigill
> meson --prefix=/usr -Dgst-plugins-base:playback=enabled -Dgst-devtools:validate=disabled build
> cd build
> sudo ninja install
> wget https://www.peach.themazzone.com/big_buck_bunny_720p_h264.mov
> rm -r ~/.cache/gstreamer-1.0
> gst-play-1.0 ./big_buck_bunny_720p_h264.mov
> ```
>
> The SIGILL, thrown by [1], causes the hang. There is no hang with the v5.16.1 kernel.
>
> [1] https://github.com/grate-driver/gstreamer/commit/006f9a2ee6dcf7b31c9b5413815d6054d82a3b2f

Thank you. I will verify this works before I add my updated version to
my signal-for-v5.18 branch.

Have you by any chance tried a newer version of linux-next without
commit fbc11520b58a ("signal: Make SIGKILL during coredumps an explicit
special case") in it?

If not, I will double check that, with the commit pulled out, nothing
breaks in the case you have documented.

Eric
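
For anyone who wants a quick sanity check that does not need the GStreamer build, the following is a minimal sketch, not part of the reproduction steps above and not taken from the thread. It assumes a Linux host with Python 3; the function name `run_sigill_child` and the 10 second timeout are hypothetical choices. It starts a single-threaded child that sends SIGILL to itself and reports whether the child actually terminates. It may well fail to trigger the hang described above, since the real case involves a full gst-play-1.0 process, so treat a passing result as weak evidence only.

```python
import signal
import subprocess
import sys

# Hypothetical helper: spawn a child that raises SIGILL on itself and
# wait a bounded time for it to die. Returns the child's return code,
# or None if the child did not exit within timeout_s seconds.
def run_sigill_child(timeout_s=10.0):
    child_code = "import os, signal; os.kill(os.getpid(), signal.SIGILL)"
    proc = subprocess.Popen([sys.executable, "-c", child_code])
    try:
        return proc.wait(timeout=timeout_s)
    except subprocess.TimeoutExpired:
        # The child is stuck. Send SIGKILL but do not wait again, because
        # a task wedged in the kernel might not go away promptly.
        proc.kill()
        return None

if __name__ == "__main__":
    rc = run_sigill_child()
    if rc is None:
        print("child did not exit in time (possible hang)")
    elif rc == -signal.SIGILL:
        print("child was terminated by SIGILL as expected")
    else:
        print("unexpected return code:", rc)
```

A negative return code from `subprocess` means the child was terminated by that signal number, so `-signal.SIGILL` is the normal outcome on a kernel without the hang.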