Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp1547414ybb; Thu, 2 Apr 2020 02:58:46 -0700 (PDT) X-Google-Smtp-Source: APiQypKRG9LwwpusMuZohoeXwzY5lCO6e2EMDalYIe75jqeFoHlo3drJnMPy1I3ea5SkAtSCyCcu X-Received: by 2002:a4a:c116:: with SMTP id s22mr2112789oop.0.1585821525716; Thu, 02 Apr 2020 02:58:45 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1585821525; cv=none; d=google.com; s=arc-20160816; b=kfcJ4118Ov+u+0nmQM7CcjpoS7TVpR5uQjZGNDtmVnkh4gIZgQhoKfBJeiR2NVGfiP cl+egtPFOsbDME5MaiQ4t8do5T9aGNhGSHyBGPZ5+8nkDQaWsgYS9IFaz7l8aYR0Deun 0O3HOL/QYT6n0N/EUc1Qu6G6RGV77fezJBH8mTggYay903q4S42szdYv476lVVhv+ZIQ 00+4QXFl2EKqNjFbYiZf1dMNk/9yxplD420JxK64r1BNdG4Fz4Z1Xtz8CWTP8Uvb4lXC rr+AsKdK6TnEmMespm7sYmkECsRpNKA7y7LjU8V8EUAqqIYEoRZ5+tfaOPrK981E3xK9 EHRw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:message-id:date:references :in-reply-to:subject:cc:to:from; bh=0YdEv7dE26IFdGR0L7RZi3HQqiq/OENpSV31gPlBnME=; b=ay+JfSXa8xVoIRVjVrE6QNst7q33ygbwRfMHtQGwod7uph1+fuxP+NQBrgJRLBSw7e YM9jdY0CyCkVMz8PR6qoqrFCJcByZTS6PnFaG2OKJZItqcTTeGnTXrHc2UaCHgD1dFRI zmqEMJwBAZJhTFUe6adWJ4Q9DGA8Pzu9kwO7fwsP+lNhEMrYN1H7+5tspPtkJOmtsKhy JijExKQ5wABJ9biDZPhBXBKHM7qsVfNl9TvE+UzZoPh5/IQooGGkNRIjCAvFhnTFtNif hTZG/4TKgmSQbsxkiozeNluXmwXesVlRXjBKkVGs93yPGJxF0jKTqBKoG4f7oXXzRyjf LbbA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d1si2108258otq.225.2020.04.02.02.58.33; Thu, 02 Apr 2020 02:58:45 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2387749AbgDBJ57 (ORCPT + 99 others); Thu, 2 Apr 2020 05:57:59 -0400 Received: from Galois.linutronix.de ([193.142.43.55]:36813 "EHLO Galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728135AbgDBJ57 (ORCPT ); Thu, 2 Apr 2020 05:57:59 -0400 Received: from p5de0bf0b.dip0.t-ipconnect.de ([93.224.191.11] helo=nanos.tec.linutronix.de) by Galois.linutronix.de with esmtpsa (TLS1.2:DHE_RSA_AES_256_CBC_SHA256:256) (Exim 4.80) (envelope-from ) id 1jJwbh-00013E-B2; Thu, 02 Apr 2020 11:57:53 +0200 Received: by nanos.tec.linutronix.de (Postfix, from userid 1000) id D2C75100D52; Thu, 2 Apr 2020 11:57:52 +0200 (CEST) From: Thomas Gleixner To: Corentin Labbe , qemu-discuss@nongnu.org, mingo@redhat.com, bp@alien8.de, hpa@zytor.com, x86@kernel.org Cc: linux-kernel@vger.kernel.org Subject: Re: qemu-x86: kernel panic when host is loaded In-Reply-To: <20200402093132.GA15839@Red> References: <20200402093132.GA15839@Red> Date: Thu, 02 Apr 2020 11:57:52 +0200 Message-ID: <87eet6nra7.fsf@nanos.tec.linutronix.de> MIME-Version: 1.0 Content-Type: text/plain X-Linutronix-Spam-Score: -1.0 X-Linutronix-Spam-Level: - X-Linutronix-Spam-Status: No , -1.0 points, 5.0 required, ALL_TRUSTED=-1,SHORTCIRCUIT=-0.0001 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Corentin, Corentin Labbe writes: > On our kernelci lab, each qemu worker pass an healtcheck job each day and after each job failure, so it is heavily used. > The healtcheck job is a Linux boot with a stable release. > > Since we upgraded our worker to buster, the qemu x86_64 healthcheck randomly panic with: > <0>[ 0.009000] Kernel panic - not syncing: IO-APIC + timer doesn't work! Boot with apic=debug and send a report. Then try booting with the 'noapic' option. > > After some test I found the source of this kernel panic, the host is > loaded and qemu run "slower". Simply renicing all qemu removed this > behavour. > > So now what can I do ? > Appart renicing qemu process, does something could be done ? As the qemu timer/ioapic routing is actually sane, you might try to add "no_timer_check" to the kernel command line. Thanks, tglx