Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp485148ybl; Wed, 4 Dec 2019 06:08:24 -0800 (PST) X-Google-Smtp-Source: APXvYqzMN5rl3WIMj7EZAQfWexiTNsyR7DYPyi/StTFYgX5fjn0fHuAiK8cnoTAoGS75Dn7Cwb7Y X-Received: by 2002:a05:6830:1042:: with SMTP id b2mr2531481otp.306.1575468503917; Wed, 04 Dec 2019 06:08:23 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1575468503; cv=none; d=google.com; s=arc-20160816; b=u6c98WFDMm/cOHtoilHx9d4MknWgARVtNSmHzbzUK700ZlCKYkED1cHhaS9AJt2az/ XEEM2/KGT/RMco+UAnh+bYmtLiEeZxfibBr0kJFxtKdwpnUGz8rowD3yHfQ2o/a1eaV3 fbOa5a6bWiOlrzmi6vsLvANQBwjemU9erHJjSEabnp3ZvyAP5J5ezRJpuBZ0C2DMTM5H ajKhikS2mSE2KYs61QKUJja8SuBwOrg6gutTXS/r1+swkTcfsTDeadsGeVQCuBa/V1hC pCylTahYERA0nkmrWtDeUeQvbWM3rap5X7gUi6TygHFytweZmh716uYobkz8Tparmz9Q Rijg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=rXWbUltUyRr4KUrDMqmF7ICeKlKH7dTCVQWMGX1tQqY=; b=I4iwEe0CNElGZOaTq8htEkL8eKnYHQpxmkz87yy4WY76OjLI7o3c55NHL24ksp7qWr sAJ2esO5LSErHLLMQ0FrWoLkt9As6ZWE7oMMvmVw+Yq0J0gZJ5PCxlhdwBBvmShyRU+8 RaU/HychRMOgZhGQif5ezVvoCDqT36LY9Cs0Yy0TKjFANN21pw803wVLWjBSgXPShm8C PL65zkw5eadL6HNt/+aO2STzuYeVL+rmQNdW1xQNhpllVoamx4xnT/lrhDgq/sVmAmFh SyuyCVyPcrn0TMANGxQZoHKjBI43Qp2g1c2/SO5hi9U5IkEaAryqbXeUrO36fKjSc5V2 ZDYQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id o12si3279837otk.323.2019.12.04.06.08.00; Wed, 04 Dec 2019 06:08:23 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727948AbfLDOHC (ORCPT + 99 others); Wed, 4 Dec 2019 09:07:02 -0500 Received: from foss.arm.com ([217.140.110.172]:56326 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727828AbfLDOHC (ORCPT ); Wed, 4 Dec 2019 09:07:02 -0500 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 8F2EA1FB; Wed, 4 Dec 2019 06:07:01 -0800 (PST) Received: from [10.1.194.37] (e113632-lin.cambridge.arm.com [10.1.194.37]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 57EBE3F68E; Wed, 4 Dec 2019 06:07:00 -0800 (PST) Subject: Re: Null pointer crash at find_idlest_group on db845c w/ linus/master To: Vincent Guittot Cc: Qais Yousef , John Stultz , Quentin Perret , Peter Zijlstra , Dietmar Eggemann , Juri Lelli , Patrick Bellasi , Ingo Molnar , lkml References: <20191204094216.u7yld5r3zelp22lf@e107158-lin.cambridge.arm.com> <20191204100925.GA15727@linaro.org> <629cca09-dde7-5d77-42e1-c68f2c1820d2@arm.com> From: Valentin Schneider Message-ID: Date: Wed, 4 Dec 2019 14:06:59 +0000 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.9.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 04/12/2019 12:08, Vincent Guittot wrote: >> Also; does it really have to involve an affinity "race"? AFAIU affinity >> could have been changed a while back, but the waking CPU isn't allowed >> so we skip the local_group (in simpler cases where each CPU is a group). > > In fact, this will depend of the uninitialized values of local_sgs. I > have been able to reproduce the situation where we skip local group > but not to trigger the crash because the values already in the stack > don't trigger the misfit comparison. > One more thing, DB845 has a single DynamIQ cluster that is represented as a flat hierarchy (unlike regular big.LITTLE, see arch/arm64/boot/dts/qcom/sdm845.dtsi) so we'll just have a single MC level with groups being individual CPUs, making the bug easier to reproduce (than on regular big.LITTLE, that is). > I wait for John feedback to confirm that this fix his problem and > will send a clean version of the patch >> >>