Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752094AbdFNLfg (ORCPT ); Wed, 14 Jun 2017 07:35:36 -0400 Received: from szxga02-in.huawei.com ([45.249.212.188]:7831 "EHLO szxga02-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750756AbdFNLff (ORCPT ); Wed, 14 Jun 2017 07:35:35 -0400 Subject: Re: [PATCH v8 6/9] drivers: perf: hisi: Add support for Hisilicon Djtag driver To: Will Deacon , Mark Rutland References: <1495457312-237127-1-git-send-email-zhangshaokun@hisilicon.com> <20170608163519.GA19643@leverpostej> <8666a0fa-126d-e4a3-ac4b-7962f5d79942@huawei.com> <20170609143050.GM13955@arm.com> <0fbf57f0-9ff7-4fd4-07c7-c5e86028a7d2@huawei.com> <20170614100658.GE16190@arm.com> <20170614104230.GC6085@leverpostej> <20170614110141.GL16190@arm.com> CC: Shaokun Zhang , , , , , , , , , , , , , From: John Garry Message-ID: <53af9b5b-ac93-eaf9-8551-75fb25a243aa@huawei.com> Date: Wed, 14 Jun 2017 12:35:07 +0100 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:45.0) Gecko/20100101 Thunderbird/45.3.0 MIME-Version: 1.0 In-Reply-To: <20170614110141.GL16190@arm.com> Content-Type: text/plain; charset="windows-1252"; format=flowed Content-Transfer-Encoding: 7bit X-Originating-IP: [10.203.181.153] X-CFilter-Loop: Reflected X-Mirapoint-Virus-RAPID-Raw: score=unknown(0), refid=str=0001.0A090204.59411F80.00DA,ss=1,re=0.000,recu=0.000,reip=0.000,cl=1,cld=1,fgs=0, ip=0.0.0.0, so=2014-11-16 11:51:01, dmn=2013-03-21 17:37:32 X-Mirapoint-Loop-Id: 792f63f512f873eb6419b9634a451d4e Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1917 Lines: 72 On 14/06/2017 12:01, Will Deacon wrote: > On Wed, Jun 14, 2017 at 11:42:30AM +0100, Mark Rutland wrote: >> On Wed, Jun 14, 2017 at 11:06:58AM +0100, Will Deacon wrote: >>> Apologies, I misunderstood your algorithm (I thought step (a) was on one CPU >>> and step (b) was on another). Still, I don't understand the need for the >>> timeout. If you instead read back the flag immediately, wouldn't it still >>> work? e.g. >>> >>> >>> lock: >>> Readl_relaxed flag >>> if (locked) >>> goto lock; >>> >>> Writel_relaxed unique ID to flag >>> Readl flag >>> if (locked by somebody else) >>> goto lock; >>> >>> >>> >>> unlock: >>> Writel unlocked value to flag >>> >>> >>> Given that we're dealing with iomem, I think it will work, but I could be >>> missing something obvious. >> >> Don't we have the race below where both threads can enter the critical >> section? >> >> // flag f initial zero (unlocked) >> >> // t1, flag 1 // t2, flag 2 >> readl(f); // reads 0 l = readl(f); // reads 0 >> >> >> >> writel(1, f); >> readl(f); // reads 1 >> >> writel(2, f); >> readl(f) // reads 2 >> >> >> > > Urgh, yeah, of course and *that's* what the udelay is trying to avoid, > by "ensuring" that the time and subsequent write > propagation is all over before we re-read the flag. > > John -- how much space do you have on this device? Do you have, e.g. a byte > for each CPU? Hi Will, To be clear, the agents in our case are the kernel and UEFI. Within the kernel, we use a kernel spinlock to lock the same djtag between threads, for these reasons: - kernel has a native spinlock - we are limited in locking values, as the lock flag is only a 8b field in v2 hw (called module select) Thanks, John > > Will > > . >