Received: by 2002:ac0:a5a6:0:0:0:0:0 with SMTP id m35-v6csp909329imm; Thu, 13 Sep 2018 09:32:05 -0700 (PDT) X-Google-Smtp-Source: ANB0VdYsxjWIek3qwYXRmYe0JrvM2pnw8YaMx4YBklhbYJTOKsJmT9ukS0AN7SgojG+7vvgf8/Yc X-Received: by 2002:a65:654d:: with SMTP id a13-v6mr7749978pgw.35.1536856325465; Thu, 13 Sep 2018 09:32:05 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1536856325; cv=none; d=google.com; s=arc-20160816; b=lJQCaVoCBd93Q0IX1yM1KNvFTbyDr6ZJGC5FLHZujuZKnw36eUA1JCZwiuRdTtJZMF w4SUC5lp4PQe9ElOf6+eC5onto/QgbX/gYRxCypbdBYtr7y4ROTXLQ9xwn3G6miQAj2E oxdhB2OFB+fNB/DZCo8XS8QovqI9Vn9WhGjs/JKl41LdOr9i7MaHTHYf0tevEd1o98XT WWn2pBSpZE/7GHpHBeG2Wwqy1FPBmkC6tIiYgqcM4QnneLZZ3DSR99jkXEhmuTCqVCZa D8iHzfvKUsZBRu49B+KPxy/zJtHz61C2Jnhfxh1IkEahJz1vyNqymMbF7JyACFmO8CgW I5eQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=26jAUDPNwS6L9EYvuRKR7dcyEodiSDuD3AHAjd17eOM=; b=zdVmYRwRIqwYpYznoDTlVrqq/zcUriljl5K0PHRaqKDVpwJb3pdxcm8DaGkYN6xPcl ++J6NgCOxdmQhWClQedVt6aLkFTSBHxMMBX7+/pXKRq4J8L6jMEw/0ZScM1cbULMNaaQ nw5Y03nMKI4aKwelizs27AHUPmrhlDn4mKdmFTDUj6AywHTCcUyuAdKrRaOeQo9V9HPa uOoOvjY5rPbCoB922nakQ2yspVK8kKStkYshJtxXKheEhgG4JWaxm+AsYlJRU3yOAuIo iJ9wqHPjt+6JvB8IhZl5JRqWWCeBmzHQB77xItN6MU0Ad3LvSUYOS641tAKCSELazCNT 2h5g== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id o18-v6si4406466pll.344.2018.09.13.09.31.47; Thu, 13 Sep 2018 09:32:05 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727721AbeIMVl4 (ORCPT + 99 others); Thu, 13 Sep 2018 17:41:56 -0400 Received: from mga18.intel.com ([134.134.136.126]:52156 "EHLO mga18.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726912AbeIMVlr (ORCPT ); Thu, 13 Sep 2018 17:41:47 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga005.fm.intel.com ([10.253.24.32]) by orsmga106.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 13 Sep 2018 09:31:32 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.53,369,1531810800"; d="scan'208";a="262337122" Received: from yoojae-mobl1.amr.corp.intel.com (HELO [10.7.153.143]) ([10.7.153.143]) by fmsmga005.fm.intel.com with ESMTP; 13 Sep 2018 09:31:30 -0700 Subject: Re: [PATCH i2c-next v6] i2c: aspeed: Handle master/slave combined irq events properly To: =?UTF-8?Q?C=c3=a9dric_Le_Goater?= , Guenter Roeck Cc: Joel Stanley , linux-aspeed@lists.ozlabs.org, Vernon Mauery , OpenBMC Maillist , Brendan Higgins , Linux Kernel Mailing List , linux-i2c@vger.kernel.org, jarkko.nikula@linux.intel.com, Linux ARM , James Feist References: <20180823225731.19063-1-jae.hyun.yoo@linux.intel.com> <20180911183734.GA21976@roeck-us.net> <1f34fe8c-69ef-5f2d-25dc-d5f6037cc558@linux.intel.com> <20180911204107.GA26017@roeck-us.net> <20180911233302.GA18799@roeck-us.net> <5698ca34-14c9-8d05-c4e6-5acf85ff9d14@linux.intel.com> <20180912013449.GA12612@roeck-us.net> <7fd98646-fb5a-be4d-ce37-84b74e0fa8b3@linux.intel.com> <285ea914-5407-7fde-036d-95978f95a430@kaod.org> From: Jae Hyun Yoo Message-ID: <7a1706b1-7787-eeff-2295-f6180fe84c6e@linux.intel.com> Date: Thu, 13 Sep 2018 09:31:29 -0700 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <285ea914-5407-7fde-036d-95978f95a430@kaod.org> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Cédric, On 9/12/2018 10:47 PM, Cédric Le Goater wrote: > On 09/12/2018 06:54 PM, Jae Hyun Yoo wrote: >> On 9/11/2018 6:34 PM, Guenter Roeck wrote: >>> On Tue, Sep 11, 2018 at 04:58:44PM -0700, Jae Hyun Yoo wrote: >>>> On 9/11/2018 4:33 PM, Guenter Roeck wrote: >>>>> Looking into the patch, clearing the interrupt status at the end of an >>>>> interrupt handler is always suspicious and tends to result in race >>>>> conditions (because additional interrupts may have arrived while handling >>>>> the existing interrupts, or because interrupt handling itself may trigger >>>>> another interrupt). With that in mind, the following patch fixes the >>>>> problem for me. >>>>> >>>>> Guenter >>>>> >>>>> --- >>>>> >>>>> diff --git a/drivers/i2c/busses/i2c-aspeed.c b/drivers/i2c/busses/i2c-aspeed.c >>>>> index c258c4d9a4c0..c488e6950b7c 100644 >>>>> --- a/drivers/i2c/busses/i2c-aspeed.c >>>>> +++ b/drivers/i2c/busses/i2c-aspeed.c >>>>> @@ -552,6 +552,8 @@ static irqreturn_t aspeed_i2c_bus_irq(int irq, void *dev_id) >>>>>       spin_lock(&bus->lock); >>>>>       irq_received = readl(bus->base + ASPEED_I2C_INTR_STS_REG); >>>>> +    /* Ack all interrupt bits. */ >>>>> +    writel(irq_received, bus->base + ASPEED_I2C_INTR_STS_REG); >>>>>       irq_remaining = irq_received; >>>>>   #if IS_ENABLED(CONFIG_I2C_SLAVE) >>>>> @@ -584,8 +586,6 @@ static irqreturn_t aspeed_i2c_bus_irq(int irq, void *dev_id) >>>>>               "irq handled != irq. expected 0x%08x, but was 0x%08x\n", >>>>>               irq_received, irq_handled); >>>>> -    /* Ack all interrupt bits. */ >>>>> -    writel(irq_received, bus->base + ASPEED_I2C_INTR_STS_REG); >>>>>       spin_unlock(&bus->lock); >>>>>       return irq_remaining ? IRQ_NONE : IRQ_HANDLED; >>>>>   } >>>>> >>>> >>>> My intention of putting the code at the end of interrupt handler was, >>>> to reduce possibility of combined irq calls which is explained in this >>>> patch. But YES, I agree with you. It could make a potential race >>> >>> Hmm, yes, but that doesn't explain why it would make sense to acknowledge >>> the interrupt late. The interrupt ack only means "I am going to handle these >>> interrupts". If additional interrupts arrive while the interrupt handler >>> is active, those will have to be acknowledged separately. >>> >>> Sure, there is a risk that an interrupt arrives while the handler is >>> running, and that it is handled but not acknowledged. That can happen >>> with pretty much all interrupt handlers, and there are mitigations to >>> limit the impact (for example, read the interrupt status register in >>> a loop until no more interrupts are pending). But acknowledging >>> an interrupt that was possibly not handled is always bad idea. >> >> Well, that's generally right but not always. Sometimes that depends on >> hardware and Aspeed I2C is the case. >> >> This is a description from Aspeed AST2500 datasheet: >>   I2CD10 Interrupt Status Register >>   bit 2 Receive Done Interrupt status >>         S/W needs to clear this status bit to allow next data receiving. >> >> It means, driver should hold this bit to prevent transition of hardware >> state machine until the driver handles received data, so the bit should >> be cleared at the end of interrupt handler. >> >> Let me share my test result. Your code change works on 100KHz bus speed >> but doesn't work well on 1MHz bus speed. Investigated that interrupt >> handling is fast enough in 100KHz test but in 1MHz, most of data is >> corrupted because the bit is cleared at the beginning of interrupt >> handler so it allows receiving of the next data but the interrupt >> handler isn't fast enough to read the data buffer on time. I checked >> this problem on BMC-ME channel which ME sends lots of IPMB packets to >> BMC at 1MHz speed. You could simply check the data corruption problem on >> the BMC-ME channel. > > OK. > >> My thought is, the current code is right for real Aspeed I2C hardware. >> It seems that QEMU 3.0 model for witherspoon-bmc doesn't simulate the >> actual Aspeed I2C hardware correctly. > > That might be very well possible yes. it also misses support for the slave > mode and the DMA registers. > Yes, it would be good if qemu's Aspeed I2C model supports slave mode. Since the current linux Aspeed I2C driver supports byte transfer mode only, so DMA transfer mode support in qemu could be considered later. Implementing pool mode and DMA mode for linux Aspeed I2C are in my backlog at this moment. Thanks, Jae > Thanks for the info, > > C. >