Received: by 10.223.176.5 with SMTP id f5csp3018997wra; Mon, 5 Feb 2018 14:17:58 -0800 (PST) X-Google-Smtp-Source: AH8x227C96n8qXJKBdwG6kufuC6zayOdbOKr6gOzB/dyYyN6ndadvfz6kA6PgEq/Pq/zuLxYWH6v X-Received: by 10.101.101.200 with SMTP id y8mr247631pgv.0.1517869078332; Mon, 05 Feb 2018 14:17:58 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1517869078; cv=none; d=google.com; s=arc-20160816; b=nn/Uofrvxu8HLZPYwsT6/NZdXQDN7TBX4aW3HY23qsw/2b6IGu4KqNZLhuRawCalba kael+vb1DoD5i+dqgXMLK8S/o4UZrRmzpjZLijU5CtV+bKILt9DP9s38he/WsvevSWPo 9nd/W56wIYN7RdkKOxJUoctvSsZ0/7qp5wMj92anDyqwFAtTkzNIH1YZnTyD1qBZjOsC i/Ej3LP7ro1cYoa60f9s1HALgLyud1nrMD29Qs2/8NSiM6X1d/0F0lalYbZC9Fwnneom Zv6WJ4YQMyqPfNJIhW8787B4qmBFVMANs0/iRwEo/EjV4UDr7f5mp7JZozZxZq/VgEEP UG8Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:dkim-signature:user-agent:message-id :references:in-reply-to:subject:cc:to:from:date :content-transfer-encoding:mime-version:arc-authentication-results; bh=vyqyPcUx0Z0bDekybxYw8KI3mQPqaEl1UGmJOUw21O4=; b=XcZstII0+epXQKuRf70xWbzgG2KKOb2tGqgEkxI9GHrn6muS8KdM1JxjiGVGhMLccV 5ORM8vNP0cSN2MAIdEjBgIWR1cBcun27XaBh/PlnxLwE47upQ8tuae4lqyQimpJI+dFe yZD4gDIDFTyKd303TCdmLOVVqNWeZQKCckQSjxgl/TIQ5Zkd5VBfZs5evc/xiCuKg1GA pq2S0TY44TZoxEDEoMVAAOQjOQV96la3QOnFzjQ33+agrcOL6E/4/rDSe8oXtc9JwPLm tI3+KNEI7MNZvAcd0txT4qAUbOc93JUcMPTMrSCEIUUpAC/3pDcu/Xjn3+HXuJr+rcL2 9M1A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@agner.ch header.s=dkim header.b=QNWBuaPp; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id bh11-v6si7437579plb.62.2018.02.05.14.17.44; Mon, 05 Feb 2018 14:17:58 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@agner.ch header.s=dkim header.b=QNWBuaPp; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751913AbeBEWRF (ORCPT + 99 others); Mon, 5 Feb 2018 17:17:05 -0500 Received: from mail.kmu-office.ch ([178.209.48.109]:50626 "EHLO mail.kmu-office.ch" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751175AbeBEWQ7 (ORCPT ); Mon, 5 Feb 2018 17:16:59 -0500 Received: from webmail.kmu-office.ch (unknown [IPv6:2a02:418:6a02::a3]) by mail.kmu-office.ch (Postfix) with ESMTPSA id 813825C0AF7; Mon, 5 Feb 2018 23:09:36 +0100 (CET) MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Date: Mon, 05 Feb 2018 23:16:57 +0100 From: stefan@agner.ch To: Boris Brezillon , shijie.huang@arm.com Cc: han.xu@nxp.com, max.oss.09@gmail.com, richard@nod.at, linux-kernel@vger.kernel.org, marek.vasut@gmail.com, linux-mtd@lists.infradead.org, cyrille.pitchen@wedev4u.fr, dwmw2@infradead.org Subject: Re: [PATCH] mtd: nand: gpmi: fall back to legacy mode if no ECC information present In-Reply-To: <33bcf28ff1c613886c950465160e5a97@agner.ch> References: <20180129144440.13648-1-stefan@agner.ch> <20180130142348.35f1bce3@bbrezillon> <20180131105705.17bbe858@bbrezillon> <33bcf28ff1c613886c950465160e5a97@agner.ch> Message-ID: <4952bf96840ae5b0caba7b8f472e2b1b@agner.ch> X-Sender: stefan@agner.ch User-Agent: Roundcube Webmail/1.3.3 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=agner.ch; s=dkim; t=1517868576; bh=vyqyPcUx0Z0bDekybxYw8KI3mQPqaEl1UGmJOUw21O4=; h=MIME-Version:Content-Type:Content-Transfer-Encoding:Date:From:To:Cc:Subject:In-Reply-To:References:Message-ID; b=QNWBuaPpsvqFAU+VLMP30Q4Y8SbKDZJoCAAeO4FdmRTM32+WGhNszd+maPIua/636m54hjBR6zyy/gT5K2kBW8y/Vn119UY7Pd8vyEHIgnNCsxSG1jSfhrO2mP98jVbjFKEinc9rxb5rNAun4smpT0fS109iwTALxRPkdESu0vo= Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Boris, [Also adding Huang] On 31.01.2018 22:18, stefan@agner.ch wrote: > I accidentally removed ML/cc before, re-adding. > > On 31.01.2018 10:57, Boris Brezillon wrote: >> On Wed, 31 Jan 2018 10:19:05 +0100 >> stefan@agner.ch wrote: >> >>> On 30.01.2018 14:23, Boris Brezillon wrote: >>> > Hi Stefan, >>> > >>> > On Mon, 29 Jan 2018 15:44:40 +0100 >>> > Stefan Agner wrote: >>> > >>> >> In case fsl,use-minimum-ecc is set, the driver tries to determine >>> >> ECC layout by using the ECC information provided by the MTD stack. >>> >> However, in case the NAND chip does not provide any information, >>> >> the driver currently fails with: >>> >> nand: device found, Manufacturer ID: 0xc2, Chip ID: 0xf1 >>> >> nand: Macronix NAND 128MiB 3,3V 8-bit >>> >> nand: 128 MiB, SLC, erase size: 128 KiB, page size: 2048, OOB size: 64 >>> >> gpmi-nand 1806000.gpmi-nand: Error setting BCH geometry : 1 >>> >> gpmi-nand: probe of 1806000.gpmi-nand failed with error 1 >>> >> >>> >> Fall back to implementation specific default mode if no ECC >>> >> information are provided by the NAND chip and fsl,use-minimum-ecc >>> >> is specified. >>> > >>> > Hm, this sounds a bit fragile: if we ever fix the Macronix driver >>> > (which should be done BTW) to set the appropriate ECC requirements, it >>> > will break all platforms that were relying on this 'fall back to legacy >>> > logic'. >>> >>> I see. It is just that downstream behaves that way, hence we sell >>> modules which use minimal ECC on ONFI enabled chips and legacy (maximum >>> ECC which fits into OOB) on modules with non-ONFI chips. >> >> And I guess you use the same DT for both variants of the board :-/ >> > > Actually we only have two SKUs, and they differ also otherwise so I have > two DTs anyway. > >>> >>> Currently we operate the above Macronix chip with 8-bit ECC since quite >>> a while. >> >> Honestly, I don't see a good solution here except adding an extra DT or >> live-patching it from the bootloader, because, even if this hack works >> for you know, it might not in the future. > > Extra DT is fine for Linux. > > The problem is more with U-Boot, where I tried to add minimal ECC > support via Kconfig symbol and align with Linux behavior. For U-Boot I > would really prefer to have a single binary for all SKUs... > > I already sent a first patchset > https://patchwork.ozlabs.org/patch/867180/ > > I guess it should be somehow possible to do a board specific selection > of ECC. But this is a discussion for another thread. > >> >> In the future, if you plan to have boards with different variants of >> NANDs, I recommend that you always maximize ECC, this way you won't >> have this kind of issues. > > Makes sense. Unfortunately, for those products we already ship, changing > would be rather painful. > >> >>> >>> > So, if what you really want is legacy_set_geometry(), don't specify >>> > "fsl,use-minimum-ecc" in your DT and you should be good. Otherwise, fix >>> > the Macronix driver to initialize ->ecc_{strength,step_size}_ds >>> > appropriately. >>> >>> The datasheet says: >>> • High Reliability >>> - Endurance: 100K cycles (with 1-bit ECC per 528-byte) >>> >>> So we would set ecc_strenght to 1? >> >> If the datasheet says so, then yes, you should have >> ->ecc_strength_ds = 1 and ->ecc_step_size_ds = 512. >> >>> But then there is almost no room for >>> wear leveling. I remember that I dumped the fixed bits once on such a >>> chip, and there were several blocks from factory which needed one bit >>> fixed... >> >> Well, that's a different issue. You might want to maximize the ECC >> strength for your specific board. In this case, you should not specify >> "fsl,use-minimum-ecc" in your DT, or, if the driver supports it (but I >> doubt it does), you should add "nand-ecc-maximize". Alternatively, if >> you want to keep some of the OOB space, you can ask for a specific ECC >> config with the "nand-ecc-strength" and "nand-ecc-step-size" properties. > > Different issue, but in the end all I care about: Does wear leveling > work properly. > > The NAND chip documentation also mentions that typical access is per > page (2K), I guess if one uses a single ECC across the complete page > then 4-bits are available, which should allow a somewhat decent wear > leveling. > > I guess we can go with nand-ecc-strength/nand-ecc-step-size for that > chip for now. This seems not to be the case for the driver in question gpmi_nand_init calls: nand_scan_ident -> nand_dt_init (which fills chip->ecc.strength/chip->ecc.size) then gpmi_init_last -> gpmi_set_geometry -> bch_set_geometry -> legacy_set_geometry/set_geometry_by_ecc_info In both cases struct bch_geometry is calculated and overwrites ecc.strength/ecc.size (without considering either of them, set_geometry_by_ecc_info is considering ecc_strength_ds/ecc_step_ds though). I guess we would have to add a third option in case device tree specifies strength/size, and validate whether it can be reasonably fulfilled? E.g. extend common_nfc_set_geometry: int common_nfc_set_geometry(struct gpmi_nand_data *this) { + struct nand_chip *chip = &this->nand; + + if (chip->ecc.strength set && chip->ecc.strength set) + return set_geometry_by_ecc_dt_info(this); + if ((of_property_read_bool(this->dev->of_node, "fsl,use-minimum-ecc")) || legacy_set_geometry(this)) return set_geometry_by_ecc_info(this); return 0; } -- Stefan > > However, in Linux we should at least fix the device tree bindings > documentation for "fsl,use-minimum-ecc" then. > > -- > Stefan