Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp2683869pxj; Sun, 6 Jun 2021 10:13:31 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwoJmhGkyqITNeW+xCxSzCqWXLQL4F+KGiTAYuilQGrMscoF+YNDj9zyRxdVE9n/sIECDMM X-Received: by 2002:aa7:c2c7:: with SMTP id m7mr16177112edp.156.1622999611350; Sun, 06 Jun 2021 10:13:31 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1622999611; cv=none; d=google.com; s=arc-20160816; b=YkgnJ7G5sZNzZMK3NW2ovIMv7If9d00PWHKND8VpmhtssobTeEA3CrO6eHhkHNv4el 8bn6NN7Do7atsNfppSsmSVh9NAVIsf9rVpggLhOvTkJvG0zEqNL60MnBRSMnSCEzMOgs rggEkyK1DBumbNMangkfFwrbcM9/nA5EQ4FsGr4x9cFjwhK65+qQvzd/Y/LsbxBVoERq icWCUS2pUnnZiMfxsdF482fBbeFsb/sA78vmH2dx4WddDaNh9v44RJhyw/fGC9NEEebW hx3i8llgTtZ1nyg8+wXUEkDT5NK/5qhpRNTlezQmbHz71tek+KuxjzpbiGlz6JhCqIA5 g5+g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=lx8AMOpcbiIl60921q/AwKP0FSvfFkR9JoJ9AUJAYmk=; b=Eu2/SGMOosV5dK4i1BDhjk/dMe3QVpL6dQ1CZ9syFbswisL2aENOuWGLtQ45Wjky/Y N9JPVA0sqaLY8wuw01s82ekB70u2J34IEy4kyHFqYr97zgzd9hOYB/ERIrylWClSGfd5 LKf8Arwbnq/rJ2hQ1wKu6L0nk0mNgnPueU6RgqcjC74qTVy1/Z3bwPxBFQmwH0isZPuG 1keAMnPo/XD4lMa+hol2PtutZRXEkysOWMHijmGZl23tTjm9i7RsVwrF9rA0Ic1LVlFi lwPa14pBvKt5litRRAWg2zZ1YIdgTcWwbDaZR7xlxyFCLodDDmuoM57kzyfFMw4290bs +uUA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=obfGnxAB; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id t12si12591583edc.179.2021.06.06.10.13.07; Sun, 06 Jun 2021 10:13:31 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=obfGnxAB; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229738AbhFFRNu (ORCPT + 99 others); Sun, 6 Jun 2021 13:13:50 -0400 Received: from mail.kernel.org ([198.145.29.99]:58110 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229445AbhFFRNr (ORCPT ); Sun, 6 Jun 2021 13:13:47 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id 7CF7C6141E; Sun, 6 Jun 2021 17:11:57 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1622999517; bh=ek88S5MJp2DWQgPYMwL1saaDEGtd79UyfF/zL5Bxla4=; h=References:In-Reply-To:From:Date:Subject:To:Cc:From; b=obfGnxAB2LWikAkaOYsHqqvCUAfOOGRyAiqn1U/UWekBYp6seMPPjH+1ZAd8Hv+Oy Lg2+rSMI+6ZwlmbNm19s64vBuMcvG/RjVb/AUf4sHc7swrgiYtEE1fRcRuk2D8cE78 H2GqLqD+QPH4qIE5t0/nNtX0D0vIDJmzc5D8sKQ4L/7/32XwFk9AHt6Y9tBXjqzetc ZsaddXEjxhin2CloAYEk4WdrXPxkF8sJ9y2UbXlietIcz0KFWpFnkNDR/jHXrnxFNO eY+s8n7ovc1wROASM4UfPZE9P2KaAUS0/xv8RMKeOv6gPcNY3dyK6cyONtyNzyKmyI XmiSmYZEWMcgQ== Received: by mail-lj1-f176.google.com with SMTP id d2so14530385ljj.11; Sun, 06 Jun 2021 10:11:57 -0700 (PDT) X-Gm-Message-State: AOAM532rD+R58lIJcV6kQKhtTp+tDeIkokl1LAyuYOlCpI6MF6ECsSp3 rgNE/UJ0ghGsEpaw/yUhVjw6NkpMRlrvM4KeE+w= X-Received: by 2002:a2e:3506:: with SMTP id z6mr12308612ljz.238.1622999515725; Sun, 06 Jun 2021 10:11:55 -0700 (PDT) MIME-Version: 1.0 References: In-Reply-To: From: Guo Ren Date: Mon, 7 Jun 2021 01:11:44 +0800 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [PATCH RFC 0/3] riscv: Add DMA_COHERENT support To: Anup Patel , Atish Patra Cc: Palmer Dabbelt , "anup@brainfault.org" , "drew@beagleboard.org" , Christoph Hellwig , "wefu@redhat.com" , "lazyparser@gmail.com" , "linux-riscv@lists.infradead.org" , "linux-kernel@vger.kernel.org" , "linux-arch@vger.kernel.org" , "linux-sunxi@lists.linux.dev" , "guoren@linux.alibaba.com" , Paul Walmsley Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Anup and Atish, On Thu, Jun 3, 2021 at 2:00 PM Anup Patel wrote: > > > > > -----Original Message----- > > From: Palmer Dabbelt > > Sent: 03 June 2021 09:43 > > To: guoren@kernel.org > > Cc: anup@brainfault.org; drew@beagleboard.org; Christoph Hellwig > > ; Anup Patel ; wefu@redhat.com; > > lazyparser@gmail.com; linux-riscv@lists.infradead.org; linux- > > kernel@vger.kernel.org; linux-arch@vger.kernel.org; linux- > > sunxi@lists.linux.dev; guoren@linux.alibaba.com; Paul Walmsley > > > > Subject: Re: [PATCH RFC 0/3] riscv: Add DMA_COHERENT support > > > > On Sat, 29 May 2021 17:30:18 PDT (-0700), Palmer Dabbelt wrote: > > > On Fri, 21 May 2021 17:36:08 PDT (-0700), guoren@kernel.org wrote: > > >> On Wed, May 19, 2021 at 3:15 PM Anup Patel > > wrote: > > >>> > > >>> On Wed, May 19, 2021 at 12:24 PM Drew Fustini > > wrote: > > >>> > > > >>> > On Wed, May 19, 2021 at 08:06:17AM +0200, Christoph Hellwig > > wrote: > > >>> > > On Wed, May 19, 2021 at 02:05:00PM +0800, Guo Ren wrote: > > >>> > > > Since the existing RISC-V ISA cannot solve this problem, it is > > >>> > > > better to provide some configuration for the SOC vendor to > > customize. > > >>> > > > > >>> > > We've been talking about this problem for close to five years. > > >>> > > So no, if you don't manage to get the feature into the ISA it > > >>> > > can't be supported. > > >>> > > > >>> > Isn't it a good goal for Linux to support the capabilities present > > >>> > in the SoC that a currently being fab'd? > > >>> > > > >>> > I believe the CMO group only started last year [1] so the RV64GC > > >>> > SoCs that are going into mass production this year would not have > > >>> > had the opporuntiy of utilizing any RISC-V ISA extension for > > >>> > handling cache management. > > >>> > > >>> The current Linux RISC-V policy is to only accept patches for frozen > > >>> or ratified ISA specs. > > >>> (Refer, Documentation/riscv/patch-acceptance.rst) > > >>> > > >>> This means even if emulate CMO instructions in OpenSBI, the Linux > > >>> patches won't be taken by Palmer because CMO specification is still > > >>> in draft stage. > > >> Before CMO specification release, could we use a sbi_ecall to solve > > >> the current problem? This is not against the specification, when CMO > > >> is ready we could let users choose to use the new CMO in Linux. > > >> > > >> From a tech view, CMO trap emulation is the same as sbi_ecall. > > >> > > >>> > > >>> Also, we all know how much time it takes for RISCV international to > > >>> freeze some spec. Judging by that we are looking at another > > >>> 3-4 years at minimum. > > > > > > Sorry for being slow here, this thread got buried. > > > > > > I've been trying to work with a handful of folks at the RISC-V > > > foundation to try and get a subset of the various in-development > > > specifications (some simple CMOs, something about non-caching in the > > > page tables, and some way to prevent speculative accesse from > > > generating coherence traffic that will break non-coherent systems). > > > I'm not sure we can get this together quickly, but I'd prefer to at > > > least try before we jump to taking vendor-specificed behavior here. > > > It's obviously an up-hill battle to try and get specifications through > > > the process and I'm certainly not going to promise it will work, but > > > I'm hoping that the impending need to avoid forking the ISA will be > > > sufficient to get people behind producing some specifications in a timely > > fashion. > > > > > > I wasn't aware than this chip had non-coherent devices until I saw > > > this thread, so we'd been mostly focused on the Beagle V chip. That > > > was in a sense an easier problem because the SiFive IP in it was never > > > designed to have non-coherent devices so we'd have to make anything > > > work via a series of slow workarounds, which would make emulating the > > > eventually standardized behavior reasonable in terms of performance > > > (ie, everything would be super slow so who really cares). > > > > > > I don't think relying on some sort of SBI call for the CMOs whould be > > > such a performance hit that it would prevent these systems from being > > > viable, but assuming you have reasonable performance on your > > > non-cached accesses then that's probably not going to be viable to > > > trap and emulate. At that point it really just becomes silly to > > > pretend that we're still making things work by emulating the > > > eventually ratified behavior, as anyone who actually tries to use this > > > thing to do IO would need out of tree patches. I'm not sure exactly > > > what the plan is for the page table bits in the specification right > > > now, but if you can give me a pointer to some documentation then I'm > > > happy to try and push for something compatible. > > > > > > If we can't make the process work at the foundation then I'd be > > > strongly in favor of just biting the bullet and starting to take > > > vendor-specific code that's been implemented in hardware and is > > > necessarry to make things work acceptably. That's obviously a > > > sub-optimal solution as it'll lead to a bunch of ISA fragmentation, > > > but at least we'll be able to keep the software stack together. > > > > > > Can you tell us when these will be in the hands of users? That's > > > pretty important here, as I don't want to be blocking real users from > > > having their hardware work. IIRC there were some plans to distribute > > > early boards, but it looks like the foundation got involved and I > > > guess I lost the thread at that point. > > > > > > Sorry this is all such a headache, but hopefully we can get things > > > sorted out. > > > > I talked with some of the RISC-V foundation folks, we're not going to have an > > ISA specification for the non-coherent stuff any time soon. I took a look at > > this code and I definately don't want to take it as is, but I'm not opposed to > > taking something that makes the hardware work as long as it's a lot cleaner. > > We've already got two of these non-coherent chips, I'm sure more will come, > > and I'd rather have the extra headaches than make everyone fork the software > > stack. > > Thanks for confirming. The CMO extension is still in early stages so it will > certainly take more time for them. After CMO extension is finalized, it will > take some more time to have actual RISC-V platforms with CMO implemented. > > > > > After talking to Atish it looks like there's likely to be an SBI extension to > > handle the CMOs, which should let us avoid the bulk of the vendor-specific > > behavior in the kernel. I know some people are worried about adding to the > > SBI surface. I'm worried about that too, but that's way better than sticking a > > bunch of vendor-specific instructions into the kernel. The SBI extension > > should make for a straight-forward cache flush implementation in Linux, so > > let's just plan on that getting through quickly (as has been done before). > > Yes, I agree. We can have just a single SBI call which is meant for DMA sync > purpose only which means it will flush/invalidate pages from all cache > levels irrespective of the cache hierarchy (i.e. flush/invalidate to RAM). The > CMO extension might more generic cache operations which can target > any cache level. > > I am already preparing a write-up for SBI DMA sync call in SBI spec. I will > share it with you separately as well. > > > > > Unfortunately we've yet to come up with a way to handle the non-cacheable > > mappings without introducing a degree of vendor-specific behavior or > > seriously impacting performance (mark them as not valid and deal with them > > in the trap handler). I'm not really sure it counts as supporting the hardware > > if it's massively slow, so that really leaves us with vendor-specific mappings as > > the only option to make these chips work. > > A RISC-V platform can have non-cacheable mappings is following possible > ways: > 1) Fixed physical address range as non-cacheable using PMAs > 2) Custom page table attributes > 3) Svpmbt extension being defined by RVI > > Atish and me both think it is possible to have RISC-V specific DMA ops > implementation which can handle all above case. Atish is already working > on DMA ops implementation for RISC-V. Not only DMA ops, but also icache_sync & __vdso_icache_sync. Please have a look at: https://lore.kernel.org/linux-riscv/1622970249-50770-12-git-send-email-guoren@kernel.org/T/#u > > > > > This implementation, which adds some Kconfig entries that control page table > > bits, definately isn't suitable for upstream. Allowing users to set arbitrary > > page table bits will eventually conflict with the standard, and is just going to > > be a mess. It'll also lead to kernels that are only compatible with specific > > designs, which we're trying very hard to avoid. At a bare minimum we'll need > > some way to detect systems with these page table bits before setting them, > > and some description of what the bits actually do so we can reason about > > them. > > Yes, vendor specific Kconfig options are strict NO NO. We can't give-up the > goal of unified kernel image for all platforms. > > Regards, > Anup -- Best Regards Guo Ren ML: https://lore.kernel.org/linux-csky/