Received: by 10.223.176.46 with SMTP id f43csp784826wra; Fri, 19 Jan 2018 01:46:42 -0800 (PST) X-Google-Smtp-Source: ACJfBosP8xwKA5ZXq48x2lQEac3+CekY4BjJopxK7S7CmrsBi+N5QpPa+qJ2VVb+UQqxstRYEfjR X-Received: by 10.98.158.89 with SMTP id s86mr26312182pfd.203.1516355202526; Fri, 19 Jan 2018 01:46:42 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1516355202; cv=none; d=google.com; s=arc-20160816; b=dqQDXuVzTcy7hGlyd7gxcoibG27j75VzkVPIX6YDUZRv+M5zFx/s6XEwg/ciPjYp2y xqvdbe6Voj0B0mCur6LlTY0uOiVuuQPN5xkUnB6P5CrCiU6zFrbERsJ7JYjcQ4Y6NgRw G382eUDjMAwf9u/KHg8B1Dhf0hywtJ3FXWr+LxK6roBUuGOd8dhaanU6Z4l5gQMhupYY g47i76Ymzc1asq/K7u04wC8HJmSEmOJ8uzC9UgnXc6JQQQgODvzgYFEHTNNXZ4clQbVy Ey3uw7CrKTyGwR0zWybA6lQrIMopRu/TXY7He5KUGZJLTDBA43YdQWThVoY/el1vC3Tq +l9g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:arc-authentication-results; bh=dcR23QShl7ZBMVvMr/iM2uqbGdiOxS95nanjwDshM0E=; b=vyAP9NfNi4kpnMJo7SLlu/uDsyRL7JYBY7d4+4ksz2UdbhxznoFi5GnHwA7B7JF7Xo TWLcyULZyNazpWUKpRHGxXwVyJ2774w1Il0aPzwY0300OZsKSQ28nPIwFHEivjjK94F5 wT0dqGygJA1oGH4WSYHLfqZusfuoiH83SEkxVeASIvJhR1APdE81HE1iDp68jGNm3VzX VDlrwqti6iugM+8OTQIYAJUjLgzWlLdplbz2UiU6YajG096kkLYnpWZX2maIDSgCf7Ft M0GRm24F2xlAmNh9FqUy+S8pKyTTQAID5sp8zGK92wkrw+hZn+8q8DAhg+GEJbmJPR0z tAgg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id a13si4830751pgt.663.2018.01.19.01.46.26; Fri, 19 Jan 2018 01:46:42 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754930AbeASJpo (ORCPT + 99 others); Fri, 19 Jan 2018 04:45:44 -0500 Received: from pegase1.c-s.fr ([93.17.236.30]:63779 "EHLO pegase1.c-s.fr" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750923AbeASJph (ORCPT ); Fri, 19 Jan 2018 04:45:37 -0500 Received: from localhost (mailhub1-int [192.168.12.234]) by localhost (Postfix) with ESMTP id 3zNGCP0Q9nz9tvr3; Fri, 19 Jan 2018 10:45:21 +0100 (CET) X-Virus-Scanned: Debian amavisd-new at c-s.fr Received: from pegase1.c-s.fr ([192.168.12.234]) by localhost (pegase1.c-s.fr [192.168.12.234]) (amavisd-new, port 10024) with ESMTP id BLtGrkHtr13o; Fri, 19 Jan 2018 10:45:20 +0100 (CET) Received: from messagerie.si.c-s.fr (messagerie.si.c-s.fr [192.168.25.192]) by pegase1.c-s.fr (Postfix) with ESMTP id 3zNGCN6kVdz9tvr2; Fri, 19 Jan 2018 10:45:20 +0100 (CET) Received: from localhost (localhost [127.0.0.1]) by messagerie.si.c-s.fr (Postfix) with ESMTP id 296A18BBE9; Fri, 19 Jan 2018 10:45:36 +0100 (CET) X-Virus-Scanned: amavisd-new at c-s.fr Received: from messagerie.si.c-s.fr ([127.0.0.1]) by localhost (messagerie.si.c-s.fr [127.0.0.1]) (amavisd-new, port 10023) with ESMTP id WAKCP16atDFV; Fri, 19 Jan 2018 10:45:36 +0100 (CET) Received: from PO15451 (po15451.idsi0.si.c-s.fr [172.25.231.40]) by messagerie.si.c-s.fr (Postfix) with ESMTP id DEBBC8BBE8; Fri, 19 Jan 2018 10:45:35 +0100 (CET) Subject: Re: [PATCH v2 1/5] powerpc/mm: Enhance 'slice' for supporting PPC32 To: "Aneesh Kumar K.V" , Benjamin Herrenschmidt , Paul Mackerras , Michael Ellerman , Scott Wood Cc: linuxppc-dev@lists.ozlabs.org, linux-kernel@vger.kernel.org References: <49148d07955d3e5f963cedf9adcfcc37c3e03ef4.1516179904.git.christophe.leroy@c-s.fr> <87vafyz265.fsf@linux.vnet.ibm.com> <84dc1df4-db2f-be11-c1f3-5dddd1e44983@c-s.fr> <28c3ba39-ef31-5ff3-7672-3e9d1942be94@c-s.fr> From: Christophe LEROY Message-ID: Date: Fri, 19 Jan 2018 10:45:35 +0100 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.5.2 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: fr Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Le 19/01/2018 à 10:13, Aneesh Kumar K.V a écrit : > > > On 01/19/2018 02:37 PM, Christophe LEROY wrote: >> >> >> Le 19/01/2018 à 10:02, Aneesh Kumar K.V a écrit : >>> >>> >>> On 01/19/2018 02:14 PM, Christophe LEROY wrote: >>>> >>>> >>>> Le 19/01/2018 à 09:24, Aneesh Kumar K.V a écrit : >>>>> Christophe Leroy writes: >>>>> >>>>>> In preparation for the following patch which will fix an issue on >>>>>> the 8xx by re-using the 'slices', this patch enhances the >>>>>> 'slices' implementation to support 32 bits CPUs. >>>>>> >>>>>> On PPC32, the address space is limited to 4Gbytes, hence only the low >>>>>> slices will be used. As of today, the code uses >>>>>> SLICE_LOW_TOP (0x100000000ul) and compares it with addr to determine >>>>>> if addr refers to low or high space. >>>>>> On PPC32, such a (addr < SLICE_LOW_TOP) test is always false because >>>>>> 0x100000000ul degrades to 0. Therefore, the patch modifies >>>>>> SLICE_LOW_TOP to (0xfffffffful) and modifies the tests to >>>>>> (addr <= SLICE_LOW_TOP) which will then always be true on PPC32 >>>>>> as addr has type 'unsigned long' while not modifying the PPC64 >>>>>> behaviour. >>>>>> >>>>>> This patch moves "slices" functions prototypes from page64.h to >>>>>> page.h >>>>>> >>>>>> The high slices use bitmaps. As bitmap functions are not prepared to >>>>>> handling bitmaps of size 0, the bitmap_xxx() calls are wrapped into >>>>>> slice_bitmap_xxx() macros which will take care of the 0 nbits case. >>>>>> >>>>>> Signed-off-by: Christophe Leroy >>>>>> --- >>>>>>   v2: First patch of v1 serie split in two parts ; added >>>>>> slice_bitmap_xxx() macros. >>>>>> >>>>>>   arch/powerpc/include/asm/page.h      | 14 +++++++++ >>>>>>   arch/powerpc/include/asm/page_32.h   | 19 ++++++++++++ >>>>>>   arch/powerpc/include/asm/page_64.h   | 21 ++----------- >>>>>>   arch/powerpc/mm/hash_utils_64.c      |  2 +- >>>>>>   arch/powerpc/mm/mmu_context_nohash.c |  7 +++++ >>>>>>   arch/powerpc/mm/slice.c              | 60 >>>>>> ++++++++++++++++++++++++------------ >>>>>>   6 files changed, 83 insertions(+), 40 deletions(-) >>>>>> >>>>>> diff --git a/arch/powerpc/include/asm/page.h >>>>>> b/arch/powerpc/include/asm/page.h >>>>>> index 8da5d4c1cab2..d0384f9db9eb 100644 >>>>>> --- a/arch/powerpc/include/asm/page.h >>>>>> +++ b/arch/powerpc/include/asm/page.h >>>>>> @@ -342,6 +342,20 @@ typedef struct page *pgtable_t; >>>>>>   #endif >>>>>>   #endif >>>>>> +#ifdef CONFIG_PPC_MM_SLICES >>>>>> +struct mm_struct; >>>>>> + >>>>>> +unsigned long slice_get_unmapped_area(unsigned long addr, >>>>>> unsigned long len, >>>>>> +                      unsigned long flags, unsigned int psize, >>>>>> +                      int topdown); >>>>>> + >>>>>> +unsigned int get_slice_psize(struct mm_struct *mm, unsigned long >>>>>> addr); >>>>>> + >>>>>> +void slice_set_user_psize(struct mm_struct *mm, unsigned int psize); >>>>>> +void slice_set_range_psize(struct mm_struct *mm, unsigned long >>>>>> start, >>>>>> +               unsigned long len, unsigned int psize); >>>>>> +#endif >>>>>> + >>>>> >>>>> Should we do a slice.h ? the way we have other files? and then do >>>> >>>> Yes we could add a slice.h instead of using page.h for that, good idea. >>>> >>>>> >>>>> arch/powerpc/include/asm/book3s/64/slice.h that will carry >>>>> #define slice_bitmap_zero(dst, nbits) \ >>>>>     do { if (nbits) bitmap_zero(dst, nbits); } while (0) >>>>> #define slice_bitmap_set(dst, pos, nbits) \ >>>>> do { if (nbits) bitmap_set(dst, pos, nbits); } while (0) >>>>> #define slice_bitmap_copy(dst, src, nbits) \ >>>>> do { if (nbits) bitmap_copy(dst, src, nbits); } while (0) >>>>> #define slice_bitmap_and(dst, src1, src2, nbits) \ >>>>>     ({ (nbits) ? bitmap_and(dst, src1, src2, nbits) : 0; }) >>>>> #define slice_bitmap_or(dst, src1, src2, nbits) \ >>>>>     do { if (nbits) bitmap_or(dst, src1, src2, nbits); } while (0) >>>>> #define slice_bitmap_andnot(dst, src1, src2, nbits) \ >>>>>     ({ (nbits) ? bitmap_andnot(dst, src1, src2, nbits) : 0; }) >>>>> #define slice_bitmap_equal(src1, src2, nbits) \ >>>>>     ({ (nbits) ? bitmap_equal(src1, src2, nbits) : 1; }) >>>>> #define slice_bitmap_empty(src, nbits) \ >>>>>     ({ (nbits) ? bitmap_empty(src, nbits) : 1; }) >>>>> >>>>> This without that if(nbits) check and a proper static inline so >>>>> that we >>>>> can do type checking. >>>> >>>> Is it really worth duplicating that just for eliminating the 'if >>>> (nbits)' in one case ? >>>> >>>> Only in book3s/64 we will be able to eliminate that, for nohash/32 >>>> we need to keep the test due to the difference between low and high >>>> slices. >>> >>> the other advantage is we move the SLICE_LOW_SHIFT to the right >>> location. IMHO mm subystem is really complex with these really >>> overloaded headers. If we can keep it  seperate we should with >>> minimal code duplication? >> >> For the constants I fully agree with your proposal and I will do it. I >> was only questionning the benefit of moving the slice_bitmap_xxxx() >> stuff, taking into account that the 'if (nbits)' test is already >> eliminated by the compiler. >> > > That is compiler dependent as you are finding with the other patch where > if (0) didn't get compiled out I don't think so. When I had the missing prototype, the compilation goes ok, including the final link. Which means at the end the code is not included since radix_enabled() evaluates to 0. Many many parts of the kernel are based on this assumption. Christophe > > -aneesh