Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752726Ab1EHUMI (ORCPT ); Sun, 8 May 2011 16:12:08 -0400 Received: from smtp-out.google.com ([216.239.44.51]:52816 "EHLO smtp-out.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751634Ab1EHUMF (ORCPT ); Sun, 8 May 2011 16:12:05 -0400 DomainKey-Signature: a=rsa-sha1; c=nofws; d=google.com; s=beta; h=date:from:x-x-sender:to:cc:subject:in-reply-to:message-id :references:user-agent:mime-version:content-type; b=kbvocFbliXTvNJRzd/X2vZk1dakM8gUXX0DmmGOeSKXnms5LpfljXEzLq4gFr5AhLx 1RJrSJn6Np4GKOdQgGZQ== Date: Sun, 8 May 2011 13:12:01 -0700 (PDT) From: Hugh Dickins X-X-Sender: hugh@sister.anvils To: Mikulas Patocka cc: linux-kernel@vger.kernel.org, linux-parisc@vger.kernel.org, Linus Torvalds , Michel Lespinasse , Oleg Nesterov Subject: Re: [PATCH] Don't mlock guardpage if the stack is growing up In-Reply-To: Message-ID: References: User-Agent: Alpine 2.00 (LSU 1167 2008-08-23) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-System-Of-Record: true Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3842 Lines: 100 On Sun, 8 May 2011, Mikulas Patocka wrote: > Don't mlock guardpage if the stack is growing up > > Linux kernel excludes guard page when performing mlock on a VMA with > down-growing stack. However, some architectures have up-growing stack and > locking the guard page should be excluded in this case too. > > This patch fixes lvm2 on PA-RISC (and possibly other architectures with > up-growing stack). lvm2 calculates the number of used pages when locking > and when unlocking and reports an internal error if the numbers mismatch. > On PA-RISC, the kernel would incorrectly attempt to mlock the stack guard > page, this causes allocation of one more page and internal error in lvm2. > > Signed-off-by: Mikulas Patocka Interesting, I'd convinced myself that the growsup case was safe, because of how we always approach the vma from its bottom end. I've added Michel to the Cc, he's the one with the best grasp here. Could you point us to where lvm2 is making these calculations? I don't understand quite what it's doing. Thanks, Hugh > > --- > include/linux/mm.h | 6 ++++++ > mm/memory.c | 21 ++++++++++++--------- > 2 files changed, 18 insertions(+), 9 deletions(-) > > Index: linux-2.6.39-rc6-fast/include/linux/mm.h > =================================================================== > --- linux-2.6.39-rc6-fast.orig/include/linux/mm.h 2011-05-07 05:59:51.000000000 +0200 > +++ linux-2.6.39-rc6-fast/include/linux/mm.h 2011-05-07 05:59:52.000000000 +0200 > @@ -1016,6 +1016,12 @@ static inline int vma_stack_continue(str > return vma && (vma->vm_end == addr) && (vma->vm_flags & VM_GROWSDOWN); > } > > +/* Is the vma a continuation of the stack vma below it? */ > +static inline int vma_stack_growsup_continue(struct vm_area_struct *vma, unsigned long addr) > +{ > + return vma && (vma->vm_start == addr) && (vma->vm_flags & VM_GROWSUP); > +} > + > extern unsigned long move_page_tables(struct vm_area_struct *vma, > unsigned long old_addr, struct vm_area_struct *new_vma, > unsigned long new_addr, unsigned long len); > Index: linux-2.6.39-rc6-fast/mm/memory.c > =================================================================== > --- linux-2.6.39-rc6-fast.orig/mm/memory.c 2011-05-07 05:59:51.000000000 +0200 > +++ linux-2.6.39-rc6-fast/mm/memory.c 2011-05-07 05:59:52.000000000 +0200 > @@ -1412,9 +1412,12 @@ no_page_table: > > static inline int stack_guard_page(struct vm_area_struct *vma, unsigned long addr) > { > - return (vma->vm_flags & VM_GROWSDOWN) && > + return ((vma->vm_flags & VM_GROWSDOWN) && > (vma->vm_start == addr) && > - !vma_stack_continue(vma->vm_prev, addr); > + !vma_stack_continue(vma->vm_prev, addr)) || > + ((vma->vm_flags & VM_GROWSUP) && > + (vma->vm_end == addr + PAGE_SIZE) && > + !vma_stack_growsup_continue(vma->vm_next, addr + PAGE_SIZE)); > } > > /** > @@ -1551,18 +1554,18 @@ int __get_user_pages(struct task_struct > continue; > } > > - /* > - * If we don't actually want the page itself, > - * and it's the stack guard page, just skip it. > - */ > - if (!pages && stack_guard_page(vma, start)) > - goto next_page; > - > do { > struct page *page; > unsigned int foll_flags = gup_flags; > > /* > + * If we don't actually want the page itself, > + * and it's the stack guard page, just skip it. > + */ > + if (!pages && stack_guard_page(vma, start)) > + goto next_page; > + > + /* > * If we have a pending SIGKILL, don't keep faulting > * pages and potentially allocating memory. > */ -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/