Subject: Re: [RFC PATCH 5/5] mm: introduce MADV_DOEXEC
To: Anthony Yznaga, linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org, linux-mm@kvack.org, linux-arch@vger.kernel.org
Cc: mhocko@kernel.org, tglx@linutronix.de, mingo@redhat.com, bp@alien8.de, x86@kernel.org, hpa@zytor.com, viro@zeniv.linux.org.uk, akpm@linux-foundation.org, arnd@arndb.de, ebiederm@xmission.com, keescook@chromium.org, gerg@linux-m68k.org, christian.brauner@ubuntu.com, peterz@infradead.org, esyr@redhat.com, jgg@ziepe.ca, christian@kellner.me, areber@redhat.com, cyphar@cyphar.com, steven.sistare@oracle.com
References: <1595869887-23307-1-git-send-email-anthony.yznaga@oracle.com> <1595869887-23307-6-git-send-email-anthony.yznaga@oracle.com>
From: Kirill Tkhai
Message-ID: <743a51db-dc27-c49c-9c65-ac164f5283ba@virtuozzo.com>
Date: Tue, 28 Jul 2020 16:22:40 +0300
In-Reply-To:
 <1595869887-23307-6-git-send-email-anthony.yznaga@oracle.com>

On 27.07.2020 20:11, Anthony Yznaga wrote:
> madvise MADV_DOEXEC preserves a memory range across exec.  Initially
> only supported for non-executable, non-stack, anonymous memory.
> MADV_DONTEXEC reverts the effect of a previous MADV_DOEXEC call and
> undoes the preservation of the range.  After a successful exec call,
> the behavior of all ranges reverts to MADV_DONTEXEC.
>
> Signed-off-by: Steve Sistare
> Signed-off-by: Anthony Yznaga
> ---
>  include/uapi/asm-generic/mman-common.h |  3 +++
>  mm/madvise.c                           | 25 +++++++++++++++++++++++++
>  2 files changed, 28 insertions(+)
>
> diff --git a/include/uapi/asm-generic/mman-common.h b/include/uapi/asm-generic/mman-common.h
> index f94f65d429be..7c5f616b28f7 100644
> --- a/include/uapi/asm-generic/mman-common.h
> +++ b/include/uapi/asm-generic/mman-common.h
> @@ -72,6 +72,9 @@
>  #define MADV_COLD	20		/* deactivate these pages */
>  #define MADV_PAGEOUT	21		/* reclaim these pages */
>
> +#define MADV_DOEXEC	22		/* do inherit across exec */
> +#define MADV_DONTEXEC	23		/* don't inherit across exec */
> +
>  /* compatibility flags */
>  #define MAP_FILE	0
>
> diff --git a/mm/madvise.c b/mm/madvise.c
> index dd1d43cf026d..b447fa748649 100644
> --- a/mm/madvise.c
> +++ b/mm/madvise.c
> @@ -103,6 +103,26 @@ static long madvise_behavior(struct vm_area_struct *vma,
>  	case MADV_KEEPONFORK:
>  		new_flags &= ~VM_WIPEONFORK;
>  		break;
> +	case MADV_DOEXEC:

To me, MADV_KEEPONEXEC sounds better, as it is symmetric with MADV_KEEPONFORK.

> +		/*
> +		 * MADV_DOEXEC is only supported on private, non-executable,
> +		 * non-stack anonymous memory and if the VM_EXEC_KEEP flag
> +		 * is available.
> +		 */
> +		if (!VM_EXEC_KEEP || vma->vm_file || vma->vm_flags & (VM_EXEC|VM_SHARED|VM_STACK)) {
> +			error = -EINVAL;
> +			goto out;
> +		}
> +		new_flags |= (new_flags & ~VM_MAYEXEC) | VM_EXEC_KEEP;
> +		break;
> +	case MADV_DONTEXEC:
> +		if (!VM_EXEC_KEEP) {
> +			error = -EINVAL;
> +			goto out;
> +		}
> +		if (new_flags & VM_EXEC_KEEP)
> +			new_flags |= (new_flags & ~VM_EXEC_KEEP) | VM_MAYEXEC;
> +		break;
>  	case MADV_DONTDUMP:
>  		new_flags |= VM_DONTDUMP;
>  		break;
> @@ -983,6 +1003,8 @@ static int madvise_inject_error(int behavior,
>  	case MADV_SOFT_OFFLINE:
>  	case MADV_HWPOISON:
>  #endif
> +	case MADV_DOEXEC:
> +	case MADV_DONTEXEC:
>  		return true;
>
>  	default:
> @@ -1037,6 +1059,9 @@ static int madvise_inject_error(int behavior,
>   *  MADV_DONTDUMP - the application wants to prevent pages in the given range
>   *		from being included in its core dump.
>   *  MADV_DODUMP - cancel MADV_DONTDUMP: no longer exclude from core dump.
> + *  MADV_DOEXEC - On exec, preserve and duplicate this area in the new process
> + *		if the new process allows it.
> + *  MADV_DONTEXEC - Undo the effect of MADV_DOEXEC.
>   *
>   * return values:
>   *  zero    - success
>
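
For readers following the eligibility condition in the quoted MADV_DOEXEC case, it can be restated as a small user-space predicate. This is only a sketch, not kernel code: the helper name doexec_eligible, the file_backed parameter (standing in for a non-NULL vma->vm_file), and the flag bit values below are all assumptions made for illustration. Only the shape of the condition is taken from the patch.

```c
#include <stdbool.h>

/*
 * Placeholder flag bits, chosen for illustration only. They are NOT the
 * kernel's definitions; VM_EXEC_KEEP in particular is defined elsewhere in
 * the quoted series and would evaluate to 0 where the feature is unavailable.
 */
#define VM_EXEC      0x1UL
#define VM_SHARED    0x2UL
#define VM_STACK     0x4UL
#define VM_EXEC_KEEP 0x8UL

/*
 * Hypothetical helper restating the condition from the quoted patch:
 * a range qualifies only if the feature exists and the mapping is
 * anonymous, private, non-executable, and not a stack.
 */
static inline bool doexec_eligible(unsigned long vm_flags, bool file_backed)
{
	if (!VM_EXEC_KEEP)
		return false;	/* feature not available in this build */
	if (file_backed)
		return false;	/* only anonymous memory is supported */
	if (vm_flags & (VM_EXEC | VM_SHARED | VM_STACK))
		return false;	/* executable, shared, or stack mapping */
	return true;
}
```

Under these assumptions, doexec_eligible(0, false) is true, while setting any of VM_EXEC, VM_SHARED, or VM_STACK, or passing file_backed as true, makes it false. That corresponds to the cases where the quoted code returns -EINVAL.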