Received: by 2002:a25:1985:0:0:0:0:0 with SMTP id 127csp3646415ybz; Mon, 27 Apr 2020 20:29:52 -0700 (PDT) X-Google-Smtp-Source: APiQypJcsACczs79BLKenT91sHu5VSTe3/Bz9WJZS4ILHLKxMcPWM9HJvFkeTfOzn1iGpp2DKNqz X-Received: by 2002:aa7:cdce:: with SMTP id h14mr20750311edw.51.1588044592620; Mon, 27 Apr 2020 20:29:52 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1588044592; cv=none; d=google.com; s=arc-20160816; b=XW/9AxGejOae2nSWqKhCYzAGu47OLlwpe2jtOKrGCqDAUJNEhPKKYSYYwFsNmZhtIe IxmFWA58gWAs8Tq4LbFNj1vpFaUoQG9pSaXm2ja4Qd0SzLnhnp8T3BdmtGNPDoElQQB9 +NjYhlbtA6lDN7DM8cLLUNdFNwr2PZttu72QS5TuJ2+zNNq0gSSYj07O6On7XEPnjBy4 Pe84TEd6A9ehef1ajiRHYf8f1sHAQhCxjpxjkdwGi8PRjko5NGsmmevFMp4NilGOmu12 /IxCTqzCw1xBwZ0lDT06waXFg7NkbM5MG2jzHyaLENB4s7GQxjDK7ur9j4N5MEWsc9dQ qFeA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:from:subject:mime-version :message-id:date:dkim-signature; bh=u+G4hEDBCMfTTINsEe1rRLbmDctcZioKYhqJKU0a/to=; b=PcXm2GtJb1a3kfNurjBWFtviipbX+8yvLbEVplfpQkUK66D1gPpnF89LMq4TSP5P6S SE65x4sRr4a3YSZJCqyqAIeBojN8b/fxs54Kj6g2k5k/NADrArm8JiZjNnjjFjYfs1NA B993f0Zm18yNZGIi64Vm6L2L/IVT3LYU395rlhWHQayX71k3wCRer5MNlBJaXB1JfRr7 mlmzrwFyGsiZvKXN31uAyGZhQHlgKIt1eGgbRgusMk79tPyWkebE/kD4wWcuajl6oTCi tgt4oNyU7tXtF4pssNL1kPSxhGtDotG+bExKATxjwRPQ/wglVadUvcrQDOMcZVCbuYC0 9KVg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=KS7OZjdt; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id mh4si969087ejb.385.2020.04.27.20.29.29; Mon, 27 Apr 2020 20:29:52 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=KS7OZjdt; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726415AbgD1D2A (ORCPT + 99 others); Mon, 27 Apr 2020 23:28:00 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43198 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726270AbgD1D17 (ORCPT ); Mon, 27 Apr 2020 23:27:59 -0400 Received: from mail-yb1-xb49.google.com (mail-yb1-xb49.google.com [IPv6:2607:f8b0:4864:20::b49]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2A1DAC03C1AC for ; Mon, 27 Apr 2020 20:27:59 -0700 (PDT) Received: by mail-yb1-xb49.google.com with SMTP id y73so23154392ybe.22 for ; Mon, 27 Apr 2020 20:27:59 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=date:message-id:mime-version:subject:from:to:cc; bh=u+G4hEDBCMfTTINsEe1rRLbmDctcZioKYhqJKU0a/to=; b=KS7OZjdt/sKvdslTKy6L/T6sE7YdAFI7rtNbHDh37qyDWlr6BYYTbLkiyvmD0nU0Vg PRRFYjaV7UbuK8P97MQGc1kQlBkhSW5m1ljSw8YwuhdjhF7jXweNqaFbkY9Plz6Wl1j4 osppM2gxoE4Yrgh4n66OgZ3JsEY6l1ff2OMDXWcjzIlhM9kTGJ4VqWohzV1FRAHXNefl ZYwNdvb4LAg5HEbG7ldnRs7JPffDElIHozrCI8WkjzJY+A19nzvQ5AN+xrhkWxSSzafL O46PyHQSSBOenRmN3cTUjh1IJpeaEsSCOrwDvLioHY9vH6+s/1WzYSsDqeo2yVlPtFEk +EUg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:message-id:mime-version:subject:from:to:cc; bh=u+G4hEDBCMfTTINsEe1rRLbmDctcZioKYhqJKU0a/to=; b=I9vgh3Ctl6GCNojIhFMmzQhjL63R0ZYzBFthiY2JPBsyTFY72o/75zZ831+YsSSm86 WXjgKkiGjmb9hKK/Qe3nBg1XRBmBxw+m4x36wusO7pbdXcDYQxoTNzrHwn2J4GTLTdIp MEQuikW+88WyHuMHN1fD0wsfOIWQP69qd/O6KYF9Y5VmjtCHNUI4CulNC78CxEWs+pMs DtRif6vg2YFORzz+6yMKGt5FAUHdkvkpw0xFafdllKQ4Ea8r/Gv6xCDR7coTTWdHjrxw uCCfpVJyAkqOvnYD/ooH0ex8mNXKouBfKix/c9qxjqowM3Z/GEdEL8dQyX7dsssGGB6e dqPA== X-Gm-Message-State: AGi0Pua238blD0w+5KZaIoHMFfk8/KEvEJPwSEp/01H6kr8+914IQsp6 9uNQ1jQH9c8ig8jiBV6ZMM2NXPtI9w== X-Received: by 2002:a25:afd0:: with SMTP id d16mr42901934ybj.441.1588044478270; Mon, 27 Apr 2020 20:27:58 -0700 (PDT) Date: Tue, 28 Apr 2020 05:27:40 +0200 Message-Id: <20200428032745.133556-1-jannh@google.com> Mime-Version: 1.0 X-Mailer: git-send-email 2.26.2.303.gf8c07b1a785-goog Subject: [PATCH 0/5] Fix ELF / FDPIC ELF core dumping, and use mmap_sem properly in there From: Jann Horn To: Andrew Morton Cc: Linus Torvalds , Christoph Hellwig , linux-kernel@vger.kernel.org, linux-mm@kvack.org, linux-fsdevel@vger.kernel.org, Alexander Viro , "Eric W . Biederman" , Oleg Nesterov , Russell King , linux-arm-kernel@lists.infradead.org, Mark Salter , Aurelien Jacquiot , linux-c6x-dev@linux-c6x.org, Yoshinori Sato , Rich Felker , linux-sh@vger.kernel.org Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org At the moment, we have that rather ugly mmget_still_valid() helper to work around : ELF core dumping doesn't take the mmap_sem while traversing the task's VMAs, and if anything (like userfaultfd) then remotely messes with the VMA tree, fireworks ensue. So at the moment we use mmget_still_valid() to bail out in any writers that might be operating on a remote mm's VMAs. With this series, I'm trying to get rid of the need for that as cleanly as possible. In particular, I want to avoid holding the mmap_sem across unbounded sleeps. Patches 1, 2 and 3 are relatively unrelated cleanups in the core dumping code. Patches 4 and 5 implement the main change: Instead of repeatedly accessing the VMA list with sleeps in between, we snapshot it at the start with proper locking, and then later we just use our copy of the VMA list. This ensures that the kernel won't crash, that VMA metadata in the coredump is consistent even in the presence of concurrent modifications, and that any virtual addresses that aren't being concurrently modified have their contents show up in the core dump properly. The disadvantage of this approach is that we need a bit more memory during core dumping for storing metadata about all VMAs. After this series has landed, we should be able to rip out mmget_still_valid(). Testing done so far: - Creating a simple core dump on X86-64 still works. - The created coredump on X86-64 opens in GDB, and both the stack and the exectutable look vaguely plausible. - 32-bit ARM compiles with FDPIC support, both with MMU and !MMU config. I'm CCing some folks from the architectures that use FDPIC in case anyone wants to give this a spin. This series is based on (Christoph Hellwig's "remove set_fs calls from the coredump code v4"). Jann Horn (5): binfmt_elf_fdpic: Stop using dump_emit() on user pointers on !MMU coredump: Fix handling of partial writes in dump_emit() coredump: Refactor page range dumping into common helper binfmt_elf, binfmt_elf_fdpic: Use a VMA list snapshot mm/gup: Take mmap_sem in get_dump_page() fs/binfmt_elf.c | 170 ++++++++++++--------------------------- fs/binfmt_elf_fdpic.c | 106 +++++++++--------------- fs/coredump.c | 102 +++++++++++++++++++++++ include/linux/coredump.h | 12 +++ mm/gup.c | 69 +++++++++------- 5 files changed, 243 insertions(+), 216 deletions(-) base-commit: 6a8b55ed4056ea5559ebe4f6a4b247f627870d4c prerequisite-patch-id: c0a20b414eebc48fe0a8ca570b05de34c7980396 prerequisite-patch-id: 51973b8db0fa4b114e0c3fd8936b634d9d5061c5 prerequisite-patch-id: 0e1e8de282ca6d458dc6cbdc6b6ec5879edd8a05 prerequisite-patch-id: d5ee749c4d3a22ec80bd0dd88aadf89aeb569db8 prerequisite-patch-id: 46ce14e59e98e212a1eca0aef69c6dcdb62b8242 -- 2.26.2.303.gf8c07b1a785-goog