Received: by 2002:ab2:3319:0:b0:1ef:7a0f:c32d with SMTP id i25csp630475lqc; Fri, 8 Mar 2024 07:28:57 -0800 (PST) X-Forwarded-Encrypted: i=3; AJvYcCX+5i9zimqjz8eZDuHhvEGqhILl9wWhQ5Ijo31uOpoqVDp2z/5Ll/A4TByGn25wf6X0YkDfgnB5du9K5qIC2CVYlcwxiyI6yKlaDiDfyg== X-Google-Smtp-Source: AGHT+IFvqrIUxIKK6T/JsLiCgbTwn6eGlhHZPhhLkLQZ7nBax9Mwmt+GROWpAY/Lhy1mku6qBAEs X-Received: by 2002:a05:6102:3a0a:b0:471:e00e:75ce with SMTP id b10-20020a0561023a0a00b00471e00e75cemr13725107vsu.7.1709911737249; Fri, 08 Mar 2024 07:28:57 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1709911737; cv=pass; d=google.com; s=arc-20160816; b=fxVThA61CaZyw/kVx6fxlUsXQY+I6a3mEjJvQ6x1+W9ha6Clga0aJwmwfzXuhj9lvX Zqrkih5qjxvxKU2c+YPaf12tnkwTLUSCVLH/ebLSogx/ueEuRNPmkR74GxewbPaw95q3 a/vIFFK5IzghenmiyXM+HlfCts/TAfBYkQKZn0eEb7+oEkQ8q3htJUBjAzsdRKFqDibb iX3SCC1/dpqtA0dX9vLdNkVW7k84Hdz+lC8QEYtXcIZgcjuCKNFXHvUhNBIS9vu9rm4k rQgrqmvuekFan/W7THhS1I8VADNryGfQe6ZnbRgHMOH5HzWSNQJ7u0MgvI0P5UiP17lZ y+Nw== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:references:in-reply-to:message-id :date:subject:cc:to:from:dkim-signature; bh=8py++85EKHGUxfeo4EvyYDPRKraIlX/Wy1G8feIVn9o=; fh=E8F5/vp/nAcPVJWJwVt5RE8c2HME7kVqVeSMFxe8huE=; b=xojqf0K0kuNmzwyFy+n292Ne096py1+DV2D6wmtxVzyvjl+3QiWpjpoBJLwywnOCBe jDk4humXpbxjbl40AoZ/ISkK1mygtFEdlsrN2Q+efVJdccfo+1OcBojfFrkyygQqrNYw 9D1RF7aiN+btDE9/uTq9ZZommexGV2VkTJAA+PUijcNfPLXQJOC5Xbk2UhEb1WGvQFke M/yjPSJSUNchTljyZ+o8AgRuroO9ejq5o+NQvZma2YF3sagNpbG2THsYiE+WX+34FUnE 7e7z82zvruprYsT1iOqt1tFxeFh+6y3X2gXTWRW6mS5RaTCSWl1zd6Xkd45ovgbTln3J oj1g==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=kO428Zvw; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-kernel+bounces-97231-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-97231-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from ny.mirrors.kernel.org (ny.mirrors.kernel.org. [2604:1380:45d1:ec00::1]) by mx.google.com with ESMTPS id gd10-20020a056102624a00b0047280a46bedsi3548533vsb.248.2024.03.08.07.28.57 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 08 Mar 2024 07:28:57 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-97231-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) client-ip=2604:1380:45d1:ec00::1; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=kO428Zvw; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-kernel+bounces-97231-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:45d1:ec00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-97231-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ny.mirrors.kernel.org (Postfix) with ESMTPS id F270B1C216C7 for ; Fri, 8 Mar 2024 15:28:56 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 83A8E208CE; Fri, 8 Mar 2024 15:28:43 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b="kO428Zvw" Received: from smtp.kernel.org (aws-us-west-2-korg-mail-1.web.codeaurora.org [10.30.226.201]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 9B150200C1; Fri, 8 Mar 2024 15:28:42 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=10.30.226.201 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1709911722; cv=none; b=WQXxmjSzPR7Xg3bsxmkjhha9+cmSApvsBCXRS4qT5t9oa6o6h7oXJp4K0jTHDGWWp7xhhBfHZIIrIJz76fuw6r+SCzLtPiD/TY3NKpuh6mp2gWlUTUoFxAhQhhgi4g928VrK6MiEznYByEK+mzZ3FKQLxf03iQ73tQ509xg28a8= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1709911722; c=relaxed/simple; bh=B47ZjXmKOyAMLhid8Xlwz+oOPX1zHRpOemRv4vSO/iQ=; h=From:To:Cc:Subject:Date:Message-Id:In-Reply-To:References: MIME-Version; b=MR4wpsndwKQYBbWclbzzS8HrQheLNfZ1lLLTMWCVELoWp34nrimk+CbbijXr4Dh2QH2KvSssp+nlJBbdIWv7C/D1OzzVkGvQhQ9uv/zJ8Y0xCNIQZOjg98i4ZnfTk6uOyO0kxjbNDareaIDUFZC0JfCSlHLcAauqtGnQjqz0xlE= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b=kO428Zvw; arc=none smtp.client-ip=10.30.226.201 Received: by smtp.kernel.org (Postfix) with ESMTPSA id 0B746C433C7; Fri, 8 Mar 2024 15:28:39 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1709911722; bh=B47ZjXmKOyAMLhid8Xlwz+oOPX1zHRpOemRv4vSO/iQ=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=kO428Zvwc60S9I5oBgHH1u4aTum7m8PoAjVesiFRRKW8xb1o9xfjmiBrRAu7lOb2t jjU4fYxCpZLzeyiG0Yj7Vk13J23CkZAflFOnaTw7y9JUdDdWIHsiC8AHbPu1EoFJYU Mnlr9u51fLSxQRU/m3SQ0FdMvJrY4UV9IWLtA2ZTKt3wAMATqno6iTRPj4rt8mUIMp lgsoVq0pxmJy+nJIoeceEZ0IdTiD8qvXxuAM1jD3HvahdP3bjdFvIY46baLWHtk3CT uxVwqreCHLgASOEWf/i20RCIaTLDnJ1ahGznIvHiG40h4UeJOwmdV6dPN8wSpUzlbf JNVAwJOgcpjog== From: Will Deacon To: linux-kernel@vger.kernel.org Cc: kernel-team@android.com, Will Deacon , iommu@lists.linux.dev, Christoph Hellwig , Marek Szyprowski , Robin Murphy , Petr Tesarik , Dexuan Cui , Nicolin Chen , Michael Kelley Subject: [PATCH v6 1/6] swiotlb: Fix double-allocation of slots due to broken alignment handling Date: Fri, 8 Mar 2024 15:28:24 +0000 Message-Id: <20240308152829.25754-2-will@kernel.org> X-Mailer: git-send-email 2.20.1 In-Reply-To: <20240308152829.25754-1-will@kernel.org> References: <20240308152829.25754-1-will@kernel.org> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Commit bbb73a103fbb ("swiotlb: fix a braino in the alignment check fix"), which was a fix for commit 0eee5ae10256 ("swiotlb: fix slot alignment checks"), causes a functional regression with vsock in a virtual machine using bouncing via a restricted DMA SWIOTLB pool. When virtio allocates the virtqueues for the vsock device using dma_alloc_coherent(), the SWIOTLB search can return page-unaligned allocations if 'area->index' was left unaligned by a previous allocation from the buffer: # Final address in brackets is the SWIOTLB address returned to the caller | virtio-pci 0000:00:07.0: orig_addr 0x0 alloc_size 0x2000, iotlb_align_mask 0x800 stride 0x2: got slot 1645-1649/7168 (0x98326800) | virtio-pci 0000:00:07.0: orig_addr 0x0 alloc_size 0x2000, iotlb_align_mask 0x800 stride 0x2: got slot 1649-1653/7168 (0x98328800) | virtio-pci 0000:00:07.0: orig_addr 0x0 alloc_size 0x2000, iotlb_align_mask 0x800 stride 0x2: got slot 1653-1657/7168 (0x9832a800) This ends badly (typically buffer corruption and/or a hang) because swiotlb_alloc() is expecting a page-aligned allocation and so blindly returns a pointer to the 'struct page' corresponding to the allocation, therefore double-allocating the first half (2KiB slot) of the 4KiB page. Fix the problem by treating the allocation alignment separately to any additional alignment requirements from the device, using the maximum of the two as the stride to search the buffer slots and taking care to ensure a minimum of page-alignment for buffers larger than a page. This also resolves swiotlb allocation failures occuring due to the inclusion of ~PAGE_MASK in 'iotlb_align_mask' for large allocations and resulting in alignment requirements exceeding swiotlb_max_mapping_size(). Fixes: bbb73a103fbb ("swiotlb: fix a braino in the alignment check fix") Fixes: 0eee5ae10256 ("swiotlb: fix slot alignment checks") Cc: Christoph Hellwig Cc: Marek Szyprowski Cc: Robin Murphy Cc: Dexuan Cui Reviewed-by: Michael Kelley Reviewed-by: Petr Tesarik Tested-by: Nicolin Chen Signed-off-by: Will Deacon --- kernel/dma/swiotlb.c | 28 +++++++++++++++------------- 1 file changed, 15 insertions(+), 13 deletions(-) diff --git a/kernel/dma/swiotlb.c b/kernel/dma/swiotlb.c index b079a9a8e087..2ec2cc81f1a2 100644 --- a/kernel/dma/swiotlb.c +++ b/kernel/dma/swiotlb.c @@ -982,7 +982,7 @@ static int swiotlb_search_pool_area(struct device *dev, struct io_tlb_pool *pool phys_to_dma_unencrypted(dev, pool->start) & boundary_mask; unsigned long max_slots = get_max_slots(boundary_mask); unsigned int iotlb_align_mask = - dma_get_min_align_mask(dev) | alloc_align_mask; + dma_get_min_align_mask(dev) & ~(IO_TLB_SIZE - 1); unsigned int nslots = nr_slots(alloc_size), stride; unsigned int offset = swiotlb_align_offset(dev, orig_addr); unsigned int index, slots_checked, count = 0, i; @@ -993,19 +993,18 @@ static int swiotlb_search_pool_area(struct device *dev, struct io_tlb_pool *pool BUG_ON(!nslots); BUG_ON(area_index >= pool->nareas); + /* + * For mappings with an alignment requirement don't bother looping to + * unaligned slots once we found an aligned one. + */ + stride = get_max_slots(max(alloc_align_mask, iotlb_align_mask)); + /* * For allocations of PAGE_SIZE or larger only look for page aligned * allocations. */ if (alloc_size >= PAGE_SIZE) - iotlb_align_mask |= ~PAGE_MASK; - iotlb_align_mask &= ~(IO_TLB_SIZE - 1); - - /* - * For mappings with an alignment requirement don't bother looping to - * unaligned slots once we found an aligned one. - */ - stride = (iotlb_align_mask >> IO_TLB_SHIFT) + 1; + stride = umax(stride, PAGE_SHIFT - IO_TLB_SHIFT + 1); spin_lock_irqsave(&area->lock, flags); if (unlikely(nslots > pool->area_nslabs - area->used)) @@ -1015,11 +1014,14 @@ static int swiotlb_search_pool_area(struct device *dev, struct io_tlb_pool *pool index = area->index; for (slots_checked = 0; slots_checked < pool->area_nslabs; ) { - slot_index = slot_base + index; + phys_addr_t tlb_addr; - if (orig_addr && - (slot_addr(tbl_dma_addr, slot_index) & - iotlb_align_mask) != (orig_addr & iotlb_align_mask)) { + slot_index = slot_base + index; + tlb_addr = slot_addr(tbl_dma_addr, slot_index); + + if ((tlb_addr & alloc_align_mask) || + (orig_addr && (tlb_addr & iotlb_align_mask) != + (orig_addr & iotlb_align_mask))) { index = wrap_area_index(pool, index + 1); slots_checked++; continue; -- 2.44.0.278.ge034bb2e1d-goog