Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp3913794ybl; Mon, 13 Jan 2020 04:56:40 -0800 (PST) X-Google-Smtp-Source: APXvYqyoijMBII09kb+C0tMqydpbsNi5xOb+eLFDVM18BrwVN21zxD8kfO3Nc31GID/8syJ5iYV5 X-Received: by 2002:aca:388:: with SMTP id 130mr11850874oid.89.1578920200005; Mon, 13 Jan 2020 04:56:40 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1578920199; cv=none; d=google.com; s=arc-20160816; b=CSCkmFETDclrydnrqDPOAzKFi22q3WgDdQkRHM/vBoRsBkQsXAjFWNzrdiIhWFJj56 fd9nIpGiA69QOeXSFRasEbyEXAPxHlp0DYVqI8nwCwapJu7oYDMySemYc0z3Qg8470jM wfkzmhEX1KCXEQPlV53wkZTnSS+RSjrJXsuv+PDtpNxMFjmURfyxpfaGbw2GErVW7mtM 35sl0jNUQwd9MJ+pSxwJ/X+iXaMCnbzmRLwC8nywjpY1vSR7vqcDM3iJZqSLx62HTEh8 QxRuqJhxGmmQHlJTkBpD/joI0F5Jzl29k8ZjklLExQk2qUniePswAqq8/IkZukxepg30 MoGA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=vAX4Q5QWRIWB4Xj5ekzOIij5yq759pBLZ4qH5soZkYs=; b=S+7t4L7rntXvNKAvssEaEfd3S9GB4H8m8KzCkawvkak8tKfsIsxH1X7sg8PDoIE9mW 3UQuR0xhR5Cdm2I+ED98iF3IA1q/nP9CR8/Lt3Vwu65PnJ6DCbXFReU1ET7g1U1F+LLA HD5LkaJWhTr4pYJBRHMXjkKVGSpHQw5beHDhSl4lbiY7I2FH3/EdJuyBpy/zEL/AF9MA 8Avg1GL1LP4YV4+P75wG9NvLe6yzTDN9wcx2fhIPqTj7EcsyBFFnQumtnsk1I1y6wQLq kP1hnZ7gSipIq/FPPIWSfC/pEqNPQcb8ZXtxpTSJrES2VhUNRNM9Boiww98xPeuGxSra 5bwA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id z11si5827501oic.176.2020.01.13.04.56.27; Mon, 13 Jan 2020 04:56:39 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726985AbgAMMzc (ORCPT + 99 others); Mon, 13 Jan 2020 07:55:32 -0500 Received: from szxga05-in.huawei.com ([45.249.212.191]:8710 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726375AbgAMMzc (ORCPT ); Mon, 13 Jan 2020 07:55:32 -0500 Received: from DGGEMS414-HUB.china.huawei.com (unknown [172.30.72.58]) by Forcepoint Email with ESMTP id DF64D5A19E236C3B78D8; Mon, 13 Jan 2020 20:55:29 +0800 (CST) Received: from [127.0.0.1] (10.173.220.183) by DGGEMS414-HUB.china.huawei.com (10.3.19.214) with Microsoft SMTP Server id 14.3.439.0; Mon, 13 Jan 2020 20:55:19 +0800 Subject: Re: [PATCH] brd: check parameter validation before register_blkdev func To: Ming Lei CC: Jens Axboe , , "linux-kernel@vger.kernel.org" , , Mingfangsen , Guiyao , zhangsaisai , "wubo (T)" References: <342ee238-0e7c-c213-eecc-7062f24985cc@huawei.com> <20200113110003.GA13011@ming.t460p> From: Zhiqiang Liu Message-ID: Date: Mon, 13 Jan 2020 20:55:18 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.2.2 MIME-Version: 1.0 In-Reply-To: <20200113110003.GA13011@ming.t460p> Content-Type: text/plain; charset="utf-8" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.173.220.183] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2020/1/13 19:04, Ming Lei wrote: > On Fri, Jan 10, 2020 at 01:10:20PM +0800, Zhiqiang Liu wrote: >> >> In brd_init func, rd_nr num of brd_device are firstly allocated >> and add in brd_devices, then brd_devices are traversed to add each >> brd_device by calling add_disk func. When allocating brd_device, >> the disk->first_minor is set to i * max_part, if rd_nr * max_part >> is larger than MINORMASK, two different brd_device may have the same >> devt, then only one of them can be successfully added. >> when rmmod brd.ko, it will cause oops when calling brd_exit. >> >> Follow those steps: >> # modprobe brd rd_nr=3 rd_size=102400 max_part=1048576 >> # rmmod brd >> then, the oops will appear. >> >> Oops log: >> [ 726.613722] Call trace: >> [ 726.614175] kernfs_find_ns+0x24/0x130 >> [ 726.614852] kernfs_find_and_get_ns+0x44/0x68 >> [ 726.615749] sysfs_remove_group+0x38/0xb0 >> [ 726.616520] blk_trace_remove_sysfs+0x1c/0x28 >> [ 726.617320] blk_unregister_queue+0x98/0x100 >> [ 726.618105] del_gendisk+0x144/0x2b8 >> [ 726.618759] brd_exit+0x68/0x560 [brd] >> [ 726.619501] __arm64_sys_delete_module+0x19c/0x2a0 >> [ 726.620384] el0_svc_common+0x78/0x130 >> [ 726.621057] el0_svc_handler+0x38/0x78 >> [ 726.621738] el0_svc+0x8/0xc >> [ 726.622259] Code: aa0203f6 aa0103f7 aa1e03e0 d503201f (7940e260) >> >> Here, we add brd_check_par_valid func to check parameter >> validation before register_blkdev func. >> >> Signed-off-by: Zhiqiang Liu >> --- >> drivers/block/brd.c | 33 ++++++++++++++++++++++++++------- >> +static inline int brd_check_par_valid(void) >> +{ >> + if (unlikely(!rd_nr)) >> + rd_nr = 1; >> + >> + if (unlikely(!max_part)) >> + max_part = 1; >> + >> + if (rd_nr * max_part > MINORMASK) >> + return -EINVAL; >> + >> + return 0; >> + >> +} >> + >> static int __init brd_init(void) >> { >> struct brd_device *brd, *next; >> - int i; >> + int i, ret; >> >> /* >> * brd module now has a feature to instantiate underlying device >> @@ -488,11 +503,15 @@ static int __init brd_init(void) >> * dynamically. >> */ >> >> + ret = brd_check_par_valid(); >> + if (ret) { >> + pr_info("brd: invalid parameter setting!!!\n"); >> + return ret; >> + } >> + > > The max supported partition number is 256, see __alloc_disk_node(). > So even though one bigger number is passed to alloc_disk(), at most > 256 partitions are allowed on that disk. Maybe you can apply the > following way to avoid the issue: > > disk->first_minor = i * disk->minors; > > However, looks 'rd_nr' still needs to be validated(rd_nr < 2 ^ 23). > Thanks for your reply. As you said, minors is limited to DISK_MAX_PARTS in __alloc_disk_node when calling alloc_disk in brd_alloc func. We can set: disk->first_minor = i * disk->minors as your suggestion. As for 'rd_nr', I think we should make sure that 'disk->first_minor' should be not larger than MINORMASK. ->add_disk ->device_add_disk ->__device_add_disk ->blk_alloc_devt ->MKDEV If rd_nr > MINORMASK / min(max_part, DISK_MAX_PARTS), two different brd_device may have the same devt by calling MKDEV in blk_alloc_devt func. For example, MKDEV(1, 0) is equal to MKDEV(1, MINORMASK + 1). So we should check both rd_nr and max_part as follows, static inline int brd_check_par_valid(void) { if (unlikely(!rd_nr)) rd_nr = 1; if (unlikely(!max_part)) max_part = 1; if (max_part > DISK_MAX_PARTS) max_part = DISK_MAX_PARTS; if (rd_nr > MINORMASK / max_part) return -EINVAL; return 0; } I will send the v2 patch.