Received: by 10.223.185.116 with SMTP id b49csp5355735wrg; Tue, 27 Feb 2018 11:51:38 -0800 (PST) X-Google-Smtp-Source: AH8x2271pkcOUZj1g+r1BLs/IyyCmgT2SMvcvm0Mmv+KDEcouRzVLM1bdKvJJTOOWpqkAHnaP/lz X-Received: by 10.98.23.136 with SMTP id 130mr15049413pfx.43.1519761098221; Tue, 27 Feb 2018 11:51:38 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1519761098; cv=none; d=google.com; s=arc-20160816; b=yPU9UPSiBmqztVDMDnX3CzuJXzE6m6pC+FCNQLgKRUB+q9wLUjzGlR1xdAEWpUw/PQ EcA8KprXY+o2usJtSJnzYHHEFqCUFIuya6J2XZeqilSu+D3XkZ58kz/2sLBA1jjQ9hSg mC5sBYrCSLNg0x1H6ybtqMTpT0+XRRUyiElEvV7AOLTTz86+bsF6bSSFJPERq57cSaAv UI1g31H19xZr+AlFKIceDMXfZQ8eF9Zv5CGC5rFkCPb8iM9PjGwb3OSHxz1nDewMyoas +Pwlc0wehPZ49kZwtKL4s2uD/A+XR0TNfbl7350i9UM5F+LSvZpZpYmX0Ggs337CdJlk BErg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:references:to:cc:in-reply-to:date:subject :mime-version:message-id:from:dkim-signature :arc-authentication-results; bh=SiTlcOGBkLF6YQYIS+fFs4xBdbLWvaTPmMebKvhjd1M=; b=ji20fWMU3FxTvoVPvhNwwhLzblpAa0vv61MZm/ceoxZJfRMzdptu25yjtNVYH6F91h poOl9X6BvldLV/TPexjYO2ITAp77ITefGb7lnSmIlReBXEAhcOp5E64Bl5zxPjTFwIyV ahCV8ExqiiDcbKpGGpNPPSFjT0d6eI19SxZVnSUChNJE0wGvmOr1Q6vMGVMuMUiTrNhe JF01DJCZMx+rh7C+0tK3N+hHcEu/fDDTGXadrteCqLEMJPmCHd7o+vmZFV+AEBDfwUYP EnJFtx/xZPmc49nyaUtaVOtAvhu2JMx80wU6ht8Mm39Stp9NSjgI2fQND7XPYMnlde/5 SWug== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@javigon-com.20150623.gappssmtp.com header.s=20150623 header.b=WFocc90A; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id g6si7382051pgr.233.2018.02.27.11.51.23; Tue, 27 Feb 2018 11:51:38 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@javigon-com.20150623.gappssmtp.com header.s=20150623 header.b=WFocc90A; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752005AbeB0TuJ (ORCPT + 99 others); Tue, 27 Feb 2018 14:50:09 -0500 Received: from mail-it0-f48.google.com ([209.85.214.48]:38360 "EHLO mail-it0-f48.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751901AbeB0TuH (ORCPT ); Tue, 27 Feb 2018 14:50:07 -0500 Received: by mail-it0-f48.google.com with SMTP id j7so616497ita.3 for ; Tue, 27 Feb 2018 11:50:07 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=javigon-com.20150623.gappssmtp.com; s=20150623; h=from:message-id:mime-version:subject:date:in-reply-to:cc:to :references; bh=SiTlcOGBkLF6YQYIS+fFs4xBdbLWvaTPmMebKvhjd1M=; b=WFocc90AvWcCXBfJCEjpScCLhQMEtRo4jShERXCqy2D58jNjujOaIT225bGyvUzEiw tCa4uy2tp8yI31z8nxsQNCLKP03Vx9gh0Lfi0gOquO9UmnKk907U8AlFP0menPvcCgsx 2+sNYx2iUon1kO+3UXlUt80fZ3hxl1rNOiA07AsFhZ9KHIFRuWZxodJMeAlHAu4f4Kvv Lo97nAKRD0kcEfnmMbAGPRHuV1PP8DW5LepTGtT/n7PLgSwIEfe0M5Vx5XhAL5Cggfwm ncsW99kKa5qAUf/wbaGkwgj2V4VvKpZYk17ICqY6dRqAyvBkEQ4YtVBK+Kc6Z6qas/Jv 6DaQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:message-id:mime-version:subject:date :in-reply-to:cc:to:references; bh=SiTlcOGBkLF6YQYIS+fFs4xBdbLWvaTPmMebKvhjd1M=; b=OuFhmBhKj6ta0d7Yau8MAzGn9GAvecd6/CSOLAAiFnWGm1OTcfXb2tT4q15W5qLCGr BCBsysfTtKSCo4xdOahiogSWHxsm2eA0ThxElFlcdwbamtPKfLjn9jKbG17hnK6/b4Lu AWHkYnJrSiD2O8J8sWATyJnfsrd7+DPTidBKvzGkNijT+QpqQQ7NbYknZGWZ8Idqjiac YIBb87faEJgk8wKEziOD/hESeT+vu0j1jMKFFnruuOxqr25Zq75XFg6PvUVjgLwytyo9 ZSGqHi1HdyLb8coQBMNJ+DwtaafAEnhRYiO3euz2j1GSEktL6at0cUJ/RZxBNtXjG13x ggNQ== X-Gm-Message-State: APf1xPAAjpDorkJvCbB9basi+oaCXaSBAqu9F7ZITO8KPHJkeKxvI4mN b4+PvAwNfV94gSP5DVBbcM/v3iUd5oI= X-Received: by 10.36.216.8 with SMTP id b8mr18635879itg.107.1519761006460; Tue, 27 Feb 2018 11:50:06 -0800 (PST) Received: from [172.31.1.2] (6164211-cl69.boa.fiberby.dk. [193.106.164.211]) by smtp.gmail.com with ESMTPSA id o31sm5257432ioi.49.2018.02.27.11.50.04 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 27 Feb 2018 11:50:05 -0800 (PST) From: =?utf-8?Q?Javier_Gonz=C3=A1lez?= Message-Id: <5AE7F4CA-DBDF-4A67-A04F-4760D8E729B5@javigon.com> Content-Type: multipart/signed; boundary="Apple-Mail=_D6526EAC-F639-46AF-B84B-BA6806B24E5D"; protocol="application/pgp-signature"; micalg=pgp-sha512 Mime-Version: 1.0 (Mac OS X Mail 11.2 \(3445.5.20\)) Subject: Re: [PATCH 17/19] lightnvm: pblk: implement get log report chunk Date: Tue, 27 Feb 2018 20:50:00 +0100 In-Reply-To: <1883698b-8812-6c7a-cd59-dd24bde49f54@lightnvm.io> Cc: linux-block@vger.kernel.org, linux-kernel@vger.kernel.org, linux-nvme@lists.infradead.org To: =?utf-8?Q?Matias_Bj=C3=B8rling?= References: <1519651038-16845-1-git-send-email-javier@cnexlabs.com> <1519651038-16845-18-git-send-email-javier@cnexlabs.com> <7f82b801-acee-dc07-7604-e0c1c726516e@lightnvm.io> <1883698b-8812-6c7a-cd59-dd24bde49f54@lightnvm.io> X-Mailer: Apple Mail (2.3445.5.20) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org --Apple-Mail=_D6526EAC-F639-46AF-B84B-BA6806B24E5D Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=utf-8 > On 27 Feb 2018, at 19.46, Matias Bj=C3=B8rling wrote: >=20 > On 02/27/2018 03:40 PM, Javier Gonz=C3=A1lez wrote: >>> On 26 Feb 2018, at 20.04, Matias Bj=C3=B8rling = wrote: > >>> Can you help me understand why you want to use The >>> NVM_CHK_ST_HOST_USE? Why would I care if the chunk state is = HOST_USE? >>> A target instance should not be able to see states from other chunks >>> it doesn't own. and in that case, why have a separate state? >> The motivation for this state is that pblk does not maintain a per >> block/chunk state. On the first disk pass however, the state of the >> chunks is unknown, >=20 > I don't understand this. Why is the state unknown? If assuming 1.2 > drive, one can do a scan of first page of each block. If the block > returns success (it is in use), if 0x42ff, the block is empty, and if > reading and it returns 0x4281, it is open. >=20 > thus we need to check for each one individually; Yes, we could do this on init - this is actually how we do recovery after all.. >> after that, we know that good chunks need to be erased. Having this >> extra chunk state, frees us from maintaining this unnecessary = intra-line >> state. Also, at the point I wrote this code, we did not have the = report >> chunk on the erase path, so updating the chunk metadata would be >> something periodic. >> I have taken a look at this again and I solved it in a different way = - >> basically, only maintaining free/closed states, which is not much = more >> expensive. It is included in the next version. >>>> + } >>>> + >>>> + WARN_ONCE(state & NVM_CHK_ST_OPEN, >>>> + "pblk: open chunk in new line: %d\n", >>>> + line->id); >>>> + } >>>> + >>>> + return blk_to_erase; >>>> +} >>>> + >>>> static int pblk_line_prepare(struct pblk *pblk, struct pblk_line = *line) >>>> { >>>> struct pblk_line_meta *lm =3D &pblk->lm; >>>> - int blk_in_line =3D atomic_read(&line->blk_in_line); >>>> + int blk_to_erase; >>>> line->map_bitmap =3D kzalloc(lm->sec_bitmap_len, = GFP_ATOMIC); >>>> if (!line->map_bitmap) >>>> @@ -1110,7 +1184,21 @@ static int pblk_line_prepare(struct pblk = *pblk, struct pblk_line *line) >>>> return -ENOMEM; >>>> } >>>> + /* Bad blocks do not need to be erased */ >>>> + bitmap_copy(line->erase_bitmap, line->blk_bitmap, = lm->blk_per_line); >>>> + >>>> spin_lock(&line->lock); >>>> + >>>> + /* If we have not written to this line, we need to mark up free = chunks >>>> + * as already erased >>>> + */ >>>> + if (line->state =3D=3D PBLK_LINESTATE_NEW) { >>>> + blk_to_erase =3D pblk_prepare_new_line(pblk, line); >>>> + line->state =3D PBLK_LINESTATE_FREE; >>>> + } else { >>>> + blk_to_erase =3D atomic_read(&line->blk_in_line); >>>> + } >>>> + >>>> if (line->state !=3D PBLK_LINESTATE_FREE) { >>>> kfree(line->map_bitmap); >>>> kfree(line->invalid_bitmap); >>>> @@ -1122,15 +1210,12 @@ static int pblk_line_prepare(struct pblk = *pblk, struct pblk_line *line) >>>> line->state =3D PBLK_LINESTATE_OPEN; >>>> - atomic_set(&line->left_eblks, blk_in_line); >>>> - atomic_set(&line->left_seblks, blk_in_line); >>>> + atomic_set(&line->left_eblks, blk_to_erase); >>>> + atomic_set(&line->left_seblks, blk_to_erase); >>>> line->meta_distance =3D lm->meta_distance; >>>> spin_unlock(&line->lock); >>>> - /* Bad blocks do not need to be erased */ >>>> - bitmap_copy(line->erase_bitmap, line->blk_bitmap, = lm->blk_per_line); >>>> - >>>> kref_init(&line->ref); >>>> return 0; >>>> diff --git a/drivers/lightnvm/pblk-init.c = b/drivers/lightnvm/pblk-init.c >>>> index ec39800eea42..cf4f49d48aed 100644 >>>> --- a/drivers/lightnvm/pblk-init.c >>>> +++ b/drivers/lightnvm/pblk-init.c >>>> @@ -401,6 +401,7 @@ static void pblk_line_meta_free(struct = pblk_line *line) >>>> { >>>> kfree(line->blk_bitmap); >>>> kfree(line->erase_bitmap); >>>> + kfree(line->chks); >>>> } >>>> static void pblk_lines_free(struct pblk *pblk) >>>> @@ -440,54 +441,44 @@ static int pblk_bb_get_tbl(struct nvm_tgt_dev = *dev, struct pblk_lun *rlun, >>>> return 0; >>>> } >>>> -static void *pblk_bb_get_log(struct pblk *pblk) >>>> +static void *pblk_bb_get_meta(struct pblk *pblk) >>>> { >>>> struct nvm_tgt_dev *dev =3D pblk->dev; >>>> struct nvm_geo *geo =3D &dev->geo; >>>> - u8 *log; >>>> + u8 *meta; >>>> int i, nr_blks, blk_per_lun; >>>> int ret; >>>> blk_per_lun =3D geo->c.num_chk * geo->c.pln_mode; >>>> nr_blks =3D blk_per_lun * geo->all_luns; >>>> - log =3D kmalloc(nr_blks, GFP_KERNEL); >>>> - if (!log) >>>> + meta =3D kmalloc(nr_blks, GFP_KERNEL); >>>> + if (!meta) >>>> return ERR_PTR(-ENOMEM); >>>> for (i =3D 0; i < geo->all_luns; i++) { >>>> struct pblk_lun *rlun =3D &pblk->luns[i]; >>>> - u8 *log_pos =3D log + i * blk_per_lun; >>>> + u8 *meta_pos =3D meta + i * blk_per_lun; >>>> - ret =3D pblk_bb_get_tbl(dev, rlun, log_pos, = blk_per_lun); >>>> + ret =3D pblk_bb_get_tbl(dev, rlun, meta_pos, = blk_per_lun); >>>> if (ret) { >>>> - kfree(log); >>>> + kfree(meta); >>>> return ERR_PTR(-EIO); >>>> } >>>> } >>>> - return log; >>>> + return meta; >>>> } >>>> -static int pblk_bb_line(struct pblk *pblk, struct pblk_line = *line, >>>> - u8 *bb_log, int blk_per_line) >>>> +static void *pblk_chunk_get_meta(struct pblk *pblk) >>>> { >>>> struct nvm_tgt_dev *dev =3D pblk->dev; >>>> struct nvm_geo *geo =3D &dev->geo; >>>> - int i, bb_cnt =3D 0; >>>> - for (i =3D 0; i < blk_per_line; i++) { >>>> - struct pblk_lun *rlun =3D &pblk->luns[i]; >>>> - u8 *lun_bb_log =3D bb_log + i * blk_per_line; >>>> - >>>> - if (lun_bb_log[line->id] =3D=3D NVM_BLK_T_FREE) >>>> - continue; >>>> - >>>> - set_bit(pblk_ppa_to_pos(geo, rlun->bppa), = line->blk_bitmap); >>>> - bb_cnt++; >>>> - } >>>> - >>>> - return bb_cnt; >>>> + if (geo->c.version =3D=3D NVM_OCSSD_SPEC_12) >>>> + return pblk_bb_get_meta(pblk); >>>> + else >>>> + return pblk_chunk_get_info(pblk); >>>=20 >>> This 1.2 or 2.0 would be nice to have inside the lightnvm core. A >>> target should not care weather it is a 1.2 or 2.0 device. >> It is not an easy thing to do since the bad block format and chunk >> report return completely different formats. As I explained in a >> different thread, doing this in core would force 1.2 to understand = fake >> chunk formats, which are not meaningful. It is better to leave the = logic >> at the target level, where we know what to do with the format. >=20 > What if the core layer exported a generic "chunk meta" interface, and > then filled it as is done in pblk. In the core, it will then fill out > what it can fill out, and leave the rest of the fields untouched. The > pblk code can then if () in certain places if for example wp attribute > can,'t be used? >=20 It is basically what we do in pblk now. It is easy to abstract in core; I'm just concerned that hiding the chunk metadata will end up making the pblk code more complex - in the end doing a check on the version is meaningful for the reader. > In the interest of time, and to get this moving, we can ignore it for = now. Ok. Let's do it this way now and we can see how to abstract it when we start implementing features based on these fields. The first one will be wear levelling, which I expect to have for next window. >>>> } >>>> static int pblk_luns_init(struct pblk *pblk, struct ppa_addr = *luns) >>>> @@ -516,6 +507,7 @@ static int pblk_luns_init(struct pblk *pblk, = struct ppa_addr *luns) >>>> rlun =3D &pblk->luns[i]; >>>> rlun->bppa =3D luns[lunid]; >>>> + rlun->chunk_bppa =3D luns[i]; >>>> sema_init(&rlun->wr_sem, 1); >>>> } >>>> @@ -695,8 +687,125 @@ static int pblk_lines_alloc_metadata(struct = pblk *pblk) >>>> return -ENOMEM; >>>> } >>>> -static int pblk_setup_line_meta(struct pblk *pblk, struct = pblk_line *line, >>>> - void *chunk_log, long *nr_bad_blks) >>>> +static int pblk_setup_line_meta_12(struct pblk *pblk, struct = pblk_line *line, >>>> + void *chunk_meta) >>>> +{ >>>> + struct nvm_tgt_dev *dev =3D pblk->dev; >>>> + struct nvm_geo *geo =3D &dev->geo; >>>> + struct pblk_line_meta *lm =3D &pblk->lm; >>>> + int i, chk_per_lun, nr_bad_chks =3D 0; >>>> + >>>> + chk_per_lun =3D geo->c.num_chk * geo->c.pln_mode; >>>> + >>>> + for (i =3D 0; i < lm->blk_per_line; i++) { >>>> + struct pblk_chunk *chunk =3D &line->chks[i]; >>>> + struct pblk_lun *rlun =3D &pblk->luns[i]; >>>> + u8 *lun_bb_meta =3D chunk_meta + i * chk_per_lun; >>>> + >>>> + /* >>>> + * In 1.2 spec. chunk state is not persisted by the = device. Thus >>>> + * some of the values are reset each time pblk is = instantiated. >>>> + */ >>>> + if (lun_bb_meta[line->id] =3D=3D NVM_BLK_T_FREE) >>>> + chunk->state =3D NVM_CHK_ST_HOST_USE; >>>> + else >>>> + chunk->state =3D NVM_CHK_ST_OFFLINE; >>>> + >>>> + chunk->type =3D NVM_CHK_TP_W_SEQ; >>>> + chunk->wi =3D 0; >>>> + chunk->slba =3D -1; >>>> + chunk->cnlb =3D geo->c.clba; >>>> + chunk->wp =3D 0; >>>> + >>>> + if (!(chunk->state & NVM_CHK_ST_OFFLINE)) >>>> + continue; >>>> + >>>> + set_bit(pblk_ppa_to_pos(geo, rlun->bppa), = line->blk_bitmap); >>>> + nr_bad_chks++; >>>> + } >>>> + >>>> + return nr_bad_chks; >>>> +} >>>> + >>>> +static int pblk_setup_line_meta_20(struct pblk *pblk, struct = pblk_line *line, >>>> + struct nvm_chk_meta *meta) >>>> +{ >>>> + struct nvm_tgt_dev *dev =3D pblk->dev; >>>> + struct nvm_geo *geo =3D &dev->geo; >>>> + struct pblk_line_meta *lm =3D &pblk->lm; >>>> + int i, nr_bad_chks =3D 0; >>>> + >>>> + for (i =3D 0; i < lm->blk_per_line; i++) { >>>> + struct pblk_chunk *chunk =3D &line->chks[i]; >>>> + struct pblk_lun *rlun =3D &pblk->luns[i]; >>>> + struct nvm_chk_meta *chunk_meta; >>>> + struct ppa_addr ppa; >>>> + >>>> + ppa =3D rlun->chunk_bppa; >>>> + ppa.m.chk =3D line->id; >>>> + chunk_meta =3D pblk_chunk_get_off(pblk, meta, ppa); >>>> + >>>> + chunk->state =3D chunk_meta->state; >>>> + chunk->type =3D chunk_meta->type; >>>> + chunk->wi =3D chunk_meta->wi; >>>> + chunk->slba =3D chunk_meta->slba; >>>> + chunk->cnlb =3D chunk_meta->cnlb; >>>> + chunk->wp =3D chunk_meta->wp; >>>> + >>>> + if (!(chunk->state & NVM_CHK_ST_OFFLINE)) >>>> + continue; >>>> + >>>> + if (chunk->type & NVM_CHK_TP_SZ_SPEC) { >>>> + WARN_ONCE(1, "pblk: custom-sized chunks = unsupported\n"); >>>> + continue; >>>> + } >>>> + >>>> + set_bit(pblk_ppa_to_pos(geo, rlun->chunk_bppa), >>>> + = line->blk_bitmap); >>>> + nr_bad_chks++; >>>> + } >>>> + >>>> + return nr_bad_chks; >>>> +} >>>> + >>>> +static long pblk_setup_line_meta(struct pblk *pblk, struct = pblk_line *line, >>>> + void *chunk_meta, int line_id) >>>> +{ >>>> + struct nvm_tgt_dev *dev =3D pblk->dev; >>>> + struct nvm_geo *geo =3D &dev->geo; >>>> + struct pblk_line_mgmt *l_mg =3D &pblk->l_mg; >>>> + struct pblk_line_meta *lm =3D &pblk->lm; >>>> + long nr_bad_chks, chk_in_line; >>>> + >>>> + line->pblk =3D pblk; >>>> + line->id =3D line_id; >>>> + line->type =3D PBLK_LINETYPE_FREE; >>>> + line->state =3D PBLK_LINESTATE_NEW; >>>> + line->gc_group =3D PBLK_LINEGC_NONE; >>>> + line->vsc =3D &l_mg->vsc_list[line_id]; >>>> + spin_lock_init(&line->lock); >>>> + >>>> + if (geo->c.version =3D=3D NVM_OCSSD_SPEC_12) >>>> + nr_bad_chks =3D pblk_setup_line_meta_12(pblk, line, = chunk_meta); >>>> + else >>>> + nr_bad_chks =3D pblk_setup_line_meta_20(pblk, line, = chunk_meta); >>>> + >>>> + chk_in_line =3D lm->blk_per_line - nr_bad_chks; >>>> + if (nr_bad_chks < 0 || nr_bad_chks > lm->blk_per_line || >>>> + chk_in_line < lm->min_blk_line) = { >>>> + line->state =3D PBLK_LINESTATE_BAD; >>>> + list_add_tail(&line->list, &l_mg->bad_list); >>>> + return 0; >>>> + } >>>> + >>>> + atomic_set(&line->blk_in_line, chk_in_line); >>>> + list_add_tail(&line->list, &l_mg->free_list); >>>> + l_mg->nr_free_lines++; >>>> + >>>> + return chk_in_line; >>>> +} >>>> + >>>> +static int pblk_alloc_line_meta(struct pblk *pblk, struct = pblk_line *line) >>>> { >>>> struct pblk_line_meta *lm =3D &pblk->lm; >>>> @@ -710,7 +819,13 @@ static int pblk_setup_line_meta(struct pblk = *pblk, struct pblk_line *line, >>>> return -ENOMEM; >>>> } >>>> - *nr_bad_blks =3D pblk_bb_line(pblk, line, chunk_log, = lm->blk_per_line); >>>> + line->chks =3D kmalloc(lm->blk_per_line * sizeof(struct = pblk_chunk), >>>> + = GFP_KERNEL); >>>> + if (!line->chks) { >>>> + kfree(line->erase_bitmap); >>>> + kfree(line->blk_bitmap); >>>> + return -ENOMEM; >>>> + } >>>> return 0; >>>> } >>>> @@ -722,9 +837,9 @@ static int pblk_lines_init(struct pblk *pblk) >>>> struct pblk_line_mgmt *l_mg =3D &pblk->l_mg; >>>> struct pblk_line_meta *lm =3D &pblk->lm; >>>> struct pblk_line *line; >>>> - void *chunk_log; >>>> + void *chunk_meta; >>>> unsigned int smeta_len, emeta_len; >>>> - long nr_bad_blks =3D 0, nr_free_blks =3D 0; >>>> + long nr_free_chks =3D 0; >>>> int bb_distance, max_write_ppas; >>>> int i, ret; >>>> @@ -743,6 +858,7 @@ static int pblk_lines_init(struct pblk *pblk) >>>> l_mg->log_line =3D l_mg->data_line =3D NULL; >>>> l_mg->l_seq_nr =3D l_mg->d_seq_nr =3D 0; >>>> l_mg->nr_free_lines =3D 0; >>>> + atomic_set(&l_mg->sysfs_line_state, -1); >>>> bitmap_zero(&l_mg->meta_bitmap, PBLK_DATA_LINES); >>>> lm->sec_per_line =3D geo->c.clba * geo->all_luns; >>>> @@ -841,53 +957,31 @@ static int pblk_lines_init(struct pblk *pblk) >>>> goto fail_free_bb_aux; >>>> } >>>> - chunk_log =3D pblk_bb_get_log(pblk); >>>> - if (IS_ERR(chunk_log)) { >>>> - pr_err("pblk: could not get bad block log (%lu)\n", >>>> - = PTR_ERR(chunk_log)); >>>> - ret =3D PTR_ERR(chunk_log); >>>> + chunk_meta =3D pblk_chunk_get_meta(pblk); >>>> + if (IS_ERR(chunk_meta)) { >>>> + pr_err("pblk: could not get chunk log (%lu)\n", >>>> + = PTR_ERR(chunk_meta)); >>>> + ret =3D PTR_ERR(chunk_meta); >>>> goto fail_free_lines; >>>> } >>>> for (i =3D 0; i < l_mg->nr_lines; i++) { >>>> - int chk_in_line; >>>> - >>>> line =3D &pblk->lines[i]; >>>> - line->pblk =3D pblk; >>>> - line->id =3D i; >>>> - line->type =3D PBLK_LINETYPE_FREE; >>>> - line->state =3D PBLK_LINESTATE_FREE; >>>> - line->gc_group =3D PBLK_LINEGC_NONE; >>>> - line->vsc =3D &l_mg->vsc_list[i]; >>>> - spin_lock_init(&line->lock); >>>> - >>>> - ret =3D pblk_setup_line_meta(pblk, line, chunk_log, = &nr_bad_blks); >>>> + ret =3D pblk_alloc_line_meta(pblk, line); >>>> if (ret) >>>> - goto fail_free_chunk_log; >>>> + goto fail_free_chunk_meta; >>>> - chk_in_line =3D lm->blk_per_line - nr_bad_blks; >>>> - if (nr_bad_blks < 0 || nr_bad_blks > lm->blk_per_line || >>>> - chk_in_line < lm->min_blk_line) = { >>>> - line->state =3D PBLK_LINESTATE_BAD; >>>> - list_add_tail(&line->list, &l_mg->bad_list); >>>> - continue; >>>> - } >>>> - >>>> - nr_free_blks +=3D chk_in_line; >>>> - atomic_set(&line->blk_in_line, chk_in_line); >>>> - >>>> - l_mg->nr_free_lines++; >>>> - list_add_tail(&line->list, &l_mg->free_list); >>>> + nr_free_chks +=3D pblk_setup_line_meta(pblk, line, = chunk_meta, i); >>>> } >>>> - pblk_set_provision(pblk, nr_free_blks); >>>> + pblk_set_provision(pblk, nr_free_chks); >>>> - kfree(chunk_log); >>>> + kfree(chunk_meta); >>>> return 0; >>>> -fail_free_chunk_log: >>>> - kfree(chunk_log); >>>> +fail_free_chunk_meta: >>>> + kfree(chunk_meta); >>>> while (--i >=3D 0) >>>> pblk_line_meta_free(&pblk->lines[i]); >>>> fail_free_lines: >>>> diff --git a/drivers/lightnvm/pblk-sysfs.c = b/drivers/lightnvm/pblk-sysfs.c >>>> index ccfb3abd2773..1ce5b956c622 100644 >>>> --- a/drivers/lightnvm/pblk-sysfs.c >>>> +++ b/drivers/lightnvm/pblk-sysfs.c >>>> @@ -142,6 +142,40 @@ static ssize_t pblk_sysfs_ppaf(struct pblk = *pblk, char *page) >>>> return sz; >>>> } >>>> +static ssize_t pblk_sysfs_line_state_show(struct pblk *pblk, char = *page) >>>> +{ >>>> + struct pblk_line_meta *lm =3D &pblk->lm; >>>> + struct pblk_line_mgmt *l_mg =3D &pblk->l_mg; >>>> + struct pblk_line *line; >>>> + int line_id =3D atomic_read(&l_mg->sysfs_line_state); >>>> + ssize_t sz =3D 0; >>>> + int i; >>>> + >>>> + if (line_id < 0 || line_id >=3D l_mg->nr_lines) >>>> + return 0; >>>> + >>>> + sz =3D snprintf(page, PAGE_SIZE, >>>> + = "line\tchunk\tstate\ttype\twear-index\tslba\t\tcnlb\twp\n"); >>>> + >>>> + line =3D &pblk->lines[line_id]; >>>> + >>>> + for (i =3D 0; i < lm->blk_per_line; i++) { >>>> + struct pblk_chunk *chunk =3D &line->chks[i]; >>>> + >>>> + sz +=3D snprintf(page + sz, PAGE_SIZE - sz, >>>> + = "%d\t%d\t%d\t%d\t%d\t\t%llu\t\t%llu\t%llu\n", >>>> + line->id, i, >>>> + chunk->state, >>>> + chunk->type, >>>> + chunk->wi, >>>> + chunk->slba, >>>> + chunk->cnlb, >>>> + chunk->wp); >>>> + } >>>> + >>>> + return sz; >>>> +} >>>> + >>>> static ssize_t pblk_sysfs_lines(struct pblk *pblk, char *page) >>>> { >>>> struct nvm_tgt_dev *dev =3D pblk->dev; >>>> @@ -398,6 +432,29 @@ static ssize_t pblk_sysfs_stats_debug(struct = pblk *pblk, char *page) >>>> } >>>> #endif >>>> + >>>> +static ssize_t pblk_sysfs_line_state_store(struct pblk *pblk, = const char *page, >>>> + size_t len) >>>> +{ >>>> + struct pblk_line_mgmt *l_mg =3D &pblk->l_mg; >>>> + size_t c_len; >>>> + int line_id; >>>> + >>>> + c_len =3D strcspn(page, "\n"); >>>> + if (c_len >=3D len) >>>> + return -EINVAL; >>>> + >>>> + if (kstrtouint(page, 0, &line_id)) >>>> + return -EINVAL; >>>> + >>>> + if (line_id < 0 || line_id >=3D l_mg->nr_lines) >>>> + return -EINVAL; >>>> + >>>> + atomic_set(&l_mg->sysfs_line_state, line_id); >>>> + >>>> + return len; >>>> +} >>>> + >>>> static ssize_t pblk_sysfs_gc_force(struct pblk *pblk, const char = *page, >>>> size_t len) >>>> { >>>> @@ -529,6 +586,11 @@ static struct attribute sys_lines_info_attr =3D = { >>>> .mode =3D 0444, >>>> }; >>>> +static struct attribute sys_line_state_attr =3D { >>>> + .name =3D "line_state", >>>> + .mode =3D 0644, >>>> +}; >>>> + >>>> static struct attribute sys_gc_force =3D { >>>> .name =3D "gc_force", >>>> .mode =3D 0200, >>>> @@ -572,6 +634,7 @@ static struct attribute *pblk_attrs[] =3D { >>>> &sys_stats_ppaf_attr, >>>> &sys_lines_attr, >>>> &sys_lines_info_attr, >>>> + &sys_line_state_attr, >>>> &sys_write_amp_mileage, >>>> &sys_write_amp_trip, >>>> &sys_padding_dist, >>>> @@ -602,6 +665,8 @@ static ssize_t pblk_sysfs_show(struct kobject = *kobj, struct attribute *attr, >>>> return pblk_sysfs_lines(pblk, buf); >>>> else if (strcmp(attr->name, "lines_info") =3D=3D 0) >>>> return pblk_sysfs_lines_info(pblk, buf); >>>> + else if (strcmp(attr->name, "line_state") =3D=3D 0) >>>> + return pblk_sysfs_line_state_show(pblk, buf); >>>> else if (strcmp(attr->name, "max_sec_per_write") =3D=3D 0) >>>> return pblk_sysfs_get_sec_per_write(pblk, buf); >>>> else if (strcmp(attr->name, "write_amp_mileage") =3D=3D 0) >>>> @@ -628,6 +693,8 @@ static ssize_t pblk_sysfs_store(struct kobject = *kobj, struct attribute *attr, >>>> return pblk_sysfs_set_sec_per_write(pblk, buf, len); >>>> else if (strcmp(attr->name, "write_amp_trip") =3D=3D 0) >>>> return pblk_sysfs_set_write_amp_trip(pblk, buf, len); >>>> + else if (strcmp(attr->name, "line_state") =3D=3D 0) >>>> + return pblk_sysfs_line_state_store(pblk, buf, len); >>>> else if (strcmp(attr->name, "padding_dist") =3D=3D 0) >>>> return pblk_sysfs_set_padding_dist(pblk, buf, len); >>>> return 0; >>>> diff --git a/drivers/lightnvm/pblk.h b/drivers/lightnvm/pblk.h >>>> index 6e1fcd1a538a..bc31c67b725f 100644 >>>> --- a/drivers/lightnvm/pblk.h >>>> +++ b/drivers/lightnvm/pblk.h >>>> @@ -201,6 +201,8 @@ struct pblk_rb { >>>> struct pblk_lun { >>>> struct ppa_addr bppa; >>>> + struct ppa_addr chunk_bppa; >>>> + >>>> struct semaphore wr_sem; >>>> }; >>>> @@ -297,6 +299,7 @@ enum { >>>> PBLK_LINETYPE_DATA =3D 2, >>>> /* Line state */ >>>> + PBLK_LINESTATE_NEW =3D 9, >>>> PBLK_LINESTATE_FREE =3D 10, >>>> PBLK_LINESTATE_OPEN =3D 11, >>>> PBLK_LINESTATE_CLOSED =3D 12, >>>> @@ -412,6 +415,15 @@ struct pblk_smeta { >>>> struct line_smeta *buf; /* smeta buffer in persistent = format */ >>>> }; >>>> +struct pblk_chunk { >>>> + int state; >>>> + int type; >>>> + int wi; >>>> + u64 slba; >>>> + u64 cnlb; >>>> + u64 wp; >>>> +}; >>>> + >>>=20 >>> How come the code replicate the nvm_chk_meta data structure? >> There is no need to maintain the chunk reserved bits in nvm_chk_meta = and >> waste memory in the process, as these are alive for the whole life of >> the pblk instance. This structure only maintains the necessary bits. >=20 > nvm_chk_meta is 32 bytes: >=20 > struct nvm_chk_meta { > u8 state; /* 0 1 */ > u8 type; /* 1 1 */ > u8 wli; /* 2 1 */ > u8 rsvd[5]; /* 3 5 */ > u64 slba; /* 8 8 */ > u64 cnlb; /* 16 8 */ > u64 wp; /* 24 8 */ >=20 > /* size: 32, cachelines: 1, members: 7 */ > /* last cacheline: 32 bytes */ > }; >=20 > pblk_chunk is 40 bytes: >=20 > struct pblk_chunk { > int state; /* 0 4 */ > int type; /* 4 4 */ > int wi; /* 8 4 */ >=20 > /* XXX 4 bytes hole, try to pack */ >=20 > u64 slba; /* 16 8 */ > u64 cnlb; /* 24 8 */ > u64 wp; /* 32 8 */ >=20 > /* size: 40, cachelines: 1, members: 6 */ > /* sum members: 36, holes: 1, sum holes: 4 */ > /* last cacheline: 40 bytes */ > }; >=20 > If assuming 64K chunks, pblk chunk uses 2560KB (and requires an extra > traversal when the chunk meta information is returned to copy over the > data to the new data structure), while nvm_chk_meta uses 2048KB. > Granted, the nvm_chk_meta state/type/wli fields will be slightly > slower to access due to the bit shifting, but I think it is worth it. > One can measure and see if there is any performance difference. >=20 Ok. This is not in the fast path now. We can look into it if it gets there at some point. I'll fix in the next version. Javier --Apple-Mail=_D6526EAC-F639-46AF-B84B-BA6806B24E5D Content-Transfer-Encoding: 7bit Content-Disposition: attachment; filename=signature.asc Content-Type: application/pgp-signature; name=signature.asc Content-Description: Message signed with OpenPGP -----BEGIN PGP SIGNATURE----- iQIzBAEBCgAdFiEEU1dMZpvMIkj0jATvPEYBfS0leOAFAlqVtmgACgkQPEYBfS0l eOALDw/9FkMn4r//lni6ysItj3Kry/WaHfbVv0IxO2xwyLGNKHzN3hZQvgasvZqg cYwZLiPh3weev8tAbIhoEq/14o6HcaeULc8+7NnF0h+eA3OUDHSHpFADXLUZEpyg sciIHXC2nSQmEY0AhS/VzXi9mh2vhOJwo7pZqbsoFkNX9rWLpO9Xb76bcro1Y8kI iij1b/bX8LWmQArJuCxUG3kj45bZMg1lU4aKbYpx3lRGfjD9qBkG+zb14tg6OEpS Wuy60RHXcn2x/jGZ5O+kbi+JQO3ciGKP2sXej12rIVPFXSbh10FPRKL5EcGnHeAB 9HpteXQDWyYZpcFvtYnLFOgRfoSnaLEOd4lEqLBWJEcx7a0xuarTcs3BmEBYapfS IwhQdRri31vuOP6OnrcfMD5IDYByDrkmMFhKsZqI1q1RqSKIWHiCaWa6dYQNonxi NGyIEWYtJdhq1wkOiKCGMbbTznLdzIhpsGEJazOedBgc/u3pHAnoyC00hdOqtEJW lB0EdPpjRkmWLNGVDP1jNVaAeDXJzzqiawF8t6kxg+RFfVVN+l7N9Szjd3rVXXdg +FL1n7eNtrWpC010wDH3ZzBTjmBcA1aJBEo5/G1ZGkts4LHZRJ9Eh1DGfKV4y3OJ DxLGUrJrncgZ+KZaNdcwRQNjDL4HnZVxz6Uc1d92lxqWBfa9V44= =6hiW -----END PGP SIGNATURE----- --Apple-Mail=_D6526EAC-F639-46AF-B84B-BA6806B24E5D--