From: NeilBrown
To: Konstantin Khlebnikov, Konstantin Khlebnikov, Shaohua Li
Cc: "linux-kernel@vger.kernel.org", linux-raid@vger.kernel.org,
	linux-block@vger.kernel.org, Jens Axboe, Christoph Hellwig
Date: Mon, 21 Nov 2016 12:23:10 +1100
Subject: Re: [BUG 4.4.26] bio->bi_bdev == NULL in raid6 return_io()
Message-ID: <87r365eidd.fsf@notabene.neil.brown.name>
References: <251e243a-ebcd-ae83-0850-a2143d2423ca@yandex-team.ru>
	<20161107194627.hsdk7zqoxznxdixl@kernel.org>

On Sun, Nov 20 2016, Konstantin Khlebnikov wrote:

> On 07.11.2016 23:34, Konstantin Khlebnikov wrote:
>> On Mon, Nov 7, 2016 at 10:46 PM, Shaohua Li wrote:
>>> On Sat, Nov 05, 2016 at 01:48:45PM +0300, Konstantin Khlebnikov wrote:
>>>> return_io() resolves the request_queue even if the trace point isn't active:
>>>>
>>>> static inline struct request_queue *bdev_get_queue(struct block_device *bdev)
>>>> {
>>>>         return bdev->bd_disk->queue;    /* this is never NULL */
>>>> }
>>>>
>>>> static void return_io(struct bio_list *return_bi)
>>>> {
>>>>         struct bio *bi;
>>>>
>>>>         while ((bi = bio_list_pop(return_bi)) != NULL) {
>>>>                 bi->bi_iter.bi_size = 0;
>>>>                 trace_block_bio_complete(bdev_get_queue(bi->bi_bdev),
>>>>                                          bi, 0);
>>>>                 bio_endio(bi);
>>>>         }
>>>> }
>>>
>>> I can't see how this could happen. What kind of tests/environment are you running?
>>
>> That was a random piece of production somewhere.
>> According to the timing, all the crashes happened soon after reboot.
>> There are several raids; probably some of them were still under resync.
>>
>> For now we have only a few machines with this kernel, but I'm sure
>> I'll get many more soon =)
>
> I've added this debug patch to catch overflow of the active-stripes counter in the bio:
>
> --- a/drivers/md/raid5.c
> +++ b/drivers/md/raid5.c
> @@ -164,6 +164,7 @@ static inline void raid5_inc_bi_active_stripes(struct bio *bio)
>  {
>  	atomic_t *segments = (atomic_t *)&bio->bi_phys_segments;
>  	atomic_inc(segments);
> +	BUG_ON(!(atomic_read(segments) & 0xffff));
>  }
>
> And it fired.  The counter in %edx = 0x00010000.
>
> So it looks like one bio (a discard?) can cover more than 65535 stripes.

65535 stripes is 256M.  I guess that is possible.

Christoph has suggested that now would be a good time to stop using
bi_phys_segments like this.

I have some patches which should fix this.  I'll post them shortly.
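For reference, a minimal user-space sketch (illustrative names, not the
kernel code; the 4KiB-per-stripe-unit figure is an assumption that matches
the 256M estimate above) of how the two 16-bit counters share
bi_phys_segments and why the low half reads back as zero once the active
count passes 65535:

#include <stdint.h>
#include <stdio.h>

/* Stand-in for bio->bi_phys_segments: the low 16 bits count active
 * stripes, the high 16 bits count processed stripes. */
static uint32_t phys_segments;

static void inc_active_stripes(void)
{
	phys_segments++;		/* models atomic_inc(segments) */
}

static uint16_t active_stripes(void)
{
	return phys_segments & 0xffff;	/* what the BUG_ON above tests */
}

int main(void)
{
	/* One large discard bio spanning 65536 stripes: with 4KiB per
	 * stripe unit that is 65536 * 4KiB = 256MiB of address space. */
	for (int i = 0; i < 65536; i++)
		inc_active_stripes();

	/* Prints: raw = 0x00010000, active = 0 -- the increment carried
	 * into the "processed" half, so the active count appears empty. */
	printf("raw = 0x%08x, active = %u\n", phys_segments, active_stripes());
	return 0;
}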
I'd appreciate it if you would test and confirm that they work (and
don't break anything else).

Thanks,
NeilBrown