2005-09-01 19:17:04

by Ed L. Cashin

[permalink] [raw]
Subject: Re: aoe fails on sparc64

"David S. Miller" <[email protected]> writes:

> From: Ed L Cashin <[email protected]>
...
>> OK. 67553994410557440 is 61440 byte swapped in 64 bits, and 30MB is
>> 61440 sectors, so this should be a simple byte order fix.
>
> More strangely, the upper and lower 32-bit words are swapped.
> The bytes within each 32-bit word are swapped correctly.
>
> So the calculation maybe should be something like:
>
> __le32 *p = (__le32 *) &id[100 << 1];
> u32 high32 = le32_to_cpup(p);
> u32 low32 = le32_to_cpup(p + 1);
>
> ssize = (((u64)high32 << 32) | (u64) low32);
>
> But that doesn't make any sense, and even ide_fix_driveid() in
> drivers/ide/ide-iops.c does a le64_to_cpu() for this value:
>
> id->lba_capacity_2 = __le64_to_cpu(id->lba_capacity_2);
>
> I wonder if this is some artifact of how AOE devices encode
> this field when sending it to the client.

Well, an EtherDrive blade just copies the ATA identify response data
into a network packet without looking at it. The vblade, though, has
to set the lba_capacity and lba_capacity_2 fields itself.

The aoe driver looks OK, but it turns out there's a byte swapping bug
in the vblade that could be related if he's running the vblade on a
big endian host (even though he said it was an x86 host), but I
haven't heard back from the original poster yet.

The vblade bug was the omission of swapping the bytes in each short.
The fix below shows what I mean:

diff -urNp a-exp/ata.c b-exp/ata.c
--- a-exp/ata.c 2005-09-01 10:19:11.000000000 -0400
+++ b-exp/ata.c 2005-09-01 10:19:12.000000000 -0400
@@ -55,24 +55,29 @@ setfld(ushort *a, int idx, int len, char
}

static void
-setlba28(ushort *p, vlong lba)
+setlba28(ushort *ident, vlong lba)
{
- p += 60;
- *p++ = lba & 0xffff;
- *p = lba >> 16 & 0x0fffffff;
+ uchar *cp;
+
+ cp = (uchar *) &ident[60];
+ *cp++ = lba;
+ *cp++ = lba >>= 8;
+ *cp++ = lba >>= 8;
+ *cp++ = (lba >>= 8) & 0xf;
}

static void
-setlba48(ushort *p, vlong lba)
+setlba48(ushort *ident, vlong lba)
{
- p += 100;
- *p++ = lba;
- lba >>= 16;
- *p++ = lba;
- lba >>= 16;
- *p++ = lba;
- lba >>= 16;
- *p = lba;
+ uchar *cp;
+
+ cp = (uchar *) &ident[100];
+ *cp++ = lba;
+ *cp++ = lba >>= 8;
+ *cp++ = lba >>= 8;
+ *cp++ = lba >>= 8;
+ *cp++ = lba >>= 8;
+ *cp++ = lba >>= 8;
}

void


--
Ed L Cashin <[email protected]>


2005-09-01 19:45:50

by David Miller

[permalink] [raw]
Subject: Re: aoe fails on sparc64

From: Ed L Cashin <[email protected]>
Date: Thu, 01 Sep 2005 15:13:52 -0400

> The aoe driver looks OK, but it turns out there's a byte swapping bug
> in the vblade that could be related if he's running the vblade on a
> big endian host (even though he said it was an x86 host), but I
> haven't heard back from the original poster yet.

I see, thanks for looking into this.

2005-09-03 16:06:34

by Jim MacBaine

[permalink] [raw]
Subject: Re: aoe fails on sparc64

On 9/1/05, Ed L Cashin <[email protected]> wrote:

> The aoe driver looks OK, but it turns out there's a byte swapping bug
> in the vblade that could be related if he's running the vblade on a
> big endian host (even though he said it was an x86 host), but I
> haven't heard back from the original poster yet.

It is in fact a x86_64 kernel, but with a mostly x86 userland. Vblade
is pure x86 code.

> The vblade bug was the omission of swapping the bytes in each short.
> The fix below shows what I mean:

Unfortunately it doesn't fix anything here. The client still reports
the same wrong size as before. The dmesg output is identical, too.

Regards,
Jim

2005-09-06 20:37:06

by Ed L. Cashin

[permalink] [raw]
Subject: Re: aoe fails on sparc64

Jim MacBaine <[email protected]> writes:

> On 9/1/05, Ed L Cashin <[email protected]> wrote:
>
>> The aoe driver looks OK, but it turns out there's a byte swapping bug
>> in the vblade that could be related if he's running the vblade on a
>> big endian host (even though he said it was an x86 host), but I
>> haven't heard back from the original poster yet.
>
> It is in fact a x86_64 kernel, but with a mostly x86 userland. Vblade
> is pure x86 code.
>
>> The vblade bug was the omission of swapping the bytes in each short.
>> The fix below shows what I mean:
>
> Unfortunately it doesn't fix anything here. The client still reports
> the same wrong size as before. The dmesg output is identical, too.

Let's take this discussion off the lkml, because I doubt there's a
problem with the aoe driver in the kernel, and I can easily follow up
to the lkml with a synopsis if it turns out I'm wrong.

Jim MacBaine, I'm going to ask for more details in a separate email.

--
Ed L Cashin <[email protected]>