Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S262085AbVEXPAB (ORCPT ); Tue, 24 May 2005 11:00:01 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S262084AbVEXO7u (ORCPT ); Tue, 24 May 2005 10:59:50 -0400 Received: from 1-1-4-20a.ras.sth.bostream.se ([82.182.72.90]:28821 "EHLO garbo.kenjo.org") by vger.kernel.org with ESMTP id S262086AbVEXO64 (ORCPT ); Tue, 24 May 2005 10:58:56 -0400 Subject: Re: NFS corruption on 2.6.11.7 From: Kenneth Johansson To: Trond Myklebust Cc: Linux Kernel Mailing List In-Reply-To: <1116936088.10707.39.camel@lade.trondhjem.org> References: <1116888428.5206.14.camel@tiger> <1116894917.11483.111.camel@lade.trondhjem.org> <1116929711.6237.8.camel@tiger> <1116936088.10707.39.camel@lade.trondhjem.org> Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="=-Uz1cBhTCjvNCKFQSHaNZ" Date: Tue, 24 May 2005 16:58:51 +0200 Message-Id: <1116946731.6237.23.camel@tiger> Mime-Version: 1.0 X-Mailer: Evolution 2.2.2 Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 5681 Lines: 177 --=-Uz1cBhTCjvNCKFQSHaNZ Content-Type: text/plain Content-Transfer-Encoding: quoted-printable On Tue, 2005-05-24 at 08:01 -0400, Trond Myklebust wrote: > ty den 24.05.2005 Klokka 12:15 (+0200) skreiv Kenneth Johansson: >=20 > > > > :/export/home/ken /home/ken nfs rw,v3,rsize=3D32768,wsize=3D32768,h= ard,udp,lock,addr=3Damd 0 0 > > > >=20 > > >=20 > > > I'm seeing no problems at all with this on a loopback mount with > > > 2.6.12-rc4. Mind giving us some more details on your setup? > > >=20 > > > Cheers, > > > Trond >=20 > Does the above export line mean that you are running with amd? If so, This only means that I had no imagination naming the computer and simply used the name of the cpu manufacturer used in the computer.=20 > could you retry using an ordinary NFS mount (preferably a loopback mount > - i.e. mount something over "localhost"). This works OK. > Again, please could you give us more details on how you are doing these > tests: what hardware (i.e. what NIC, switch, server, memory,...), lsmod > output, (and ditto for the server). The only new thing is. 0000:00:0e.0 Ethernet controller: D-Link System Inc Gigabit Ethernet Adapte= r (rev 11) And the driver is sk98lin compiled into the kernel.=20 Everything else has been the same for over a year. hmm I did change the switch also but I do not remember what I got.=20 I do not get any problem reading with a osx client also in gigabit speed but the client cpu is much slower so it's not exactly the same thing. > How are you using your scripts? Are you first running one on the server, > then the other on the client, are you deleting the old files before you > start a new run, etc. Telnet to the server run the gen part then run the cmp on the client. And y= es I do delete the files otherwise they would more or less only be in the c= ache. >=20 > > I did some more investigation what type of data error I get and it look= s > > a bit strange. I always get 28 bytes wrong in a sequence some times thi= s > > is data repeated from previous in the file but not always. Anybody kno= w > > what cache line size this cpu has? > >=20 > > processor : 0 > > vendor_id : AuthenticAMD > > cpu family : 6 > > model : 8 > > model name : AMD Athlon(TM) XP 2200+ > > stepping : 0 > > cpu MHz : 1802.998 > > cache size : 256 KB > > fdiv_bug : no > > hlt_bug : no > > f00f_bug : no > > coma_bug : no > > fpu : yes > > fpu_exception : yes > > cpuid level : 1 > > wp : yes > > flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge = mca cmov pat pse36 mmx fxsr sse pni syscall mmxext 3dnowext 3dnow > > bogomips : 3547.13 > >=20 > > Here is a sample if three files with errors in them. > >=20 > > file 13 "od -Ax -tx1z" > >=20 > >=20 > > -924dc0 df b3 0c 89 2d a2 83 da 1c 08 f2 66 da f6 6b f4 >....-......f.= .k.< > > +924dc0 43 11 2a f4 98 09 d5 76 aa 26 83 00 24 3d 11 fd >C.*....v.&..$= =3D..< > >=20 > > -924dd0 af c2 44 57 9a 13 01 43 84 bf 99 c3 1b 16 8a 00 >..DW...C.....= ...< > > +924dd0 3e 64 d7 bd 4f 8d 26 cf 4f 4f 2c 62 1b 16 8a 00 >>d..O.&.OO,b.= ...< > >=20 > >=20 > > 28 bytes wrong in a sequence > > The data is a repeat from previous data in the file. > >=20 > > >grep "43 11 2a f4 98 09 d5 76 aa 26 83 00 24 3d 11 fd" 13_org=20 > > 924d40 43 11 2a f4 98 09 d5 76 aa 26 83 00 24 3d 11 fd >C.*....v.&..$= =3D..< > >=20 > > >grep "43 11 2a f4 98 09 d5 76 aa 26 83 00 24 3d 11 fd" 13_err=20 > > 924d40 43 11 2a f4 98 09 d5 76 aa 26 83 00 24 3d 11 fd >C.*....v.&..$= =3D..< > > 924dc0 43 11 2a f4 98 09 d5 76 aa 26 83 00 24 3d 11 fd >C.*....v.&..$= =3D..< > >=20 > > 924dc0 is a copy of 924d40 > > 128 bytes offset > >=20 > >=20 > > file 14 "od -Ax -tx1z" > >=20 > > -0912f0 91 45 bb cd eb 4f 01 d3 69 27 88 b5 7d 7d 17 8d >.E...O..i'..}= }..< > > +0912f0 b8 3f 4e 5d 2e 86 ed c0 51 79 fe ec 3e 53 c9 29 >.?N]....Qy..>= S.)< > >=20 > > -091300 7d 94 8e f9 81 d0 c2 4a b5 8e c6 af b0 03 4c 16 >}......J.....= .L.< > > +091300 d9 05 ac 0d fc eb 00 71 17 bd fb 3e b0 03 4c 16 >.......q...>.= .L.< > >=20 > > >grep "b8 3f 4e 5d 2e 86 ed c0 51 79 fe ec 3e 53 c9 29" 14_err > > 0912b0 b8 3f 4e 5d 2e 86 ed c0 51 79 fe ec 3e 53 c9 29 >.?N]....Qy..>S= .)< > > 0912f0 b8 3f 4e 5d 2e 86 ed c0 51 79 fe ec 3e 53 c9 29 >.?N]....Qy..>S= .)< > >=20 > > 28 bytes wrong > > 64 bytes offset > >=20 > >=20 > > file 16 "od -Ax -tx1z" > >=20 > > -635200 c3 1d f2 b8 c4 d5 12 c1 3f 48 e6 9d dc 98 1f e5 >........?H...= ...< > > +635200 c3 1d f2 b8 c4 d5 12 c1 00 10 00 00 00 d0 ec 08 >.............= ...< > >=20 > > -635210 9e 54 e7 f1 49 5b 1e d0 9f e2 7c 26 24 cb 98 24 >.T..I[....|&$= ..$< > > +635210 00 10 00 00 00 90 14 08 00 10 00 00 00 50 25 06 >.............= P%.< > >=20 > > -635220 25 fc 63 2a bf 07 b4 c0 cf a1 67 9b ef 01 5d 6d >%.c*......g..= .]m< > > +635220 00 10 00 00 bf 07 b4 c0 cf a1 67 9b ef 01 5d 6d >..........g..= .]m< > >=20 > > 28 bytes wrong=20 > > This time the data is not from this file. > >=20 > >=20 > >=20 > >=20 >=20 --=-Uz1cBhTCjvNCKFQSHaNZ Content-Type: application/pgp-signature; name=signature.asc Content-Description: This is a digitally signed message part -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.1 (GNU/Linux) iD8DBQBCk0ErmDGOmJIy9x8RAj4YAKCJm7g13VSxE7SPjxBJwL2GIacBGACeNjry sB+Y5Eiq1/9qX7xvDtR1H/Y= =B+u+ -----END PGP SIGNATURE----- --=-Uz1cBhTCjvNCKFQSHaNZ-- - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/