2007-03-28 02:19:52

by NeilBrown

[permalink] [raw]
Subject: Re: Odd log message associated with NFS

On Tuesday March 27, [email protected] wrote:
> On Tue, Mar 27, 2007 at 11:40:48AM -0700, Phy Prabab wrote:
> > Hello,
> >
> > I am currently running 2.6.21-rc5 and I am seeing quite a bit of this
> > message in my log files:
> > kernel: rpcsvc: received unknown control message:-2144992132/-1
> >
> > This machine is an dual dual core opteron with 3Ware 9650 (14 disk
> > RAID10), 4G RAM, 64bit kernel, nfsutils-1.0.12 running knfsd. This
> > machine is a file server.
>
> Just a 'me, too':

For the 'unknown control message' messages,
see: "fix typo in svc_udp_recvfrom" previously on
[email protected].

Fix is in -mm and below.

For "svc: bad direction" .... Don't know... garbage on the net maybe?
Or maybe not. I have seen something like that before, but haven't
been able to pin it on anything yet.
NeilBrown


--- linux-2.6.orig/net/sunrpc/svcsock.c
+++ linux-2.6/net/sunrpc/svcsock.c
@@ -779,7 +779,7 @@ svc_udp_recvfrom(struct svc_rqst *rqstp)
}

clear_bit(SK_DATA, &svsk->sk_flags);
- while ((err == kernel_recvmsg(svsk->sk_sock, &msg, NULL,
+ while ((err = kernel_recvmsg(svsk->sk_sock, &msg, NULL,
0, 0, MSG_PEEK | MSG_DONTWAIT)) < 0 ||
(skb = skb_recv_datagram(svsk->sk_sk, 0, 1, &err)) == NULL) {
if (err == -EAGAIN) {


>
> Mar 16 16:57:06 Marvin kernel: svc: bad direction 268435456, dropping request
> Mar 16 17:58:19 Marvin kernel: svc: bad direction 268435456, dropping request
> Mar 16 19:55:49 Marvin kernel: svc: bad direction 268435456, dropping request
> ...
> Mar 17 04:30:03 Marvin kernel: svc: bad direction 268435456, dropping request
> Mar 17 06:28:52 Marvin kernel: pppd(795): unaligned trap at 0000020000207f90: 000000011fd8d33d 29 1
> Mar 17 06:28:55 Marvin kernel: inet_sk_reselect_saddr(): shifting inet->saddr from 82.149.xxx.xxx to 89.166.xxx.xxx
> Mar 17 09:05:21 Marvin kernel: rpcsvc: received unknown control message:-1024/186028080
> Mar 17 14:17:04 Marvin kernel: rpcsvc: received unknown control message:-1024/186028040
> Mar 17 14:17:31 Marvin kernel: rpcsvc: received unknown control message:-1024/186028040
> Mar 17 14:49:14 Marvin kernel: rpcsvc: received unknown control message:-1024/186028080
> Mar 17 14:49:15 Marvin kernel: rpcsvc: received unknown control message:-1024/186028040
>
>
>
> This is from my little 'server', alpha ev4 'noname', 2.6.21-rc3, compiled
> with gcc 4.1.2
>
> client is mostly an alpha ev6 2.6.21-rc3, gcc 4.1.2.
>
> Apart from the messages NFS seems to run just fine currently.
>
> Bye,
> Thorsten
>
>
>
> --
> | Thorsten Kranzkowski Internet: [email protected] |
> | Mobile: ++49 170 1876134 Snail: Kiebitzstr. 14, 49324 Melle, Germany |
> | Ampr: dl8bcu@db0lj.#rpl.deu.eu, [email protected] [44.130.8.19] |
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to [email protected]
> More majordomo info at http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at http://www.tux.org/lkml/


2007-03-28 18:55:44

by Thorsten Kranzkowski

[permalink] [raw]
Subject: Re: Odd log message associated with NFS

On Wed, Mar 28, 2007 at 12:19:52PM +1000, Neil Brown wrote:
> On Tuesday March 27, [email protected] wrote:
> > On Tue, Mar 27, 2007 at 11:40:48AM -0700, Phy Prabab wrote:
> > > kernel: rpcsvc: received unknown control message:-2144992132/-1
> >
> > Just a 'me, too':
>
> For the 'unknown control message' messages,
> see: "fix typo in svc_udp_recvfrom" previously on
> [email protected].
>
> Fix is in -mm and below.

will try, thanks.

> For "svc: bad direction" .... Don't know... garbage on the net maybe?
> Or maybe not. I have seen something like that before, but haven't

I don't think so as it would mean corruption at other places, too. The
Network is quite uninteresting: single ethernet segment, 2 switches
involved, longest cable 8 meters ...

> been able to pin it on anything yet.
> NeilBrown

I _had_ problems with the previous kernel (same version, same config,
different (older) compiler (gcc 3.4.5 instead of 4.1.2), on both client and server)
It would randomly (usually within 30 mins) 'lock up' NFS, where the client
sends out requests but doesn't get any answer (apart from tcp-ack) from the
server. Only a forced umount/remount recovered that (until the next lock-up).
But I'm willing to attribute that to a somehow broken gcc.

again, thanks for commenting.

bye,
Thorsten

> --- linux-2.6.orig/net/sunrpc/svcsock.c
> +++ linux-2.6/net/sunrpc/svcsock.c
> @@ -779,7 +779,7 @@ svc_udp_recvfrom(struct svc_rqst *rqstp)
> }
>
> clear_bit(SK_DATA, &svsk->sk_flags);
> - while ((err == kernel_recvmsg(svsk->sk_sock, &msg, NULL,
> + while ((err = kernel_recvmsg(svsk->sk_sock, &msg, NULL,
> 0, 0, MSG_PEEK | MSG_DONTWAIT)) < 0 ||
> (skb = skb_recv_datagram(svsk->sk_sk, 0, 1, &err)) == NULL) {
> if (err == -EAGAIN) {
>
>
> >
> > Mar 16 16:57:06 Marvin kernel: svc: bad direction 268435456, dropping request
> > Mar 16 17:58:19 Marvin kernel: svc: bad direction 268435456, dropping request
> > Mar 16 19:55:49 Marvin kernel: svc: bad direction 268435456, dropping request
> > ...
> > Mar 17 04:30:03 Marvin kernel: svc: bad direction 268435456, dropping request

--
| Thorsten Kranzkowski Internet: [email protected] |
| Mobile: ++49 170 1876134 Snail: Kiebitzstr. 14, 49324 Melle, Germany |
| Ampr: dl8bcu@db0lj.#rpl.deu.eu, [email protected] [44.130.8.19] |

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
NFS maillist - [email protected]
https://lists.sourceforge.net/lists/listinfo/nfs