Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757417AbXJ0SBS (ORCPT ); Sat, 27 Oct 2007 14:01:18 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752815AbXJ0SBJ (ORCPT ); Sat, 27 Oct 2007 14:01:09 -0400 Received: from x35.xmailserver.org ([64.71.152.41]:32819 "EHLO x35.xmailserver.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752340AbXJ0SBI (ORCPT ); Sat, 27 Oct 2007 14:01:08 -0400 X-AuthUser: davidel@xmailserver.org Date: Sat, 27 Oct 2007 11:01:04 -0700 (PDT) From: Davide Libenzi X-X-Sender: davide@alien.or.mcafeemobile.com To: Willy Tarreau cc: Marc Lehmann , Eric Dumazet , Linux Kernel Mailing List Subject: Re: epoll design problems with common fork/exec patterns In-Reply-To: <20071027173828.GY10199@1wt.eu> Message-ID: References: <20071027062236.GA12476@schmorp.de> <4722F575.8080204@cosmosbay.com> <20071027085125.GC12326@schmorp.de> <20071027173828.GY10199@1wt.eu> X-GPG-FINGRPRINT: CFAE 5BEE FD36 F65E E640 56FE 0974 BF23 270F 474E X-GPG-PUBLIC_KEY: http://www.xmailserver.org/davidel.asc MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2463 Lines: 69 On Sat, 27 Oct 2007, Willy Tarreau wrote: > On Sat, Oct 27, 2007 at 09:59:07AM -0700, Davide Libenzi wrote: > > On Sat, 27 Oct 2007, Marc Lehmann wrote: > > > > > > Please provide some code to illustrate one exact problem you have. > > > > > > // assume there is an open epoll set that listens for events on fd 5 > > > if (fork () = 0) > > > { > > > close (5); > > > // fd 5 is now removed from the epoll set of the parent. > > > _exit (0); > > > } > > > > Hmmm ... what? I assume you know that: > > > > 1) A file descriptor is a userspace view/handle of a kernel object > > > > 2) The kernel object has a use-count for as many file descriptors that > > have been handed out to userspace > > > > 3) A close() decreases the internal counter by one > > > > 4) The kernel object gets effectively closed when the internal counter > > goes to zero > > > > 5) A fork() acts as a dup() on the file descriptors by hence bumping up > > its internal counter > > > > 6) Epoll removes the file from the set, when the *kernel* object gets > > closed (internal use-count goes to zero) > > > > With that in mind, how can the code snippet above trigger a removal from > > the epoll set? > > Davide, > > from what I understand, Marc is not asking for the code above to remove > the fd from the epoll set, but he's in fact complaining that he *observed* > that the fd was removed from the epoll set in the *parent* process when > the child closes it, which is of course not expected at all. As strange > as it looks like, this might need investigation. It is possible that there > is some strange bug somewhere in some kernel versions. That would be *really* strange, since epoll hooks in __fput() in order to perform proper cleanup. This means that, in the case above, the file will be really closed in the parent too. That, I think, would trigger way more serious problems in userspace. > Marc, I think that if you indicate the last kernel version on which you > observed this and provide a very short and easy reproducer, it would > help everyone investigating this. Basically something which reports "OK" > or "KO". Of course. That'd be great. - Davide - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/