Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
    id S965074AbXAGUGF (ORCPT ); Sun, 7 Jan 2007 15:06:05 -0500
Received: (majordomo@vger.kernel.org) by vger.kernel.org
    id S965051AbXAGUGE (ORCPT ); Sun, 7 Jan 2007 15:06:04 -0500
Received: from mx1.redhat.com ([66.187.233.31]:52554 "EHLO mx1.redhat.com"
    rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP
    id S965074AbXAGUGD (ORCPT ); Sun, 7 Jan 2007 15:06:03 -0500
Date: Sun, 7 Jan 2007 15:05:53 -0500
From: Dave Jones
To: Alan , David Woodhouse , Tilman Schmidt , Linux Kernel Mailing List
Subject: Re: OT: character encodings (was: Linux 2.6.20-rc4)
Message-ID: <20070107200553.GA15101@redhat.com>
Mail-Followup-To: Dave Jones , Alan , David Woodhouse , Tilman Schmidt ,
    Linux Kernel Mailing List
References: <20070107114439.GC21613@flint.arm.linux.org.uk>
    <45A0F060.9090207@imap.cc>
    <1168182838.14763.24.camel@shinybook.infradead.org>
    <20070107153833.GA21133@flint.arm.linux.org.uk>
    <20070107182151.7cc544f3@localhost.localdomain>
    <20070107191730.GD21133@flint.arm.linux.org.uk>
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
In-Reply-To: <20070107191730.GD21133@flint.arm.linux.org.uk>
User-Agent: Mutt/1.4.2.2i
Sender: linux-kernel-owner@vger.kernel.org
X-Mailing-List: linux-kernel@vger.kernel.org
Content-Length: 1701
Lines: 41

On Sun, Jan 07, 2007 at 07:17:30PM +0000, Russell King wrote:

> commit 24ebead82bbf9785909d4cf205e2df5e9ff7da32
> tree 921f686860e918a01c3d3fb6cd106ba82bf4ace6
> parent 264166e604a7e14c278e31cadd1afb06a7d51a11
> author Rafa³ Bilski 1167691774 +0100
> committer Dave Jones 1167799119 -0500
>
> and looking at that "author" closer with od:
>
> 0000140 74 68 6f 72 20 52 61 66 61 b3 20 42 69 6c 73 6b
>          t  h  o  r     R  a  f  a  ³     B  i  l  s  k
>
> clearly not UTF-8.  I doubt whether any of the commits I do on my
> en_GB ISO-8859-1 systems end up being UTF-8 encoded.

This has been bugging me for a while.
Viewing the mail I applied in mutt shows his name correctly as Rafał
Applying it with git-applymbox and viewing the log on master.kernel.org
with git log shows Rafa
And then later when put into email it turns into Rafa³

> But the point is there is charset damage which has happened _long_ before
> Linus' action.  There is no character set defined for the contents of git
> repositories, and as such the output of the git tools can not be
> interpreted as any one single character set.

If there's something I should be doing when I commit that I'm not,
I'll be happy to change my scripts.  My $LANG is set to en_US.UTF-8
which should DTRT to the best of my knowledge, but clearly, that isn't
the case.

    Dave

-- 
http://www.codemonkey.org.uk
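
The byte sequence in the od dump quoted above can be checked against the candidate character sets. What follows is a minimal Python sketch, not anything taken from the thread: the helper `try_decode` and the choice of encodings to test are assumptions made for illustration. It decodes the five bytes `52 61 66 61 b3` as UTF-8, ISO-8859-1 and ISO-8859-2, and also prints what the name would look like if it had been stored as UTF-8.

```python
# Bytes of the author's first name, copied from the od dump quoted above.
raw = bytes.fromhex("52 61 66 61 b3")


def try_decode(data: bytes, encoding: str) -> str:
    """Hypothetical helper: decode data, or return a marker if it is invalid."""
    try:
        return data.decode(encoding)
    except UnicodeDecodeError:
        return "<invalid>"


# Encodings to try; the list is an assumption, chosen to match the thread.
results = {
    enc: try_decode(raw, enc)
    for enc in ("utf-8", "iso-8859-1", "iso-8859-2")
}

for enc, text in results.items():
    print(f"{enc:>10}: {text}")

# For comparison: the bytes the name would have if it were stored as UTF-8.
utf8_bytes = "Rafa\u0142".encode("utf-8")
print("utf-8 bytes:", " ".join(f"{b:02x}" for b in utf8_bytes))
```

The lone byte 0xb3 is not valid UTF-8, reads as ³ in ISO-8859-1 and as ł in ISO-8859-2. That is consistent with the name having been stored in ISO-8859-2 and later displayed as ISO-8859-1, though this is an inference from the bytes and is not stated anywhere in the thread.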