From: Kyle Moffett
To: Adrian Bunk
Cc: "Ahmed S. Darwish", Casey Schaufler, akpm@osdl.org, torvalds@osdl.org, linux-security-module@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH] Smackv10: Smack rules grammar + their stateful parser
Date: Tue, 6 Nov 2007 03:26:12 -0500
In-Reply-To: <20071106063305.GA26163@stusta.de>
References: <472B8DAF.9080706@schaufler-ca.com> <20071103164303.GA26707@ubuntu> <20071106063305.GA26163@stusta.de>

On Nov 06, 2007, at 01:33:05, Adrian Bunk wrote:
> Can you limit this to 7bit ASCII and use isascii() somewhere?
>
> Otherwise I'd expect funny things to happen when you e.g. use
> isspace() on the UTF-8 encoded character ?.

Actually, you don't need to.  You tell them it expects UTF-8 encoded
strings and be done with it.  All US-ASCII characters from 0 through
127 (i.e. high bit clear) are exactly the same in UTF-8, and every
byte of a multi-byte UTF-8 character has the high bit set.  Therefore
you just assume that anything with the high bit set is part of a word
and you can handle basic UTF-8.  (It doesn't work on special UTF-8
space characters like nonbreaking space and similar, but handling
those is significantly more complicated.)
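
Something along these lines is what I have in mind.  This is only a
rough sketch, not code from the Smack patch: is_word_byte() and
next_word() are made-up names, and the real parser keeps its own
state.  The only point is that isspace() is never called on a byte
with the high bit set.

```c
#include <ctype.h>
#include <stddef.h>

/*
 * Hypothetical helper: nonzero if byte c is part of a word.
 * Any byte with the high bit set belongs to a multi-byte UTF-8
 * sequence, so it is treated as a word byte and never handed to
 * isspace().
 */
static int is_word_byte(unsigned char c)
{
	if (c & 0x80)
		return 1;
	return c != '\0' && !isspace(c);
}

/*
 * Hypothetical helper: find the next word in the NUL-terminated
 * string s, starting at offset *pos.  On success, returns a pointer
 * to the start of the word, stores its length in bytes in *len and
 * advances *pos past it.  Returns NULL when no word is left.
 */
static const char *next_word(const char *s, size_t *pos, size_t *len)
{
	size_t i = *pos;
	size_t start;

	/* Skip ASCII whitespace in front of the word. */
	while (s[i] != '\0' && !is_word_byte((unsigned char)s[i]))
		i++;
	if (s[i] == '\0')
		return NULL;

	/* Consume word bytes; is_word_byte() is false for NUL. */
	start = i;
	while (is_word_byte((unsigned char)s[i]))
		i++;

	*len = i - start;
	*pos = i;
	return s + start;
}
```

Calling next_word() in a loop with pos starting at 0 walks the words
of a rule line.  As noted above, a nonbreaking space (bytes 0xC2 0xA0
in UTF-8) ends up glued into the surrounding word.
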

Cheers,
Kyle Moffett