[OAI-implementers] character vs entity references

Ed Summers ehs@pobox.com
Tue, 4 Nov 2003 10:44:48 -0600


On Tue, Nov 04, 2003 at 09:58:55AM -0500, Todd White wrote:
> $string =~ tr/\0-\x{ff}//UC;

Search for tr/ in the following pages for some fun Perl archaeology. 

    http://www.perldoc.com/perl5.005_03/pod/perlop.html
    http://www.perldoc.com/perl5.6.0/pod/perlop.html
    http://www.perldoc.com/perl5.6.1/pod/perlop.html

You can see the UC modifiers were introduced in 5.6.0 and quickly 
dropped in 5.6.1 (and in versions thereafter). 5.6.0 is a notoriously
buggy release, I think in part because of it's UTF8 handling. These
problems have been fixed in versions >= 5.8.0, which is the first
recommended release of Perl for safely working with UTF8.

Funny, I always thought Perl held backwards compatability sacrosanct...
not including Perl6 of course :) 

You might be interested in this list for Perl library folks: 
http://perl4lib.perl.org for discussion of Perl esoterica and more.

//Ed