[OAI-implementers] character encoding

Todd White tmwhite@merit.edu
Thu, 30 Oct 2003 15:55:08 -0500 (EST)


i sent a message to the list some time ago and, while working on other
non-XML and non-OAI projects, i've been closing watching the list in hopes
of finding the solution to my encoding problem.  i'm embarrassed to admit
that this encoding problem remains.

perhaps i should provide some details...

DATA STORAGE:  Oracle
DATA DELIVERY:  DBI.pm
OAI CONSTRUCTOR:  Perl script (using Embperl)
WEB SERVER:  Apache

in other words, i have a single Perl script, in the form an Embperl file,
that draws the data from Oracle, via DBI, then i simply loop through the
data and wrap each element with the appropriate XML tag before returning
the whole mess through STDOUT.

i'm guessing that i should encode each character to UTF-8 as it passes
through the script, but as yet, i'm not sure how to best do this.

any helpful tips, advice, rants, etc. will be most welcome.  i thank you
in advance.

 -Todd