[OAI-implementers] implementation of non-English characters w/UTF-8?

Charles Blair chas at jump.lib.uchicago.edu
Tue Sep 13 17:27:47 EDT 2005


> Meaning, we have non-English characters that are "choking" when we
> test our Data Provider.  [Think "e" with the accent over it
> http://lib-app1.usc.edu:8085/oaidp?verb=GetRecord&identifier=oai:usc:digitalarchive:bh
> e/ bhe-m27&metadataPrefix=oai_dc (surname after first name of
> "Elmo").]

this seems to be encoded in iso-8859-1. a unix command like this will
convert it to utf-8:

recode -v ISO-8859-1/..UTF-8 < oaidp.xml > r-oaidp.xml

(recode is GNU recode.)



More information about the OAI-implementers mailing list