[OAI-implementers] valid character encoding

Thomas G. Habing thabing@uiuc.edu
Wed, 13 Aug 2003 09:49:32 -0500


Todd White wrote:
> is there a limited number of valid character encodings for a valid OAI
> repository?
> 
> the encoding i am using is "ISO-8859-1"  this is to support some special
> characters in our metadata that were not supported by UTF-8.
> 
> when i tested our newly developed OAI respository software using the
> web-based Open Archives Initiative - Repository Explorer
> (http://oai.dlib.vt.edu/cgi-bin/Explorer/oai2.0/testoai) it told me...
> 
>   XML Schema Validation Error !
>   Illegal character encoding in XML
> 
> here's the URL to our repository:
>   http://michiganteacher.net/oai
> 
> 
> _______________________________________________
> OAI-implementers mailing list
> List information, archives, preferences and to unsubscribe:
> http://oaisrv.nsdl.cornell.edu/mailman/listinfo/oai-implementers
> 
> 

The OAI spec mandates that all XML responses must be encoded as UTF-8.

I believe that every character from ISO-8859-1 has a Unicode point so you 
should be able to encode them as UTF-8 with some translation.

Regards,
	Tom

-- 
Thomas Habing
Research Programmer, Digital Library Projects
University of Illinois at Urbana-Champaign
155 Grainger Engineering Library Information Center, MC-274
thabing@uiuc.edu, (217) 244-4425
http://dli.grainger.uiuc.edu