[OAI-implementers] Complex objects/embedded metadata/multiple schemas

Roy Tennant roy.tennant at ucop.edu
Mon Aug 1 11:23:25 EDT 2005


Speaking from the perspective of a nascent service provider, I would  
have to say "the more the merrier" when it comes to exposing metadata  
via OAI-PMH. As a service provider, I can make my own choices as to  
which metadata format I prefer to harvest, and having a much richer  
schema (albeit one that may take some review and post-processing for  
my particular purposes), is still desirable over having no choice at  
all. Having said that, I would say that I would prefer to have the  
metadata exposed as separate descriptive metadata packages if  
possible rather than in a METS wrapper, since as you know the METS  
wrapper adds complexity. Specifically, I mean the data provider might  
offer three different schemas for the metadata:

1) the required simple DC
2) a qualified DC
3) MODS

as an example. This would allow me as a service provider to easily  
consume the metadata without first having to "unpack" and understand  
your particular METS profile. Especially since we are putting the  
metadata into our own METS profile on our end ;-). My two cents, for  
what it's worth.
Roy Tennant

On Jul 26, 2005, at 10:42 AM, Tansley, Robert wrote:

> Hi all,
>
> This issue has been skirted in a few previous mails.  Right now,  
> DSpace
> can expose metadata about complex objects via its OAI-PMH data  
> provider
> interface, either as MPEG-21 DIDL or METS.
>
> One problem of course is that since METS is essentially an envelope  
> for
> other kinds of metadata, it's pretty pointless to just say "I support
> METS".  It's one step further than "I support XML" but to get
> interoperability you need to go quite a bit further.
>
> In other words, as an OAI-PMH service provider (or a tool hoping to do
> some resource harvesting/mirroring) I can't just ask for the metadata
> prefix "mets" and hope to understand what comes out.  E.g. will it  
> have
> qualified DC in it, or MODS?
>
> Perhaps, rather than by namespace/schema, I really need to specify by
> profile (and in the case of DSpace even that's not likely to be  
> uniform
> across the board).  Should I be using a metadataPrefix like
> "mets_dspace" or somesuch?  Just using "mets" might be setting an
> expectation of interoperability/understandability that just isn't  
> there.
> On the other hand, using "mets_dspace" might be removing a possibility
> -- if I expose METS which has qualified Dublin Core in it, a harvester
> might at least be able to extract and use that, even if it doesn't
> understand some of the other metadata embedded in the METS.
>
> Another issue is whether having multiple schemas in harvested  
> records is
> going to cause a problem for harvesters.  Even in the simple case of
> METS with some qualified DC in it, there's the METS and QDC  
> schemas.  I
> don't have enough experience on the harvester/service provider side of
> things to know whether this will cause headaches.
>
> Is this pushing really beyond what OAI-PMH was designed for?
>
> All thoughts/comments welcomed...
>
>  Robert TANSLEY / Digital Media Systems Programme / HP Labs
>   http://www.hpl.hp.com/personal/Robert_Tansley/
>
> _______________________________________________
> OAI-implementers mailing list
> List information, archives, preferences and to unsubscribe:
> http://www.openarchives.org/mailman/listinfo/oai-implementers
>
>
>
>




More information about the OAI-implementers mailing list