[OAI-implementers] returning *data* (as opposed to metadata)

Andy Powell a.powell@ukoln.ac.uk
Wed, 25 Jul 2001 23:09:23 +0100 (BST)

On Wed, 25 Jul 2001, herbert van de sompel wrote:

> But, obviously, if a data provider has full-content in a well-structured
> format, he may as well consider exposing references in a preparsed
> manner as yet another separate "information packet": doing so would lead
> to 3 "metadata-formats" related to the same content: metadata, data,
> references.

Hmmm... the protocol is named "The Open Archives Initiative Protocol for
Metadata Harvesting".  Using it to harvest "Data" seems reasonably
non-intuitive to me, given this name.

I have no problem with the notion that one person's metadata is another
person's data.  But it is a pretty extreme view to say that data in the
form of the full content of an article is metadata - which is what you
appear to be saying above?

I don't disagree that the protocol *could* be used to harvest data - I
just wonder if it *should* be used in that way.  Particularly at this
stage in the life of the protocol?

Can someone clarify the differences/advantages of harvesting data directly
using OAI vs. harvesting metadata using OAI followed by harvesting data
using HTTP based on the URL in the metadata?
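
To make the second route concrete (metadata via OAI, then data via HTTP),
here is a rough Python sketch.  It is only an illustration under stated
assumptions: the sample response is invented and heavily simplified (it is
not a real OAI response envelope), the repository URLs are hypothetical,
and it assumes the content URL is carried in a Dublin Core dc:identifier
element.  The helper names extract_content_urls and fetch_content are my
own, not part of any protocol or library.

```python
import urllib.request
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"  # Dublin Core element namespace

# Hypothetical, heavily simplified harvest result (not a real repository,
# and not the real OAI response envelope).
SAMPLE_RESPONSE = """<records xmlns:dc="http://purl.org/dc/elements/1.1/">
  <record>
    <dc:title>An example article</dc:title>
    <dc:identifier>http://repository.example.org/articles/1.pdf</dc:identifier>
  </record>
  <record>
    <dc:title>Another article</dc:title>
    <dc:identifier>local-id-42</dc:identifier>
    <dc:identifier>http://repository.example.org/articles/2.pdf</dc:identifier>
  </record>
</records>"""


def extract_content_urls(xml_text):
    """Step 1: collect the HTTP URLs found in dc:identifier elements."""
    root = ET.fromstring(xml_text)
    urls = []
    for elem in root.iter("{%s}identifier" % DC_NS):
        value = (elem.text or "").strip()
        # Assumption: only identifiers that look like URLs point at content.
        if value.startswith(("http://", "https://")):
            urls.append(value)
    return urls


def fetch_content(url, timeout=30):
    """Step 2: retrieve the full content with a plain HTTP GET."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


urls = extract_content_urls(SAMPLE_RESPONSE)
print(urls)
```

A harvester would then call fetch_content on each URL it cares about.  The
sketch makes the trade-off visible: the harvester needs a second round of
requests and must guess which identifiers are content URLs, whereas
harvesting the data directly through OAI would avoid both.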

Distributed Systems and Services
UKOLN, University of Bath, Bath, BA2 7AY, UK       a.powell@ukoln.ac.uk
http://www.ukoln.ac.uk/ukoln/staff/a.powell      Voice: +44 1225 323933
Resource Discovery Network http://www.rdn.ac.uk/   Fax: +44 1225 826838