From susanne.dobratz@rz.hu-berlin.de Mon Apr 8 19:44:48 2002 From: susanne.dobratz@rz.hu-berlin.de (Susanne Dobratz) Date: Mon, 08 Apr 2002 20:44:48 +0200 Subject: [OAI-implementers] invitation to open archives forum workshop in europe, May 13.-14. Pisa, Italy Message-ID: <3CB1E520.2070708@rz.hu-berlin.de> This is a multi-part message in MIME format. --------------040002070507030507000606 Content-Type: multipart/alternative; boundary="------------090403010201050705040905" --------------090403010201050705040905 Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit Dear OAI-implementers Members, apologies for cross-posting. I'd like to invite you to a workshop about OAI usage and experience in Europe. The organizing committee is very pleased to invite you to take part in the 1st Open Archive Forum Workshop that will be held in Pisa on May 13-14, 2002 . The workshop aims to set up a European forum on the open archive activities. In particular, it intends to bring together organizations which works in the field in order to favor the exchange of information and the establishment of new collaborative links. Best wishes, Susanne Dobratz. --------------090403010201050705040905 Content-Type: text/html; charset=us-ascii Content-Transfer-Encoding: 7bit

Dear OAI-implementers Members,


apologies for cross-posting.

I'd like to invite you to a workshop about OAI usage and experience in Europe.

The organizing committee is very pleased to invite you to take part in the 1st Open Archive Forum Workshop that will be held in Pisa on May 13-14, 2002 . The workshop aims to set up a European forum on the open archive activities. In particular, it intends to bring together organizations which works in the field in order to favor the exchange of information and the establishment of new collaborative links.        
 

Best wishes,
Susanne Dobratz.
--------------090403010201050705040905-- --------------040002070507030507000606 Content-Type: text/html; charset=us-ascii; name="invitation-neu.html" Content-Transfer-Encoding: 7bit Content-Disposition: inline; filename="invitation-neu.html" pisainvitation

 
 
Creating a European Forum 
for Open Archives Activities

13-14th May 2002, Pisa Italy

 

****************************************************************************

The “open archives” approach is gaining popularity in the scientific, cultural and learning communities. The technology adopted makes it possible to disseminate and exploit content that is currently not accessible to the wider public at a relatively low cost. If the full potential of this approach is fulfilled, it will have a significant impact on the building of new and more powerful service provision.

 

The Open Archives Forum (OA-Forum) is an EC 5th Framework Accompanying Measure for supporting the dissemination of information about European activities that adopt an open-archives based approach. OA-Forum aims at building a forum where European and national initiatives that use this approach can share their experiences and co-ordinate the development of software tools and infrastructures. Special attention is dedicated to those initiatives which are implementing or using the Open Archives Initiative Metadata Harvesting Protocol (< http://www.openarchives.org >) but the forum is also alert to other emerging solutions.

 

The First OA-Forum Workshop seeks to bring together researchers, technical implementers and project managers who are experimenting, or are willing to experiment, with  the open archives approach. The goal of the Workshop is to set up the basis of a European forum for sharing experiences and solutions and encouraging networking among projects. In order to achieve this objective the workshop will consists of presentations given by invited speakers as well as  small group sessions where participants can discuss key issues, possibly raised during the workshop itself. Possible topics are:

 

* Communities

What communities can benefit from the open archive approach? Do incentives, technical and organisational challenges, etc. differ in different communities? Which communities are already implementing the Open Archive Initiative Metadata Harvesting Protocol or other solutions? What kinds of services are emerging?

* Definition of the key vocabulary

Can we agree on  basic terminology? What are the main concepts that are used in the open archive everyday activity? Do these concepts differ for different communities? Are the definitions of these concepts clear? Can we jointly build a glossary

* Organizational Issues

What lessons about organisational issues, both barriers to adoption and positive factors, can we learn from early adopters? Which are the business models adopted? Are there issues related to subject-based archives as opposed to institutional-based archives? Can organisations reach consensus on metadata formats?

* Technical issues

Which kind of software tools are needed to support the open archives approach? What tools are already being used? Who is developing them? In Europe ? Elsewhere? Are they available to other implementers? Is there an appropriate “Open Source” environment for these tools?

* Building services in an open archive environment

What lessons have been learned by early-adopters of the open archives approach in the context of service provision? Can we extend openness to services? How can this be supported? What  kind of organizational and technical infrastructures are needed to facilitate this development?

 

The discussions and results of the Workshop will be summarised in a Workshop report that will be published on the OA-Forum Website.

 

Who should attend

 

People from organisations or projects that implement, or are considering implementing , open archives. Technical, research and project managers who are interested in  the open archives approach.

 

Invited Speakers (to be completed)

Fabio Asnicar,  International School For Advanced Studies, Italy --   Francoise Genova  , Strasbourg Astronomical Data Center, France   --  Michael Nelson, Old Dominon University, USA  --   Umberto Straccia , IEI-CNR, Italy  --   Martin Veseley, CERN, Switzerland  -- 

 

Venue

 

13-14th May, 2002

 

CNR

Area della Ricerca di Pisa,

Via G. Moruzzi, 1

Pisa (Italy)

 
 

The registration form and the workshop agenda are available on the Project Website ( http://www.oaforum.org ).


Booking deadline:
19 April 2002


If you have queries  please do not hesitate to  contact Donatella Castelli   (tel. +39 050 3152902, castelli@iei.pi.cnr.it ) or Francesca Borri  (tel. +39 050 3153470, borri @iei.pi.cnr.it ) .   

 

 

--------------040002070507030507000606-- From ldodds@ingenta.com Wed Apr 10 16:52:40 2002 From: ldodds@ingenta.com (Leigh Dodds) Date: Wed, 10 Apr 2002 16:52:40 +0100 Subject: [OAI-implementers] Clarification on deleted records Message-ID: Hi, I'm not clear on the following wording in the specification, could someone clarify this for me? "When returning a harvested record or identifier of a record, the ListRecords and ListIdentifiers service requests may indicate a status of "deleted". This status means that an item has been deleted and therefore no record can be disseminated from it. The length of time that a given repository keeps track of deleted items is not defined by the protocol. Therefore, the only guaranteed method in the protocol for determining whether a record can be returned by a repository (its corresponding item still exists) is through the GetRecord service request." What I'm not sure about is why the only guaranteed method for determing whether a record can be returned is GetRecord. The second sentence (reading "This status...") notes that if the status is deleted, then no record can be returned. So we already have a definitive answer don't we? Digging further I see that the GetRecord schema notes that a record *can* be returned with a deleted status -- which seems contrary to the above? However if an identifer doesn't exist then no record will be returned in the GetRecord response. So is it the case that records for items with a deleted status will always be available (i.e. the metadata is can still be harvested) but after a period (determined by the archive) GetRecord may subsequently return no metadata. Thanks in advance, L. -- Leigh Dodds, Research Group, Ingenta | "Pluralitas non est ponenda http://weblogs.userland.com/eclectic | sine necessitate" http://www.xml.com/pub/xmldeviant | -- William of Ockham From hussein@vt.edu Wed Apr 10 19:47:46 2002 From: hussein@vt.edu (Hussein Suleman) Date: Wed, 10 Apr 2002 14:47:46 -0400 Subject: [OAI-implementers] Clarification on deleted records References: Message-ID: <3CB488D2.3060700@vt.edu> hi if deleted records are tracked by the archive then there is no problem - both ListRecords and GetRecord can return a deleted status and service providers can act accordingly. if deleted records are not tracked, it is quite conceivable that an archive may, for example, delete a row in the database when an item is deleted. thus, future ListRecords requests will not reflect that the item is deleted (since it simply no longer exists). in such a case, the service provider has to regularly issue GetRecord with the identifiers of previously harvested records in order to confirm that each of them does still exist. alternatively, the service provider can issue GetRecord just before presenting the record through a user interface - that makes network utilization dependent on the needs of actual users. a third solution is to just throw everything away every now and then and reharvest from scratch. to answer your last question, if an archive tracks deleted records, GetRecord (or ListRecords) for a deleted item will return only the header and not the metadata (or about). any archive that tracks deleted items ought to store at least the identifiers and the dates when the items were deleted or harvesting will not work properly. hope this helps. ttfn ----hussein Leigh Dodds wrote: > Hi, > > I'm not clear on the following wording in the specification, could > someone clarify this for me? > > "When returning a harvested record or identifier of a record, the > ListRecords and ListIdentifiers service requests may indicate a status > of "deleted". This status means that an item has been deleted and > therefore no record can be disseminated from it. The length of time > that a given repository keeps track of deleted items is not defined by > the protocol. Therefore, the only guaranteed method in the protocol > for determining whether a record can be returned by a repository > (its corresponding item still exists) is through the GetRecord service > request." > > What I'm not sure about is why the only guaranteed method > for determing whether a record can be returned is GetRecord. > The second sentence (reading "This status...") notes that if > the status is deleted, then no record can be returned. So we > already have a definitive answer don't we? > > Digging further I see that the GetRecord schema notes that a > record *can* be returned with a deleted status -- which seems > contrary to the above? > > However if an identifer doesn't exist then no record will be returned in > the GetRecord response. > > So is it the case that records for items with a deleted status will always > be available (i.e. the metadata is can still be harvested) but > after a period (determined by the archive) GetRecord may > subsequently return no metadata. > > Thanks in advance, > > L. > > -- ====================================================================== hussein suleman - hussein@vt.edu - vtcs - http://www.husseinsspace.com ====================================================================== From herbertv@lanl.gov Wed Apr 10 22:09:51 2002 From: herbertv@lanl.gov (herbert van de sompel) Date: Wed, 10 Apr 2002 15:09:51 -0600 Subject: [OAI-implementers] Public release of the OAI-PMH v.2.0 Message-ID: <3CB4AA1F.8E4AB21E@lanl.gov> Apologies for cross-posting: Public release of the OAI-PMH v.2.0 is scheduled for June 1st 2002. This release is the completion of a process begun in January 2001 when Version 1.0 of the OAI-PMH became publicly available. A press release with details about this release is available at http://www.openarchives.org/news/OAI-2.0A-PR.html. OAI Executive: Carl Lagoze Herbert Van de Sompel -- Herbert Van de Sompel digital library research & prototyping Los Alamos National Laboratory - Research Library + 1 (505) 667 1267 / http://lib-www.lanl.gov/~herbertv/ objects on mirror sites are closer than they appear From ajk@mds.rmit.edu.au Thu Apr 11 00:35:14 2002 From: ajk@mds.rmit.edu.au (Alan Kent) Date: Thu, 11 Apr 2002 09:35:14 +1000 Subject: [OAI-implementers] Clarification on deleted records In-Reply-To: <3CB488D2.3060700@vt.edu>; from Hussein Suleman on Wed, Apr 10, 2002 at 02:47:46PM -0400 References: <3CB488D2.3060700@vt.edu> Message-ID: <20020411093514.B19611@io.mds.rmit.edu.au> On Wed, Apr 10, 2002 at 02:47:46PM -0400, Hussein Suleman wrote: > hi > ... if deleted records are not tracked... the > service provider has to regularly issue GetRecord with the identifiers > of previously harvested records in order to confirm that each of them > does still exist... I am not suggesting it today, but I can see the time when someone may therefore want a new command to say 'are these 1,000 identifiers still valid?'. Doing a GetRecord per identifier one by one could be pretty slow and will unnecessarily return the record (when all you want to know is if it exists). I was initially thinking 'no-one would ever do GetRecord for every record in the database - its too slow! doing ListRecords or ListIdentifiers would always be faster', but then I realised you could always do a GetRecord on records past a certain age. Even then though, was it ETDCAT that had 500,000 records that came into existance one the same day? 1/2 million GetRecord requests is not a very serious option. I am not proposing a new verb though. Sending 1,000 identifiers or whatever through in a request I suspect won't be a serious option until OAI requests are embedded in something like SOAP. POST would work I guess, but GET???!?! (that would be one long URL!). Periodic complete rehavesting seems the best solution with the protocol as is. Alan From francois@fsconsult.com Thu Apr 11 13:14:28 2002 From: francois@fsconsult.com (Francois Schiettecatte) Date: Thu, 11 Apr 2002 08:14:28 -0400 Subject: [OAI-implementers] Message-ID: Hi First, apologies if you get this message twice. We just wanted to update everyone as to the progress of the my.OAI Search Engine. We have just deployed a number of updates as follows: - Added account creation, you can now create you own accounts should you wish to, this will give you access to your own search history, saved searches, document folders and preferences. You can also set saved searches to run on a regular basis to alert you, via email, of new content as it is added to the databases. We should point out that we update the databases once a week, so you will get updates once a week at most, even if you set the saved search to run daily. - For those of you who donšt want to create an account, you can access the search interface as a guest user. - Search terms are now highlighted in both the summary and the documents. - We have added two buttons to the search form to allow you to select and de-select all the databases with a single click. - We have added extensive help on how to search the databases. - We have added direct links to the documents in the search results if there are any external links, for example, all arXiv abstracts contain links which point back to the actual document stored in arXiv, so you can get to those documents from the search results rather than have to display the document first. Likewise all links and email addresses embedded in the abstracts are converted to HTML links. The URL to the search engine is: http://www.myoai.com/ As always, we welcome feedback, suggestions, comments, critiques, etc... Best regards Francois From simeon@cs.cornell.edu Thu Apr 11 14:23:43 2002 From: simeon@cs.cornell.edu (Simeon Warner) Date: Thu, 11 Apr 2002 09:23:43 -0400 (EDT) Subject: [OAI-implementers] Clarification on deleted records In-Reply-To: <20020411093514.B19611@io.mds.rmit.edu.au> Message-ID: On Thu, 11 Apr 2002, Alan Kent wrote: > On Wed, Apr 10, 2002 at 02:47:46PM -0400, Hussein Suleman wrote: > > hi > > ... if deleted records are not tracked... the > > service provider has to regularly issue GetRecord with the identifiers > > of previously harvested records in order to confirm that each of them > > does still exist... > > I am not suggesting it today, but I can see the time when someone > may therefore want a new command to say 'are these 1,000 identifiers > still valid?'. Doing a GetRecord per identifier one by one could > be pretty slow and will unnecessarily return the record (when all > you want to know is if it exists). > > I was initially thinking 'no-one would ever do GetRecord for every > record in the database - its too slow! doing ListRecords or ListIdentifiers > would always be faster', but then I realised you could always do a > GetRecord on records past a certain age. Even then though, was it > ETDCAT that had 500,000 records that came into existance one the > same day? 1/2 million GetRecord requests is not a very serious > option. > > I am not proposing a new verb though. Sending 1,000 identifiers > or whatever through in a request I suspect won't be a serious option > until OAI requests are embedded in something like SOAP. POST would > work I guess, but GET???!?! (that would be one long URL!). This is one of the reasons why a multi-item GetRecord was not included in v2.0. This may be more reasonable if OAI is later embedded in something like SOAP. > Periodic complete rehavesting seems the best solution with the > protocol as is. That is probably the safest approach with v1.1. (In v2.0, this should not be necessary for a repository that claims to track deleted records forever.) Cheers, Simeon. > Alan > _______________________________________________ > OAI-implementers mailing list > OAI-implementers@oaisrv.nsdl.cornell.edu > http://oaisrv.nsdl.cornell.edu/mailman/listinfo/oai-implementers > From yding@TNC.ORG Fri Apr 12 17:08:50 2002 From: yding@TNC.ORG (Yi-Lun Ding) Date: Fri, 12 Apr 2002 12:08:50 -0400 Subject: [OAI-implementers] Server Load for ListIdentifiers, ListRecords calls Message-ID: I am thinking of implementing OAI, but am a little wary of the load requirements of ListIdentifiers and ListRecords for large document repositories. One, there is the bandwidth requirement of transferring huge blocks of data. Two, the process would have to go through each record in the database and check the TimeModified/Set attributes. How are people dealing with this issue? Thanks, yi-lun From jyoung@oclc.org Fri Apr 12 19:26:25 2002 From: jyoung@oclc.org (Young,Jeff) Date: Fri, 12 Apr 2002 14:26:25 -0400 Subject: [OAI-implementers] Server Load for ListIdentifiers, ListRecor ds calls Message-ID: Yi-Lun, Our theses and dissertations repository has over 4 million records. Performance was so bad in my OAI v1.1 implementation that it was effectively unusable for this size repository. I expect to have it resolved in my 2.0 upgrade. The way I plan to deal with it is to have our OAI server examine the from and until dates to see if they imply a harvest of the repository in its entirity. This should be a reasonable expectation the first time a client harvests a repository. If so, I will read the database directly from beginning to end without going through the indexes. I also plan to use the compression feature of OAIv2. Lastly, I'm currently going through the new server code looking for optimization opportunities, of which there are plenty. Our OAI server and harvester software will be available as open-source. The server is written as a Java Servlet and includes an abstract database interface to allow access to any database engine that implements it. There will even be an implementation of the abstract database class included to treat a file system as a repository. I would encourage you to use an existing open-source implementation of OAI. They are available in a variety of flavors if Java Servlets aren't to your taste. Information about existing implementations is available on http://www.openarchives.org/tools/tools.html. Expect announcements of OAIv2 upgrades in the coming weeks. The more interest there is in reusing these tools, the better we will make them. Sincerely, Jeff > -----Original Message----- > From: Yi-Lun Ding [mailto:yding@TNC.ORG] > Sent: Friday, April 12, 2002 12:09 PM > To: oai-implementers@oaisrv.nsdl.cornell.edu > Subject: [OAI-implementers] Server Load for ListIdentifiers, > ListRecords > calls > > > I am thinking of implementing OAI, but am a little wary of the load > requirements of ListIdentifiers and ListRecords for large document > repositories. One, there is the bandwidth requirement of > transferring huge > blocks of data. Two, the process would have to go through > each record in > the database and check the TimeModified/Set attributes. > > How are people dealing with this issue? > > Thanks, > > yi-lun > > > _______________________________________________ > OAI-implementers mailing list > OAI-implementers@oaisrv.nsdl.cornell.edu > http://oaisrv.nsdl.cornell.edu/mailman/listinfo/oai-implementers > From yding@TNC.ORG Fri Apr 12 22:03:32 2002 From: yding@TNC.ORG (Yi-Lun Ding) Date: Fri, 12 Apr 2002 17:03:32 -0400 Subject: [OAI-implementers] Server Load for ListIdentifiers, ListRecords calls In-Reply-To: Message-ID: Jeff: Have you thought about replicating the metadata in another database, and letting the secondary database handle the crawl calls, e.g., ListRecords, ListIdentifiers? Even with an elegant solution, I am still concerned about the load on my primary database. I am tempted to just return "Service Unavailable" for anything that requires a big db dump. I have not seen anything about 2.0 yet, but are there considerations to limit certain calls by hostname and/or by time? Also, the combination of our middleware and object-oriented database schema may limit me in terms of existing solutions. -----Original Message----- From: Young,Jeff [mailto:jyoung@oclc.org] Sent: Friday, April 12, 2002 2:26 PM To: 'yding@TNC.ORG'; oai-implementers@oaisrv.nsdl.cornell.edu Subject: RE: [OAI-implementers] Server Load for ListIdentifiers, ListRecords calls Yi-Lun, Our theses and dissertations repository has over 4 million records. Performance was so bad in my OAI v1.1 implementation that it was effectively unusable for this size repository. I expect to have it resolved in my 2.0 upgrade. The way I plan to deal with it is to have our OAI server examine the from and until dates to see if they imply a harvest of the repository in its entirity. This should be a reasonable expectation the first time a client harvests a repository. If so, I will read the database directly from beginning to end without going through the indexes. I also plan to use the compression feature of OAIv2. Lastly, I'm currently going through the new server code looking for optimization opportunities, of which there are plenty. Our OAI server and harvester software will be available as open-source. The server is written as a Java Servlet and includes an abstract database interface to allow access to any database engine that implements it. There will even be an implementation of the abstract database class included to treat a file system as a repository. I would encourage you to use an existing open-source implementation of OAI. They are available in a variety of flavors if Java Servlets aren't to your taste. Information about existing implementations is available on http://www.openarchives.org/tools/tools.html. Expect announcements of OAIv2 upgrades in the coming weeks. The more interest there is in reusing these tools, the better we will make them. Sincerely, Jeff > -----Original Message----- > From: Yi-Lun Ding [mailto:yding@TNC.ORG] > Sent: Friday, April 12, 2002 12:09 PM > To: oai-implementers@oaisrv.nsdl.cornell.edu > Subject: [OAI-implementers] Server Load for ListIdentifiers, > ListRecords > calls > > > I am thinking of implementing OAI, but am a little wary of the load > requirements of ListIdentifiers and ListRecords for large document > repositories. One, there is the bandwidth requirement of > transferring huge > blocks of data. Two, the process would have to go through > each record in > the database and check the TimeModified/Set attributes. > > How are people dealing with this issue? > > Thanks, > > yi-lun > > > _______________________________________________ > OAI-implementers mailing list > OAI-implementers@oaisrv.nsdl.cornell.edu > http://oaisrv.nsdl.cornell.edu/mailman/listinfo/oai-implementers > From mln@ils.unc.edu Fri Apr 12 22:20:09 2002 From: mln@ils.unc.edu (Michael L. Nelson) Date: Fri, 12 Apr 2002 17:20:09 -0400 (EDT) Subject: [OAI-implementers] Server Load for ListIdentifiers, ListRecords calls In-Reply-To: Message-ID: On Fri, 12 Apr 2002, Yi-Lun Ding wrote: > Jeff: > > Have you thought about replicating the metadata in another database, and > letting the secondary database handle the crawl calls, e.g., ListRecords, > ListIdentifiers? though my DP is much smaller, I implemented the capability for the NACA OAI interface to redirect the harvester to a back-up version (on another network) if the load on the host machine was above a configurable parameter. I did this with an http 302 status code. In doing this, however, you should be careful to set requestURL element to what the harvester asked, not the actual URL (i.e., a.foo.org, not b.bar.org). assuming you have the resources to do so, running mirrors / backups of your repository is the the right thing to do (tm). > > Even with an elegant solution, I am still concerned about the load on my > primary database. I am tempted to just return "Service Unavailable" for > anything that requires a big db dump. I have not seen anything about 2.0 > yet, but are there considerations to limit certain calls by hostname and/or > by time? 2.0 does not address nor prohibit this; it is out of scope of the protocol itself, but can be accomplished using the standard set of http mechanisms. it is entirely possible for you to limit access to your repository based on hostname, time, passwords, etc. you'd probably want to avoid shutting off things like "ListRecords" altogether (you could become non-complaint that way), but you could play with the number of records returned before a resumptionToken is issued, intervals specified in your 503 response, etc. Contextually dependent harvesting is sure to be a reality as more DPs come online. regards, Michael > Also, the combination of our middleware and object-oriented > database schema may limit me in terms of existing solutions. > > -----Original Message----- > From: Young,Jeff [mailto:jyoung@oclc.org] > Sent: Friday, April 12, 2002 2:26 PM > To: 'yding@TNC.ORG'; oai-implementers@oaisrv.nsdl.cornell.edu > Subject: RE: [OAI-implementers] Server Load for ListIdentifiers, > ListRecords calls > > > Yi-Lun, > > Our theses and dissertations repository has over 4 million records. > Performance was so bad in my OAI v1.1 implementation that it was effectively > unusable for this size repository. I expect to have it resolved in my 2.0 > upgrade. > > The way I plan to deal with it is to have our OAI server examine the from > and until dates to see if they imply a harvest of the repository in its > entirity. This should be a reasonable expectation the first time a client > harvests a repository. If so, I will read the database directly from > beginning to end without going through the indexes. I also plan to use the > compression feature of OAIv2. Lastly, I'm currently going through the new > server code looking for optimization opportunities, of which there are > plenty. > > Our OAI server and harvester software will be available as open-source. The > server is written as a Java Servlet and includes an abstract database > interface to allow access to any database engine that implements it. There > will even be an implementation of the abstract database class included to > treat a file system as a repository. > > I would encourage you to use an existing open-source implementation of OAI. > They are available in a variety of flavors if Java Servlets aren't to your > taste. Information about existing implementations is available on > http://www.openarchives.org/tools/tools.html. Expect announcements of OAIv2 > upgrades in the coming weeks. The more interest there is in reusing these > tools, the better we will make them. > > Sincerely, > > Jeff > > > -----Original Message----- > > From: Yi-Lun Ding [mailto:yding@TNC.ORG] > > Sent: Friday, April 12, 2002 12:09 PM > > To: oai-implementers@oaisrv.nsdl.cornell.edu > > Subject: [OAI-implementers] Server Load for ListIdentifiers, > > ListRecords > > calls > > > > > > I am thinking of implementing OAI, but am a little wary of the load > > requirements of ListIdentifiers and ListRecords for large document > > repositories. One, there is the bandwidth requirement of > > transferring huge > > blocks of data. Two, the process would have to go through > > each record in > > the database and check the TimeModified/Set attributes. > > > > How are people dealing with this issue? > > > > Thanks, > > > > yi-lun > > > > > > _______________________________________________ > > OAI-implementers mailing list > > OAI-implementers@oaisrv.nsdl.cornell.edu > > http://oaisrv.nsdl.cornell.edu/mailman/listinfo/oai-implementers > > > > _______________________________________________ > OAI-implementers mailing list > OAI-implementers@oaisrv.nsdl.cornell.edu > http://oaisrv.nsdl.cornell.edu/mailman/listinfo/oai-implementers > --- Michael L. Nelson NASA Langley Research Center m.l.nelson@larc.nasa.gov MS 158, Hampton, VA 23681 http://www.ils.unc.edu/~mln/ +1 757 864 8511 +1 757 864 8342 (f) From jyoung@oclc.org Fri Apr 12 22:29:08 2002 From: jyoung@oclc.org (Young,Jeff) Date: Fri, 12 Apr 2002 17:29:08 -0400 Subject: [OAI-implementers] Server Load for ListIdentifiers, ListRecor ds calls Message-ID: I see. My 45 million record database isn't being used by any other system, so I never really worried about performance degradation. You may be on the right track with the secondary database idea. Put an OAI server on top of both of them, but make sure the primary repository is behind your firewall. Access from outside the firewall should only go to the secondary repository. Run an OAI harvester against the primary on a regular basis to keep the secondary current. That way, the load on your main system will be minimal. Jeff > -----Original Message----- > From: Yi-Lun Ding [mailto:yding@TNC.ORG] > Sent: Friday, April 12, 2002 5:04 PM > To: oai-implementers@oaisrv.nsdl.cornell.edu > Subject: RE: [OAI-implementers] Server Load for ListIdentifiers, > ListRecords calls > > > Jeff: > > Have you thought about replicating the metadata in another > database, and > letting the secondary database handle the crawl calls, e.g., > ListRecords, > ListIdentifiers? > > Even with an elegant solution, I am still concerned about the > load on my > primary database. I am tempted to just return "Service > Unavailable" for > anything that requires a big db dump. I have not seen > anything about 2.0 > yet, but are there considerations to limit certain calls by > hostname and/or > by time? Also, the combination of our middleware and object-oriented > database schema may limit me in terms of existing solutions. > > -----Original Message----- > From: Young,Jeff [mailto:jyoung@oclc.org] > Sent: Friday, April 12, 2002 2:26 PM > To: 'yding@TNC.ORG'; oai-implementers@oaisrv.nsdl.cornell.edu > Subject: RE: [OAI-implementers] Server Load for ListIdentifiers, > ListRecords calls > > > Yi-Lun, > > Our theses and dissertations repository has over 4 million records. > Performance was so bad in my OAI v1.1 implementation that it > was effectively > unusable for this size repository. I expect to have it > resolved in my 2.0 > upgrade. > > The way I plan to deal with it is to have our OAI server > examine the from > and until dates to see if they imply a harvest of the > repository in its > entirity. This should be a reasonable expectation the first > time a client > harvests a repository. If so, I will read the database directly from > beginning to end without going through the indexes. I also > plan to use the > compression feature of OAIv2. Lastly, I'm currently going > through the new > server code looking for optimization opportunities, of which there are > plenty. > > Our OAI server and harvester software will be available as > open-source. The > server is written as a Java Servlet and includes an abstract database > interface to allow access to any database engine that > implements it. There > will even be an implementation of the abstract database class > included to > treat a file system as a repository. > > I would encourage you to use an existing open-source > implementation of OAI. > They are available in a variety of flavors if Java Servlets > aren't to your > taste. Information about existing implementations is available on > http://www.openarchives.org/tools/tools.html. Expect > announcements of OAIv2 > upgrades in the coming weeks. The more interest there is in > reusing these > tools, the better we will make them. > > Sincerely, > > Jeff > > > -----Original Message----- > > From: Yi-Lun Ding [mailto:yding@TNC.ORG] > > Sent: Friday, April 12, 2002 12:09 PM > > To: oai-implementers@oaisrv.nsdl.cornell.edu > > Subject: [OAI-implementers] Server Load for ListIdentifiers, > > ListRecords > > calls > > > > > > I am thinking of implementing OAI, but am a little wary of the load > > requirements of ListIdentifiers and ListRecords for large document > > repositories. One, there is the bandwidth requirement of > > transferring huge > > blocks of data. Two, the process would have to go through > > each record in > > the database and check the TimeModified/Set attributes. > > > > How are people dealing with this issue? > > > > Thanks, > > > > yi-lun > > > > > > _______________________________________________ > > OAI-implementers mailing list > > OAI-implementers@oaisrv.nsdl.cornell.edu > > http://oaisrv.nsdl.cornell.edu/mailman/listinfo/oai-implementers > > > > _______________________________________________ > OAI-implementers mailing list > OAI-implementers@oaisrv.nsdl.cornell.edu > http://oaisrv.nsdl.cornell.edu/mailman/listinfo/oai-implementers > From cjg@ecs.soton.ac.uk Fri Apr 12 18:26:19 2002 From: cjg@ecs.soton.ac.uk (Christopher Gutteridge) Date: Fri, 12 Apr 2002 18:26:19 +0100 Subject: [OAI-implementers] EPrints.org & OAI2.0a3 Message-ID: <20020412172619.GB10447@ecs.soton.ac.uk> I can't implement OAI2.0a3 ListIdentifiers without looking up the entire record, whereas in 1.1 I could just list the ID, which was much faster. I'm unhappy about the metadataPrefix filtering on ListIdentifiers and the requirement to list SetSpecs (that's not as big a problem in ListRecords). As I see it, ListIdentifiers is now largely redundant. Also, it would be much easier if ListRecords could list all records which match set & from & until and then give header only for records which they could not generate metadata in the requested metadataPrefix. And, I think that requests for higher granularity than you support could just give for example on a YYYY-MM-DD archive, all records from that day (rounded down) until that day (rounding up). I do really like the new error handling and single schema though. I've got a working version though, which bodes well for OAI2.0a3 being in the next eprints release (sometime soon) -- Christopher Gutteridge -- cjg@ecs.soton.ac.uk -- +44 (0)23 8059 4833 >O___, ________________________________(___)_____________________________________ | | | | Now Playing: "Heaven is | Beware of altruism. It is based on | | Oblivion" from Psycho Drama - | self-deception, the root of all | | Ultraviolence | evil. -- From "The Notebooks of | | | Lazarus Long" by Robert Heinlein | |___________________________________|______________________________________| From ajk@mds.rmit.edu.au Mon Apr 15 09:07:55 2002 From: ajk@mds.rmit.edu.au (Alan Kent) Date: Mon, 15 Apr 2002 18:07:55 +1000 Subject: [OAI-implementers] Server Load for ListIdentifiers, ListRecords calls In-Reply-To: ; from Yi-Lun Ding on Fri, Apr 12, 2002 at 12:08:50PM -0400 References: Message-ID: <20020415180754.H10852@io.mds.rmit.edu.au> On Fri, Apr 12, 2002 at 12:08:50PM -0400, Yi-Lun Ding wrote: > I am thinking of implementing OAI, but am a little wary of the load > requirements of ListIdentifiers and ListRecords for large document > repositories. One, there is the bandwidth requirement of transferring huge > blocks of data. Two, the process would have to go through each record in > the database and check the TimeModified/Set attributes. > > How are people dealing with this issue? > > Thanks, > > yi-lun Not sure exactly of your concerns for "bandwidth" for large document repositories. If you don't want people to harvest a copy of the data, then what do you want to use OAI for? (note: you can provide only metadata about documents - not the full document via OAI - which would be much shorter). As to checking the time modified attribute, I plan to store the collection in a database which can query on time modified (and supported metadata prefixes) so to identify which information to return is a matter of doing a simple query. If you don't have a database engine capable of doing such queries, it will be slower. Alan From comyn@utk.edu Sat Apr 20 22:30:35 2002 From: comyn@utk.edu (Paul Cummins) Date: Sat, 20 Apr 2002 17:30:35 -0400 Subject: [OAI-implementers] OAI - PHP tools or scripts Message-ID: <3CC1A5BB.32043.10FB8C6@localhost> Hi, I just joined this list. AT the univ. of Tennessee Library, I have been working on other projects, but now it looks like I'm in the right spot to work on our OAI related projects. I've been looking at all the tools/interfaces that are available and I noticed in a couple of the search pages that a few php scripts were refered to, but not accessible. Does anyone have any information on any OAI implementation in php? --Paul Cummins UT Library Systems From lagoze@cs.cornell.edu Wed Apr 24 12:01:32 2002 From: lagoze@cs.cornell.edu (Carl Lagoze) Date: Wed, 24 Apr 2002 07:01:32 -0400 Subject: [OAI-implementers] OAI and DCMI announce DC-simple Schema Declaration Message-ID: <706871B20764CD449DB0E8E3D81C4D4302A206C7@opus.cs.cornell.edu> DCMI and OAI are pleased to announce an XML schema for unqualified DC metadata that facilitates the declaration of modular metadata components. The Dublin Core Metadata Initiative and the Open Archives Initiative have been cooperating on metadata issues for some time. Unqualified DC metadata is the default metadata set used in the OAI Protocol for Metadata Harvesting, intended to promote cross-domain interoperability. Other, domain-specific sets are encouraged as well, as envisaged in the modular metadata framework that both communitites have been striving for. This schema has been developed for use with the OAI Protocol, and has been discussed at length in the DC-Architecture working group. It is expected that the schema will be of use for other applications as well, and will be hosted on the DCMI Website and maintained by representatives of both groups. This development is an important landmark in the development of web-based metadata services, reflecting as it does the convergence of community consensus and the development of enabling infrastructure to support that consensus. The schema is available at: http://dublincore.org/schemas/xmls/simpledc20020312.xsd This and other schemas will be linked and described for readers at http://dublincore.org/schemas/ More information on the Open Archives Initiative is available at: http://www.openarchives.org Stuart Weibel Executive Director Dublin Core Metadata Initiative OCLC Office of Research Herbert Van de Sompel Open Archives Initiative Executive Digital Library Research & Prototyping Los Alamos National Laboratory - Research Library Carl Lagoze Open Archives Initiative Executive Computing and Information Science Cornell University Ithaca, NY 14853 USA From r90126009@ms90.ntu.edu.tw Fri Apr 26 08:40:20 2002 From: r90126009@ms90.ntu.edu.tw (Gloria Chang) Date: Fri, 26 Apr 2002 15:40:20 +0800 Subject: [OAI-implementers] question about OAI protocol requests Message-ID: <004701c1ecf5$b67de090$14c6708c@sandy> This is a multi-part message in MIME format. ------=_NextPart_000_0042_01C1ED38.AC0FC270 Content-Type: text/plain; charset="big5" Content-Transfer-Encoding: quoted-printable Hello,=20 =20 I am the research assistant at National Taiwan University.=20 We are just studying and thinking of implementing OAI. But I have some questions about the OAI: =20 1. Is there any situation in which we can increase or decrease the = protocol requests at will? 2. Only the 6 functions -- GetRecord, Identify, ListIdentifier, = ListMatadataFormats, ListRecords,=20 and ListSets -- are protocol requests. Why? And, why did the = committee who draws up the=20 OAI-PMH at beginning choose these 6 protocal requests? =20 Gloria Chang Library and Information Science National Taiwan University Taipei, Taiwan. e-mail: r90126009@ms90.ntu.edut.w ------=_NextPart_000_0042_01C1ED38.AC0FC270 Content-Type: text/html; charset="big5" Content-Transfer-Encoding: quoted-printable
Hello,
 
I = am the=20 research assistant at National=20 Taiwan=20 University.=20
We = are just=20 studying and thinking of implementing OAI.
But = I=20 have some questions about the OAI:
 
1. Is=20 there any situation in which we can increase or decrease the = protocol=20 requests at will?
2. = Only the 6=20 functions -- GetRecord, Identify, ListIdentifier, = ListMatadataFormats,=20 ListRecords,
   =20 and ListSets -- are protocol requests. Why? And, why did the = committee=20 who draws up the
   =20 OAI-PMH at=20 beginning choose these 6 protocal requests?
 
Gloria Chang
Library and Information = Science
National Taiwan University
Taipei, Taiwan.
e-mail: r90126009@ms90.ntu.edut.w
------=_NextPart_000_0042_01C1ED38.AC0FC270-- From simeon@cs.cornell.edu Fri Apr 26 20:36:47 2002 From: simeon@cs.cornell.edu (Simeon Warner) Date: Fri, 26 Apr 2002 15:36:47 -0400 (EDT) Subject: [OAI-implementers] Clarification on deleted records In-Reply-To: Message-ID: I can't see a reply to this, apologies if these questions were already answered. On Wed, 10 Apr 2002, Leigh Dodds wrote: > I'm not clear on the following wording in the specification, could > someone clarify this for me? > > "When returning a harvested record or identifier of a record, the > ListRecords and ListIdentifiers service requests may indicate a status > of "deleted". This status means that an item has been deleted and > therefore no record can be disseminated from it. The length of time > that a given repository keeps track of deleted items is not defined by > the protocol. Therefore, the only guaranteed method in the protocol > for determining whether a record can be returned by a repository > (its corresponding item still exists) is through the GetRecord service > request." > > What I'm not sure about is why the only guaranteed method > for determing whether a record can be returned is GetRecord. > The second sentence (reading "This status...") notes that if > the status is deleted, then no record can be returned. So we > already have a definitive answer don't we? GetRecord and ListMetadataFormats are the only ways to request information about a specific item (they have identifier as an argument). If a repository does not keep track of 'deleted' items and change the datestamp accordingly then nothing will show up in ListRecords or ListIdentifiers when and item is deleted. > Digging further I see that the GetRecord schema notes that a > record *can* be returned with a deleted status -- which seems > contrary to the above? Yes, a can be returned (if the repository keeps track of deleted records), but no block. The documentation really means the block when it says 'record' here. > However if an identifer doesn't exist then no record will be returned in > the GetRecord response. > > So is it the case that records for items with a deleted status will always > be available (i.e. the metadata is can still be harvested) but > after a period (determined by the archive) GetRecord may > subsequently return no metadata. For as long as a repository keeps a record of 'deleted' items, it can return a which has a
block but no block. If, after some period, it forgets about the item (stops keeping 'deleted' status) then no block will be returned. (These concepts will be much better documented in v2.0 but the ideas are not significantly changed.) Cheers, Simeon. > Thanks in advance, > > L. > > From simeon@cs.cornell.edu Fri Apr 26 20:36:47 2002 From: simeon@cs.cornell.edu (Simeon Warner) Date: Fri, 26 Apr 2002 15:36:47 -0400 (EDT) Subject: [OAI-implementers] Clarification on deleted records In-Reply-To: Message-ID: I can't see a reply to this, apologies if these questions were already answered. On Wed, 10 Apr 2002, Leigh Dodds wrote: > I'm not clear on the following wording in the specification, could > someone clarify this for me? > > "When returning a harvested record or identifier of a record, the > ListRecords and ListIdentifiers service requests may indicate a status > of "deleted". This status means that an item has been deleted and > therefore no record can be disseminated from it. The length of time > that a given repository keeps track of deleted items is not defined by > the protocol. Therefore, the only guaranteed method in the protocol > for determining whether a record can be returned by a repository > (its corresponding item still exists) is through the GetRecord service > request." > > What I'm not sure about is why the only guaranteed method > for determing whether a record can be returned is GetRecord. > The second sentence (reading "This status...") notes that if > the status is deleted, then no record can be returned. So we > already have a definitive answer don't we? GetRecord and ListMetadataFormats are the only ways to request information about a specific item (they have identifier as an argument). If a repository does not keep track of 'deleted' items and change the datestamp accordingly then nothing will show up in ListRecords or ListIdentifiers when and item is deleted. > Digging further I see that the GetRecord schema notes that a > record *can* be returned with a deleted status -- which seems > contrary to the above? Yes, a can be returned (if the repository keeps track of deleted records), but no block. The documentation really means the block when it says 'record' here. > However if an identifer doesn't exist then no record will be returned in > the GetRecord response. > > So is it the case that records for items with a deleted status will always > be available (i.e. the metadata is can still be harvested) but > after a period (determined by the archive) GetRecord may > subsequently return no metadata. For as long as a repository keeps a record of 'deleted' items, it can return a which has a
block but no block. If, after some period, it forgets about the item (stops keeping 'deleted' status) then no block will be returned. (These concepts will be much better documented in v2.0 but the ideas are not significantly changed.) Cheers, Simeon. > Thanks in advance, > > L. > > From mln@ils.unc.edu Sat Apr 27 22:31:38 2002 From: mln@ils.unc.edu (Michael L. Nelson) Date: Sat, 27 Apr 2002 17:31:38 -0400 (EDT) Subject: [OAI-implementers] Deadline extended - CFP: Distributed Computing Architectures for Digital Libraries In-Reply-To: Message-ID: FYI, the deadlines for DCADL 02 have been extended: May 15, 2002 Paper Submission May 23, 2002 Notification of Acceptance June 1, 2002 Camera-ready copies due OAI applications are very welcome! regards, Michael > > Call for Papers > > Workshop on Distributed Computing Architectures for Digital Libraries > http://www.cs.odu.edu/~jbollen/icpp2002/ > > to be held in conjunction with the > > 31st International Conference on Parallel Processing (ICPP 2002) > Vancouver, Canada August 18-21, 2002 > http://www.eecg.toronto.edu/icpp2002/ > > Workshop Co-Chairs: > > Johan Bollen > Department of Computer Science > Old Dominion University > Norfolk VA, 23529, USA > http://www.cs.odu.edu/~jbollen/ > > Michael L. Nelson > NASA Langley Research Center > Hampton VA 23681 > http://mln.larc.nasa.gov/~mln/ > > Description: > > Digital libraries (DLs) are increasingly common on the Web, > providing ordered, vetted digital collections to targeted user groups. > To date, much of DL research has focused on the acquisition and > representation of digital objects, optimizing and personalizing user > services, and interoperability efforts. Few DLs employ mirrors, much less > some of the more sophisticated, non-client-server architectures found > in WWW deployment, e.g. peer-to-peer systems and distributed storage > architectures. Although these new architectures have been succesfully > applied to a large number of Internet services, they have had little > impact on DL research. Are they technically suitable for DL use, or do > social and economic issues prevent their adoption? > > This workshop will explore these issues as well as highlight some of the > more novel DL architectures. A range of theoretical, technical, > and speculative papers are sought to discuss and propose alternate DL > architectures and approaches. Papers are requested in the following > and related topics: > > - Peer-to-Peer (P2P) systems > - Adaptive digital libraries > - Wireless access to digital libraries > - Grid computing > - Distributed searching > - Metadata harvesting > - Distributed storage systems > - Serverless storage and information retrieval > - DL requirements vs. ordinary web requirements > - Provenance, trust, integrity and archival issues > in distributed digital libraries > > > Important Dates: > > April 20, 2002 Paper Submission > May 15, 2002 Notification of acceptance > June 1, 2002 Camera-ready copies due > > Submission details: > > Authors are invited to submit research contributions representing > original, previously unpublished work. Submitted papers will be carefully > evaluated by the technical committee for originality, significance, > technical soundness, and clarity of exposition. Submissions will only > be accepted in PDF, emailed to the co-chairs. Accepted papers will be > published by IEEE Computer Society Press as proceedings of the ICPP'2002 > workshops. All submitted papers must be formatted according to the author > guideline provided by IEEE Computer Society Press (two column-format), > and accepted papers must not exceed six pages. Please contact the > co-chairs with any questions. > > Technical Committee > > - Kurt Bollacker, Long Now Foundation (kurt@longnow.org) > - Johan Bollen, Old Dominion University (jbollen@cs.odu.edu) > - Ed Chi, XEROX PARC (echi@parc.xerox.com) > - Fabio Crestani, Strathclyde (fabioc@cs.strath.ac.uk) > - Cliff Joslyn, Los Alamos National Laboratory (joslyn@lanl.gov) > - Thomas Krichel, Long Island University (thomas.krichel@liu.edu) > - Michael Nelson, NASA Langley Research Center (m.l.nelson@larc.nasa.gov) > - Luis M. Rocha, Los Alamos National Laboratory (rocha@lanl.gov) > - Herbert Van de Sompel, Los Alamos National Laboratory > - Simeon Warner, Cornell (simeon@cs.cornell.edu) > - Mohammad Zubair, Old Dominion University (zubair@cs.odu.edu) > > _______________________________________________ > OAI-general mailing list > OAI-general@oaisrv.nsdl.cornell.edu > http://oaisrv.nsdl.cornell.edu/mailman/listinfo/oai-general > --- Michael L. Nelson NASA Langley Research Center m.l.nelson@larc.nasa.gov MS 158, Hampton, VA 23681 http://www.ils.unc.edu/~mln/ +1 757 864 8511 +1 757 864 8342 (f) From mln@ils.unc.edu Sun Apr 28 17:10:58 2002 From: mln@ils.unc.edu (Michael L. Nelson) Date: Sun, 28 Apr 2002 12:10:58 -0400 (EDT) Subject: [OAI-implementers] question about OAI protocol requests In-Reply-To: <004701c1ecf5$b67de090$14c6708c@sandy> Message-ID: Gloria, A lot of the history & motivation of OAI can be found in these papers (in this order): http://www.dlib.org/dlib/february00/vandesompel-ups/02vandesompel-ups.html http://www.dlib.org/dlib/february00/vandesompel-oai/02vandesompel-oai.html http://www.cs.cornell.edu/lagoze/papers/oai-jcdl.pdf http://www.arl.org/newsltr/217/mhp.html other papers are listed at: http://www.openarchives.org/documents/papers.html but I'd recommend those 4 as a place to start. regards, Michael On Fri, 26 Apr 2002, Gloria Chang wrote: > Hello, > > I am the research assistant at National Taiwan University. > We are just studying and thinking of implementing OAI. > But I have some questions about the OAI: > > 1. Is there any situation in which we can increase or decrease the protocol requests at will? > 2. Only the 6 functions -- GetRecord, Identify, ListIdentifier, ListMatadataFormats, ListRecords, > and ListSets -- are protocol requests. Why? And, why did the committee who draws up the > OAI-PMH at beginning choose these 6 protocal requests? > > Gloria Chang > Library and Information Science > National Taiwan University > Taipei, Taiwan. > e-mail: r90126009@ms90.ntu.edut.w > --- Michael L. Nelson NASA Langley Research Center m.l.nelson@larc.nasa.gov MS 158, Hampton, VA 23681 http://www.ils.unc.edu/~mln/ +1 757 864 8511 +1 757 864 8342 (f) From daphne.charles@rchme.co.uk Mon Apr 29 11:34:52 2002 From: daphne.charles@rchme.co.uk (Charles, Daphne) Date: Mon, 29 Apr 2002 11:34:52 +0100 Subject: [OAI-implementers] Questions from a novice Message-ID: <2AD0C5E20039D61181800008C74B955112CAF4@swn15.rchme.uk> Hi, Having read the Harvesting Protocol I have a couple of questions. (I tried to submit these to the OAI website, but no response, so hence the general broadcast) Can this only be implemented with PERL CGI scripts? We are building our site with Active Server Pages on an Oracle database, and ASPs could certainly be used to produce output in XML format. Oracle also has some excellent XML tools. How is the information returned from the repository following a request? ASPs could generate output in the correct format with an XML extension, but this would be on the repository server, so how does it get transferred to the service provider? Similarly for status codes if there is no XML output. Regards Daphne ----------------------------------------------------------------------- Daphne Charles Tel: +44 1793 414894 Senior Developer Fax: +44 1793 414906 National Monuments Record Centre Email: daphne.charles@rchme.co.uk English Heritage Kemble Drive, Swindon, Wilts SN2 2GZ ---------------------------------------------------------------------- From tim@tim.brody.btinternet.co.uk Mon Apr 29 12:11:57 2002 From: tim@tim.brody.btinternet.co.uk (Tim Brody) Date: Mon, 29 Apr 2002 12:11:57 +0100 Subject: [OAI-implementers] Questions from a novice References: <2AD0C5E20039D61181800008C74B955112CAF4@swn15.rchme.uk> Message-ID: <003001c1ef6e$ad6ff080$640aa8c0@Advocate> OAI is based on the HTTP protocol (i.e. the Web). There are a number of implementations of the protocol that use Perl, but there are also implementations in Java, PHP, C (I think). You should be able to implement OAI in any programming language that is supported by a Web server. In OAI 1.x, and 2.0, OAI requests are in the form of a URI (i.e. you can type them into a web browser's location and they'll work). The Web server responds with XML, just the same as it would normally respond with HTML to human clients. In ASP you should be able to change the header that the Web server sends out, including the MIME type (which must be text/xml for OAI responses), or change the status code in the event of an error. You will also need to make sure you are producing XML in the UTF-8 encoding - there may be an option in ASP to do this. n.b. I would suggest waiting for OAI 2.0 (June 1st) before putting a lot of effort into building a 1.1 OAI export. All the best, Tim. ----- Original Message ----- From: "Charles, Daphne" To: Sent: Monday, April 29, 2002 11:34 AM Subject: [OAI-implementers] Questions from a novice > > Hi, > > Having read the Harvesting Protocol I have a couple of questions. (I tried > to submit these to the OAI website, but no response, so hence the general > broadcast) > > Can this only be implemented with PERL CGI scripts? > > We are building our site with Active Server Pages on an Oracle database, and > ASPs could certainly be used to produce output in XML format. Oracle also > has some excellent XML tools. > > How is the information returned from the repository following a request? > ASPs could generate output in the correct format with an XML extension, but > this would be on the repository server, so how does it get transferred to > the service provider? Similarly for status codes if there is no XML output. > > Regards > > Daphne > > > ----------------------------------------------------------------------- > Daphne Charles Tel: +44 1793 414894 > Senior Developer Fax: +44 1793 > 414906 > National Monuments Record Centre Email: > daphne.charles@rchme.co.uk > English Heritage > Kemble Drive, Swindon, Wilts SN2 2GZ > ---------------------------------------------------------------------- > > _______________________________________________ > OAI-implementers mailing list > OAI-implementers@oaisrv.nsdl.cornell.edu > http://oaisrv.nsdl.cornell.edu/mailman/listinfo/oai-implementers From thabing@uiuc.edu Mon Apr 29 15:41:26 2002 From: thabing@uiuc.edu (Thomas G. Habing) Date: Mon, 29 Apr 2002 09:41:26 -0500 Subject: [OAI-implementers] Questions from a novice References: <2AD0C5E20039D61181800008C74B955112CAF4@swn15.rchme.uk> Message-ID: <3CCD5B95.CA2DCAC@uiuc.edu> Hi, We have done a number of OAI implementations using ASP, and all of our scripts are freely available. Go to http://oai.grainger.uiuc.edu/ and click on the 'Provider Tools' link ( http://oai.grainger.uiuc.edu/ProviderTools/ ) Our scripts use MS Access or SQL Server as the database. However, the database interface uses ODBC, so hooking up an Oracle DB to our scripts shouldn't be a problem. We also have an early implementation of the OAI 2.0 protocol, but we are not making it publicly available yet, so I would concur with Tim Brody to wait on the OAI 2.0 specification and tools before investing a lot of work on an OAI provider. We are happy to provide some support for our 1.1 provider tools if you want to try them out prior to the 2.0 releases. Kind Regards, Tom Habing -- Thomas G. Habing Research Programmer, Digital Library Projects University of Illinois at Urbana-Champaign 155 Grainger Engineering Library Information Center, MC-274 thabing@uiuc.edu, (217) 244-4425 http://dli.grainger.uiuc.edu "Charles, Daphne" wrote: > > Hi, > > Having read the Harvesting Protocol I have a couple of questions. (I tried > to submit these to the OAI website, but no response, so hence the general > broadcast) > > Can this only be implemented with PERL CGI scripts? > > We are building our site with Active Server Pages on an Oracle database, and > ASPs could certainly be used to produce output in XML format. Oracle also > has some excellent XML tools. > > How is the information returned from the repository following a request? > ASPs could generate output in the correct format with an XML extension, but > this would be on the repository server, so how does it get transferred to > the service provider? Similarly for status codes if there is no XML output. > > Regards > > Daphne > > ----------------------------------------------------------------------- > Daphne Charles Tel: +44 1793 414894 > Senior Developer Fax: +44 1793 > 414906 > National Monuments Record Centre Email: > daphne.charles@rchme.co.uk > English Heritage > Kemble Drive, Swindon, Wilts SN2 2GZ > ---------------------------------------------------------------------- > > _______________________________________________ > OAI-implementers mailing list > OAI-implementers@oaisrv.nsdl.cornell.edu > http://oaisrv.nsdl.cornell.edu/mailman/listinfo/oai-implementers