[OAI-implementers] converting filenames of metadata records

Thomas Krämer kraemert@smail.uni-koeln.de
Wed, 22 Oct 2003 16:40:06 +0200


Hello

i am developping a middleware, that uses metadata harvested with the 
DLESE OAI software.

thus, there is a directory with hundreds of metadata records, that are 
not sorted nor can queries be formulated in order to retrieve the 
relevant among them.

Q1:Am i right assuming that repositories DO NOT offer any search 
interfaces, but provide their entire metadata and nothing more?

Q2:Am i right assuming that the DLESE OAI software has the apache lucene 
search api integrated, but is not yet working?


however, i am currently trying to use the apache lucene search api to 
index these records and make them searchable.

certain problem appears, when i try to read a record :


     SAXBuilder builder = new SAXBuilder();
     try {
       Document doc = builder.build(recordfile.getAbsoluteFile());
       Element root = doc.getRootElement();
       listChildren(root, 0);
     }

i always get an io.FileNotFoundException, as the oai-pmh changes the 
host separator  " : " into " %3A "
the pathname indicated at debugging is the correct one (using the "%3A" 
, such as the record files on my system)

but the exception tells me :

java.io.FileNotFoundException: 
/home/tom/mwd/metadata/7374617475733D696E7072657373/oai_dc/oai:sammelpunkt.philo.at:103.xml 
(No such file or directory)

i am working on a linux system.



Q3:Is it possible to tell the DLESE OAI Software to save the records on 
the local system using ":" instead of the hex representation, or, to 
wrap the records filename in a way that
admits the java native classes to open the records?



Thanks a lot for any hint

Thomas