[OAI-implementers] How to verify a download worked?

Alan Kent ajk@mds.rmit.edu.au
Mon, 4 Mar 2002 15:14:44 +1100

Hi All,

I was wondering if anyone has good schemes for verifying if a download
of metadata 'worked'. For example, I crawled the arXiv site and got
around 60,000 records. However, it turns out the site actually has
190,000 or so records. So I only got 1/3 of the site!

Has anyone used any clever tricks to verify how well a crawl worked?
I now have to work out if my crawler has been discarding two in three
records! :-(
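
The only check I can think of so far is to fetch the identifier list
separately (e.g. with the ListIdentifiers verb) and compare it against
the identifiers of the records I actually stored. A rough sketch of that
comparison in Python, assuming both lists are already in memory; the
function name coverage_report and the sample identifiers are made up for
illustration, and nothing here talks to a real repository:

```python
def coverage_report(expected_ids, harvested_ids):
    """Compare identifiers the repository advertises with those stored locally."""
    expected = set(expected_ids)
    harvested = set(harvested_ids)

    # Advertised by the repository but never stored by the crawler.
    missing = expected - harvested
    # Stored locally but not advertised (e.g. deleted or renamed records).
    unexpected = harvested - expected

    # Fraction of advertised identifiers that were actually harvested.
    if expected:
        coverage = len(expected & harvested) / len(expected)
    else:
        coverage = 1.0

    return {
        "coverage": coverage,
        "missing": sorted(missing),
        "unexpected": sorted(unexpected),
    }


if __name__ == "__main__":
    # Hypothetical sample data, not real harvest results.
    advertised = ["oai:example:1", "oai:example:2", "oai:example:3"]
    stored = ["oai:example:1"]
    report = coverage_report(advertised, stored)
    print("coverage: {:.0%}".format(report["coverage"]))
    print("missing:", report["missing"])
```

This only tells me which records are missing, not why, and it assumes
the identifier list itself came back complete.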

