Harvesting other Metadata Formats than OAI-DCΒΆ

By default, Sickle’s mapping of the record XML into Python dictionaries is tailored to work only with Dublin-Core-encoded metadata payloads. Other formats most probably won’t be mapped correctly, especially if they are more hierarchically structured than Dublin Core.

In case your want to harvest these more complex formats, you have to write your own record model class by subclassing the default implementation that unpacks the metadata XML:

from sickle.models import Record

class MyRecord(Record):
    # Your XML unpacking implementation goes here.
    pass

Note

Take a look at the implementation of sickle.models.Record to get an idea of how to do this.

Next, associate your implementation with OAI verbs in the Sickle object. In this case, we want the Sickle object to use our implementation to represent items returned by ListRecords and GetRecord responses:

sickle = Sickle('http://...')
sickle.class_mapping['ListRecords'] = MyRecord
sickle.class_mapping['GetRecord'] = MyRecord

If you need to rewrite all item implementations, you can also provide a complete mapping to the Sickle object at instantiation:

my_mapping = {
    'ListRecords': MyRecord,
    'GetRecord': MyRecord,
    # ...
}

sickle = Sickle('http://...', class_mapping=my_mapping)