I’d love to hear from anyone in library-land who has used/heard of/rejected/embraced/toyed with MODS (metadata object description schema) as the next evolutionary step from MARC.
I asked this question more locally about a year ago when I first started seriously to inform myself about metadata and got either blank or ‘don’t touch that’ looks back then. Since then the MODS question has hit me again and I am wondering what I don’t know about it for it not to be more broadly known and used. From my reading it appears to be a brilliant alternative to MARC in the new world of things like interoperability, repositories, electronic databases…. It looks damn easy to understand (no marc tags for the uninitiated to navigate) and maps well to anything from the simplest Dublin Core to the most complex ONIX.
One negative comment I did hear was that it cannot catch all MARC data but when I wrote up a table comparing all the fields one could want in a repository I could not see any problem that way at all: the only thing it loses is the excess MARC bits and pieces that are simply not relevant to repositories and the sharing and searching of collections in the new info world. Things like the 246 indicator that codes a varying title as being a “spine” title — what’s a spine in an electronic resource anyway?
Thoughts? Would love feedback since am planning on soon applying all I’ve read about it to the real test world and seeing what happens.
November 8, 2006 at 20:38
When we needed an XML format for Copac (a union library catalogue) we chose the MODS schema. It seemed a pointless exercise creating a DTD/Schema of our own and MODS was available, was being actively supported by the LoC and did
most of what we wanted. We ended up using the MODS extension
element to code up library local holdings information, but even this may not be needed in the future as MODS may be
adding new elements to hold such data.
As you say, it is easy enough to transform the MODS into
pretty much anything else via an XSLT stylesheet.
MODS is also easier for Joe Public to understand than
MARCXML. The downside is that the verboseness of XML
makes for large records.
November 23, 2006 at 08:22
Thanks, Ashley. How long have you been using MODS with Copac now?
November 23, 2006 at 16:42
Neil, only our “experimental” interface (called v3 on the
home page) is currently using MODS. From some time in Q12007
the main interface will be switched over to the MODS
database. So far we’ve loaded about 18 million MODS records
over the last few months. Quite a few more millions to go….
December 3, 2006 at 06:09
Subject: Re: [office-metadata] MODS
* From: Bruce D’Arcus
* To: office-metadata
* Date: Thu, 10 Aug 2006 18:01:22 -0400
Hi Patrick,
On Aug 10, 2006, at 4:58 PM, Patrick Durusau wrote:
> One of my library contacts at the Extreme conference has suggested
> that if we are going to standardize on bibliographic data that we
> should use MODS (Library of Congress).
Yeah, of course a library person would say that. Talk to an archivist
and you’ll get a different answer.
> If that would work it would save us from having to write such a system.
I could write a book on what I know about MODS and related stuff, so
will just try to boil it down to no, NO, NO!! You (and many of the
librarians) are not thinking about this problem right.
I say this after having spent massive amounts of time working with the
Library of Congress on improving MODS, and subsequently promoting its
use. Really, I spent far more energy promoting MODS outside the library
world than the librarians have. I even at once wanted to include it in
ODF.
I was wrong, and I’ll try to just give you a rough outline of an
explanation …
> Are there any downsides to using it?
Yes.
1) the schema is needlessly complicated in many places, and too limited
again, why use one framework for a document and another for the
in others
2) it is inconsistent, which has other side-effects (see below)
3) it is all based on XML schema; a completely monolithic solution
4) because of 3, awkward to extend
5) also because of 3, really loose
6) not designed for this use case (missing some pieces; see above)
7) because of 2, linking is awkward and limited
bibliographic sources, which themselves are documents?
I could really go on and on about the problems with it for this use
case. And if you read my blog, you’ll note there’s probably nobody in
the world that knows more about MODS than I outside of the Library of
Congress itself.
Last year I stood up before a room full of library people at a
conference (where I was invited to speak) and told them their XML
schemas (I meant MODS among them) were over-the-top bloated, and they
needed to get serious about RDF. I showed them an example.
Why on earth does it make any sense for a community (the librarians) to
invent monstrous schemas like this when they could have and should have
done it in RDF? DC and DCQ covers so much of what’s in MODS, but
because it didn’t cover some other things they did need, they went off
and invented their own schemas.
And when I and others from the citation focus say “but we need x, y, z”
the Library of Congress has to approve all of that, and they really
don’t care about some of our needs.
Monolithic schemas of this sort are the information equivalents of
dinosaurs.
This is EXACTLY the problem we need to avoid. I’m absolutely serious;
this approach is broken.
My own summaries of the issue related to the conference talk:
… and a few (brief) of responses (to the talk):
Note: WRT to Rob’s question today, I only come to RDF out of practical
frustration with the alternatives. My book was authored using MODS as
the bibliographic data source, and when I finished and went through
trying to clean it up (in part by normalizing), I realized it was
hopeless. I’ve converted it all to RDF and haven’t looked back. And I
don’t even typically use RDF tools!
> I am thinking that it might get us some mileage in the library
> community.
The LoC is actively working with MS on the ECMA spec. I invited them to
join the ODF metadata SC. I don’t draw a very positive conclusion from
that.
> On the other hand, we should mention that the NLM (National Library of
> Medicine) metadata system may be preferred by some users.
Exactly; there are probably 20 bibliographic schemas out there. The
ONLY sensible thing that will actually work is to do as I’ve been
suggesting. Forget monolithic schemas developed by particular
communities, and instead think “modules.”
Am working on a demo of sorts using XSLT
Bruce