Abstract

The DLF/Aquifer Implementation Guidelines for Shareable MODS Records were created to facilitate the creation of rich, sharable metadata for use in aggregated digital humanities collections. While guideline creators recognize most data providers do not meet the criteria set forth in this document, this study attempts to quantify current levels of conformance to the base requirements set forth by DLF/Aquifer MODS guidelines. By analyzing collections for which MODS records are currently made available to OAI-PMH service providers, predictions can be made as to the nature and extent of future normalization processes required by service providers and the nature and extent of training and education required by data providers wishing to expose MODS records useful in a variety of contexts.

Documents

Supporting documents

Note: See final paper for full bibliography

Resulting documents

Project commentary

Goals

Background

Open Archives Initiative - Protocol for Metadata Harvesting

Quality / Shareable Metadata

DLF/Aquifer Project & MODS

DLF/Aquifer Project

"Promote effective use of distributed digital library content for teaching, learning, and research in the area of American culture and life."

DLF/Aquifer Implementation Guidelines for Shareable MODS Records
Metadata Object Description Schema 3.2 (MODS)

Goals of Study

Methodology

Test set

Repositories in Test Set
Repository Name (Base URL) Records Harvested Records Extracted
A Celebration of Women Writers (Base URL) 304 304
OCLC Research Publications (Base URL) 852 852
University of Tennessee Libraries, Digital Library Center (Base URL) 859 859
Southern Spaces (Base URL) 62 62
Digital Books from UIUC and the Open Content Alliance (Base URL) 767 766
University of Chicago Library Metadata Repository (Base URL) 372 372
Indiana University Digital Library Program (Base URL) 14,425 721
Deep Blue at the University of Michigan (Base URL) 24,299 967
Library of Congress Digitized Historical Collections (Base URL) 292,000 747
The University of Michigan, University Library,
Digital Library Production Service Collections
(Base URL)
9,589 639
  343,529 6,289
(DLF/MODS Portal Data Contributors)

Tests

SQL queries to test full specification (including required subelements, attribute/value pairs, and any element content requirements) of the nine requirements (and required if applicable) specified by DLF/Aquifer MODS guidelines.

General result categories:

Futher heurisitic testing may give deeper insight into quality issues.

Summary of Results

Summary of Percentage of Records with Required MODS Elements
Required Element Percentage of Records with Element
<titleInfo>99.89%
<typeOfResource>84.57%
<originInfo>30.56%
<language>5.82%
<physicalDescription>27.51%
<subject>83.54%
<location>17.97%
<accessCondition>37.89%
<recordInfo>18.56%
(All required subelements, attribute/value pairs, and valid content must be present to satisfy requirement.)

Metadata Deficiencies

Resolution

Results & Discussion

<titleInfo><title>

Requirements
Conformance Issues
Conformance Solutions

<typeOfResource>

Requirements
Conformance Issues
Conformance Solutions

<originInfo>

Requirements
Conformance Issues
Declared Date/time Schemas
Schema Occurrences
iso8601970
marc2288
w3cdtf3296
(MARC not recommended)
Conformance Solutions

<language>

Requirements
Conformance Issues
Conformance Solutions

<physicalDescription>

Requirements
Conformance Issues
Conformance Solutions

<subject>

Requirements
Conformance Issues
Subject Authority Occurrences
Authority Occurrences
lcsh6392
local5272
lctgm240
 74
rvm73
mesh45
lcshac1
GNIS1
Total:12098
Subject Subelement Occurrences
Subelement Occurrences
topic16979
geographic2717
name1607
temporal1210
hierarchicalGeographic941
geographicCode359
genre335
cartographics43
titleInfo42
Total:24233
Conformance Solutions

<location>

Requirements
Conformance Issues
Conformance Solutions

<accessCondition>

Requirements
Conformance Issues
Conformance Solutions

<recordInfo>

Requirements
Conformance Issues
Conformance Solutions

Conclusion