ARL has published Metadata, SPEC Kit 298 by Jin Ma. The front matter and Executive Summary are freely available.
RFC for Dublin Core (RFC 5013) Published
John A. Kunze has announced on DC-GENERAL that the RFC for Dublin Core (RFC 5013) has just been published.
He notes that it "contains the same element definitions as the recently revised NISO standard, Z39.85-2007, but is freely accessible in one click via a global set of mirrored repositories used by the highly technical audiences that support and define Internet infrastructure."
A Portal for Doctoral E-Theses in Europe
The SURFfoundation has released A Portal for Doctoral E-Theses in Europe: Lessons Learned from a Demonstrator Project by M. P. J. P. Vanderfeesten. The project, which the SURFfoundation ran, was funded by JISC, the National Library of Sweden, and the SURFfoundation itself.
Here’s an excerpt from the "Management Summary":
For the first time various repositories with doctoral e-theses have been harvested on an international scale. This report describes a small pilot project which tested the interoperability of repositories for e-theses and has set up a freely accessible European portal with over 10,000 doctoral e-theses.
Five repositories from five different countries in Europe were involved: Denmark, Germany, the Netherlands, Sweden and the UK. The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) was the common protocol used to test the interoperability. Based upon earlier experiences and developed tools (harvester, search engine) of the national DAREnet service in the Netherlands, SURFfoundation could establish a prototype for this European e-theses Demonstrator relatively fast and simple.
Nevertheless, some critical issues and problems occurred. They can be categorised into the following topics:
a) Generic issues related to repositories: the language used in the metadata fields differs per repository. . . . Furthermore, the quality of the data presented differs. . . . A further issue is the semantic and syntactic differences in metadata between repositories, which means that the format and content of the information exchange requests are not unambiguously defined. . . .
b) E-theses specific issues: to be able to harvest doctoral theses, the service provider needs to be able to filter on this document type. Up to now there is no commonly agreed format, which makes semantic interoperability possible [specific Dublin Core recommendations omitted]. . . .
c) Issues related to data providers and service providers: besides the use of the OAI-protocol for metadata harvesting and the use of Dublin Core it is recommended for data providers to further standardise on the semantic interoperability by using the DRIVER guidelines with an addition of the e-Theses specific recommendations described above. To be able to offer more than basic services for e-Theses, one has to change the metadata format from simple Dublin Core to a richer and e-Theses specific one. . . . We needed to fix, normalise and crosswalk the differences between every repository to get a standard syntactic and semantic metadata structure. . . . The scaling up is a big issue. To stimulate the broad take up of various services, data providers have to work on implementing standards that create interoperability on syntactic and semantic levels.
d) Cultural and educational differences: In every country the educational processes are different. . . . Not only the graduation and publication process differs, but also the duration of the research process. Therefore the quality of the results in a cross-European search of doctoral theses may vary enormously.
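For readers who haven't tried this, here's a minimal sketch (in Python, which the project did not necessarily use) of the harvest-and-filter step the report describes. The repository base URL is hypothetical, and, as the report notes, the dc:type strings that mark a doctoral thesis vary from repository to repository, so the matching below is guesswork a real harvester would have to tune; it would also need to follow resumptionTokens to retrieve more than the first batch of records.

```python
# Sketch of an OAI-PMH harvest that keeps only thesis-like records.
# BASE_URL is hypothetical; the dc:type strings tested are guesses.
from urllib.request import urlopen
import xml.etree.ElementTree as ET

BASE_URL = "http://repository.example.edu/oai"  # hypothetical data provider

NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

url = BASE_URL + "?verb=ListRecords&metadataPrefix=oai_dc"
tree = ET.parse(urlopen(url))

for record in tree.iterfind(".//oai:record", NS):
    # There is no agreed vocabulary for flagging doctoral theses, so the
    # strings matched here would differ per repository in practice.
    types = [t.text or "" for t in record.iterfind(".//dc:type", NS)]
    if any("doctoral" in t.lower() or "thesis" in t.lower() for t in types):
        print(record.findtext(".//dc:title", default="(no title)", namespaces=NS))
```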
(Thanks to Open Access News.)
Metadata Extraction Tool Version 3.2
The National Library of New Zealand has released version 3.2 of its open-source Metadata Extraction Tool.
Written in Java and XML, the Metadata Extraction Tool has a Windows interface, and it runs under UNIX in command line mode. Batch processing is supported.
Here’s an excerpt from the project home page:
The Tool builds on the Library’s work on digital preservation, and its logical preservation metadata schema. It is designed to:
- automatically extract preservation-related metadata from digital files
- output that metadata in a standard format (XML) for use in preservation activities. . . .
The Metadata Extraction Tool includes a number of ‘adapters’ that extract metadata from specific file types. Extractors are currently provided for:
- Images: BMP, GIF, JPEG and TIFF.
- Office documents: MS Word (version 2, 6), Word Perfect, Open Office (version 1), MS Works, MS Excel, MS PowerPoint, and PDF.
- Audio and Video: WAV and MP3.
- Markup languages: HTML and XML.
If a file type is unknown the tool applies a generic adapter, which extracts data that the host system ‘knows’ about any given file (such as size, filename, and date created).
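The generic-adapter fallback is a nice touch. The tool itself is written in Java, and what follows is not its actual API, just a hedged Python sketch of the dispatch pattern the page describes: look up a format-specific extractor, and fall back to a generic one that reports what the file system knows.

```python
# Not the New Zealand tool's real (Java) API -- a Python illustration of
# the adapter pattern it describes: type-specific extractors keyed by file
# extension, with a generic fallback for unknown types.
import os
from datetime import datetime

def extract_jpeg(path):
    # Stand-in for a real format-specific adapter.
    return {"format": "JPEG", "source": path}

ADAPTERS = {".jpg": extract_jpeg, ".jpeg": extract_jpeg}

def generic_adapter(path):
    # What the host system "knows" about any file: size, name, dates.
    stat = os.stat(path)
    return {
        "filename": os.path.basename(path),
        "size": stat.st_size,
        "created": datetime.fromtimestamp(stat.st_ctime).isoformat(),
    }

def extract(path):
    ext = os.path.splitext(path)[1].lower()
    adapter = ADAPTERS.get(ext, generic_adapter)  # unknown type -> generic
    return adapter(path)
```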
Using the Open Archives Initiative Protocol for Metadata Harvesting
Libraries Unlimited has released Using the Open Archives Initiative Protocol for Metadata Harvesting by Timothy W. Cole and Muriel Foulonneau.
Here’s an excerpt from the publisher’s description:
Through a series of case studies, Cole and Foulonneau guide the reader through the process of conceiving, implementing and maintaining an OAI-compliant repository. Its applicability to both institutional archives and discipline-based aggregators is covered, with equal attention paid to the technical and organizational aspects of creating and maintaining such repositories.
ONIX for Serials Coverage Statement Draft Release 0.9
EDItEUR has released "ONIX for Serials Coverage Statement Draft Release 0.9 (June 2007)" for comment through September 2007.
Here’s an excerpt from the draft’s Web page:
ONIX for Serials Coverage Statement is an XML structure capable of carrying simple or complex statements of holdings of serial resources, in paper or electronic form, to be included in ONIX for Serials messages for a variety of applications; for example, to express:
- The holdings of a particular serial version by a library
- The coverage of a particular serial version supplied by an online content hosting system
- The coverage of a particular serial version included in a subscription or offering
EDItEUR has also released "SOH: Serials Online Holdings Release 1.1 (Draft June 2007)" for comment.
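To make the idea of a coverage statement concrete, here is a rough sketch that builds one as XML. To be clear, the element names below are invented for illustration, not the tag names the EDItEUR draft actually defines; consult the draft itself for the real structure.

```python
# Illustrative only: these element names are NOT the actual ONIX for
# Serials tags. The sketch just shows the kind of simple statement such a
# message might carry -- a serial version held from one enumeration point
# to another.
import xml.etree.ElementTree as ET

stmt = ET.Element("CoverageStatement")        # hypothetical tag
ET.SubElement(stmt, "SerialVersion").text = "Online"
start = ET.SubElement(stmt, "CoverageStart")  # hypothetical tag
ET.SubElement(start, "Volume").text = "1"
ET.SubElement(start, "Issue").text = "1"
end = ET.SubElement(stmt, "CoverageEnd")      # hypothetical tag
ET.SubElement(end, "Volume").text = "19"
ET.SubElement(end, "Issue").text = "3"

print(ET.tostring(stmt, encoding="unicode"))
```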
Compound Information Objects: An OAI-ORE Perspective
The Open Archives Initiative Object Reuse and Exchange (OAI-ORE) effort has released Compound Information Objects: An OAI-ORE Perspective by Carl Lagoze and Herbert Van de Sompel.
Here’s an excerpt from the document’s "Introduction and Motivation" section:
In summary, the web architecture expresses the notion of linked URI-identified resources. Information systems can leverage this architecture to publish the components of a compound object and thereby make them available to web clients and services. But due to the absence of commonly accepted standards, the notion of an identified compound object with a distinct boundary and typed relationships among its component resources is lost.
The absence of these standards affects the functionality of a number of existing and possible web services and applications. Crawler-based search engines might be more useful if the granularity of their result sets corresponded to compound objects (a book or chapter, in this example) rather than individual resources (single pages). The ranking algorithms of these search engines might improve if the links among the components of a compound object were treated differently than links to the object as a whole, or if the number of in-links to the various component resources was accumulated to the level of the compound object instead of counted separately. Citation analysis systems would also benefit from a mechanism for citing the compound object itself, rather than arbitrary parts of the object. Finally, a standard for representing compound objects might enable a new class of "whole object" services such as "preserve a compound object".
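The argument is easier to see with a toy example. The sketch below is not the OAI-ORE data model, which the document is only beginning to define; it just shows a compound object as a bounded set of URI-identified components with typed relationships among them, using hypothetical URIs.

```python
# A generic illustration (not the OAI-ORE vocabulary) of a compound object
# with its own identity and boundary, plus typed relationships among its
# component resources. All URIs are hypothetical.
compound = {
    "id": "http://example.org/objects/book-42",
    "type": "Book",
    "relations": [
        ("http://example.org/objects/book-42", "hasPart",
         "http://example.org/objects/book-42/chapter-1"),
        ("http://example.org/objects/book-42/chapter-1", "hasPart",
         "http://example.org/objects/book-42/chapter-1/page-1"),
        ("http://example.org/objects/book-42/chapter-1", "followedBy",
         "http://example.org/objects/book-42/chapter-2"),
    ],
}

def components(obj):
    """Every resource inside the object's boundary, minus the object itself."""
    nodes = set()
    for subject, _predicate, target in obj["relations"]:
        nodes.update((subject, target))
    return nodes - {obj["id"]}

# A search engine aware of this structure could roll in-links to any of
# these components up to the compound object, as the excerpt suggests.
print(components(compound))
```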
Implementing the PREMIS Data Dictionary: A Survey of Approaches
The Library of Congress’ Network Development and MARC Standards Office has released Implementing the PREMIS Data Dictionary: A Survey of Approaches.
Here is an excerpt from the report’s preface:
The Preservation Metadata: Implementation Strategies (PREMIS) Working Group developed the Data Dictionary for Preservation Metadata, which is a specification containing a set of "core" preservation metadata elements that has broad applicability within the digital preservation community. The PREMIS Data Dictionary (PDD) was released in May 2005 along with a set of XML schemas to support its implementation. Since that time, institutions have begun to implement preservation metadata by providing content for semantic units expressed in the data dictionary or comparing it with planned or existing systems for long-term preservation. . . .
The Library of Congress, as part of the PREMIS maintenance activity, commissioned Deborah Woodyard-Robinson to provide this study to explore how institutions have implemented the PREMIS semantic units. . . . In this study sixteen repositories have been surveyed about their interpretation and application of the PDD, with an analysis then made on how the PREMIS core fits with the functions of a preservation repository and which PDD semantic units will be most relevant to certain types of repositories.
Dublin Core Standard Renewed and Updated
The Dublin Core Metadata Initiative has announced that the Dublin Core Metadata Element Set has been renewed and updated as ANSI/NISO standard Z39.85-2007.
In other Dublin Core news, the DCMI Abstract Model has been approved as a DCMI Recommendation and a new DCMI Task Group has been established for collaborative work on Resource Description and Access (RDA).
MIDESS (Management of Images in a Distributed Environment with Shared Services) Project
The JISC-funded MIDESS Project is examining issues related to the management of digital audio, images, video, and other digital content in distributed digital repositories as well as at the national level. It is being conducted by the London School of Economics, University College London, the University of Birmingham, and the University of Leeds.
Here is an excerpt from the "Aims and Objectives of the MIDESS Project" page:
- The MIDESS project will be building digital content databases at three of the partner institutions . . .
- These databases will be populated with digital content which has already been created, or is currently under creation, by the partner institutions. . . .
- Opportunities for the sharing and re-use of digital collections across institutions will be explored . . .
- Metadata standards will be established, and metadata developed, for each collection added to the repositories. . . .
- MIDESS will explore the role of digital content databases with a particular focus on interoperability with enterprise content management architectures.
- MIDESS will also aim to establish how distributed digital repositories could encourage the wider exposure and sharing of content across institutions through an evaluation of requirements for centralised metadata harvesting services.
- MIDESS will seek to pilot an infrastructure which could serve as a model for future distributed national digitisation activities.
The project has produced a number of interesting documents, especially the detailed workpackages, which deal with issues such as digital preservation, enterprise storage, intellectual property, and user requirements.
Report on Embedding and Reusing PerX in a VLE
The PerX (Pilot Engineering Repository Xsearch) project has released its Report on Embedding and Reusing PerX in a VLE. (A "VLE" is a virtual learning environment.)
Here’s an excerpt from the introduction:
This report presents the reusable middleware we have used to embed PerX functionality into the University VLE, VISION, a commercial Blackboard VLE system. We have done our best to use service oriented architectures (SOA) as far as possible. We argue that by using open source and open standards approaches rather than software and practices developed specifically for a particular VLE product, it is possible to obtain open reusable middleware that can simplify the DLVLE integration and bridge the functionality of both environments. We hope that our methodology can provide a common foundation on which a variety of institutions may build their own customized middleware to integrate scholarly objects in VLEs.
Here’s a brief description of the PerX project from its home page:
The PerX project has developed a pilot service which provides subject resource discovery across a series of repositories of interest to the engineering learning and research communities. This pilot was used as a test-bed to explore the practical issues that would be encountered when considering the possibility of full scale subject resource discovery services.
DLF and OCLC Release Registry of Digital Masters Record Creation Guidelines
The Digital Library Federation and OCLC have released their Registry of Digital Masters Working Group’s Registry of Digital Masters Record Creation Guidelines.
Here is an excerpt from the Purpose section of the document:
By recording materials in the Registry, institutions are signaling the intent to preserve and maintain the accessibility of the described materials over an extended timeframe. This implies that materials were born digital or have been converted to digital form, that the digital objects are stored in professionally managed systems, and that the institution is committed to retain and preserve them. . . .
These guidelines detail which MARC 21 elements should be used to carry Registry-required information. Registry records describe materials that an institution intends to digitize, either from existing paper- and/or microfilm-based materials (“intent to digitize”), as well as born digital materials, and to indicate the standards by which the registered objects have been digitized.
A Registry record also provides information about whether a specific item has already been digitized, and if so, whether the digitization has been done at an adequate level such that another digital copy is not required, what institution is responsible for the digitization, what institution is responsible for the preservation of the digital content, and what specific materials are available.
Report on Ingest Tools for Digital Repositories
The Cairo Project has released Cairo Tools Survey: A Survey of Tools Applicable to the Preparation of Digital Archives for Ingest into a Preservation Repository. It has also released a related report, Cairo Use Cases: A Survey of User Scenarios Applicable to the Cairo Ingest Tool.
Here’s a description of the Cairo Project from its home page:
Cairo will develop a tool for ingesting complex collections of born-digital materials, with basic descriptive, preservation and relationship metadata, into a preservation repository. The project is based on needs identified by the JISC-funded Paradigm project and the Wellcome Library’s Digital Curation in Action project. It is a key building block in the partner institutions’ strategy to develop digital repository architectures which can support the development of digital collections over the long-term.
Irish Virtual Research Library and Archive Project Workbook
The Irish Virtual Research Library has released its Project Workbook, which provides detailed information about its policies and procedures.
Here’s an excerpt from the Irish Virtual Research Library’s home page that describes the project:
The Irish Virtual Research Library & Archive (IVRLA) is a major digitisation and digital object management project launched in UCD in January 2005. The project was conceived as a means to preserve elements of UCD’s main repositories and increase and facilitate access to this material through the adoption of digitisation technologies.
Additionally the project will undertake dedicated research into the area of interacting with and enhancing the use of digital objects in a research environment through the development of a digital repository. When fully implemented, the IVRLA will be one of the first comprehensive digital primary source repositories in Ireland, and will advance the research agenda into the use and challenges affecting this new method of research, and of digital curation over the coming years.
Best Practices for Digital Collections at UM Libraries
Digital Collections and Resources at the University of Maryland Libraries has released the second edition of its Best Practices for Digital Collections at UM Libraries.
While these wide-ranging guidelines are primarily intended for the UM Libraries, others may find this 81-page document to be helpful as well.
Summary of PerX Project Findings About OAI-PMH and Repository Metadata Challenges
Roderick A. MacLeod has posted a useful summary of some of the key documents and findings of the PerX (Pilot Engineering Repository Xsearch) project on JISC-REPOSITORIES. He notes: "These documents may help to dispel possible myths concerning the ease of service provision, ease of reharvesting metadata, surfacing digital repository content in third-party services, etc."
Here’s an excerpt from the project’s About page that describes it:
PerX is a two-year (June 2005-May 2007) JISC Digital Repositories Programme project, to develop a pilot service which provides subject resource discovery across a series of repositories of interest to the engineering learning and research community. This pilot will then be used as a test-bed to explore the practical issues that would be encountered when considering the possibility of a full scale subject resource discovery service.
(Prior posting about PerX.)
Persistent Identifier Linking Infrastructure (PLIN) Project
ARROW and the University of Southern Queensland have established the Persistent Identifier Linking Infrastructure (PLIN) Project.
As outlined in the project’s Executive Summary, its goals are to:
- Support adoption and use of persistent identifiers and shared persistent identifier management services by the project stakeholders.
- Plan for a sustainable, shared identifier management infrastructure that enables persistence of identifiers and associated services over archival lengths of time.
The project’s anticipated outcomes are:
- Best practice and policy guides for the use of persistent identifiers in Australian e-learning, e-research, and e-science communities.
- Use cases describing community requirements for identifiers and business process analysis relating to these use cases.
- E-Framework representations of persistent identifier management services that support the business requirements for identifiers.
- A "pilot" shared persistent identifier management infrastructure usable by the project stakeholders over the lifetime of the project. The pilot infrastructure will include services for creating, accessing and managing persistent digital identifiers over their lifetime. The pilot infrastructure will interoperate with other DEST funded systemic infrastructure. The development phase of the pilot will use an agile development methodology that will allow the inclusion of "value-added" services for managing resources using persistent identifiers to be included in the development program if resources permit.
- Software tools to help applications use the shared persistent identifier infrastructure more easily.
- Report on options and proposals for sustaining, supporting (including outreach) and governing shared persistent identifier management infrastructure.
The PLIN Project will base its work on the CNRI Handle System. The excerpt below, from the Handle System home page, describes its primary features:
The Handle System® is a general purpose distributed information system that provides efficient, extensible, and secure identifier and resolution services for use on networks such as the Internet. It includes an open set of protocols, a namespace, and a reference implementation of the protocols. The protocols enable a distributed computer system to store identifiers, known as handles, of arbitrary resources and resolve those handles into the information necessary to locate, access, contact, authenticate, or otherwise make use of the resources. This information can be changed as needed to reflect the current state of the identified resource without changing its identifier, thus allowing the name of the item to persist over changes of location and other related state information.
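In everyday use, most people meet the Handle System through its HTTP proxy at hdl.handle.net rather than the native protocol. Here's a minimal sketch of resolution via that proxy, using a made-up handle:

```python
# Resolve a handle through the public HTTP proxy. The handle itself is
# hypothetical; a real one would redirect to the identified resource.
from urllib.request import urlopen

handle = "1234/abc"  # hypothetical handle
with urlopen("http://hdl.handle.net/" + handle) as resp:
    print(resp.geturl())  # final URL after the proxy's redirect
```

Because clients hold only the handle, the stored location can change without breaking any links, which is the persistence property the excerpt describes.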
EAD 2002 Schema Released
The EAD Schema Working Group (SAA/EADWG) has released the EAD 2002 Schema.
Two syntaxes are available: Relax NG Schema (RNG) and W3C Schema (XSD; requires the EAD XLink Schema).
Version 1.0 to Version 2002 conversion tools are available at EAD v1 to EAD v2002 Conversion.
For further information about the Encoded Archival Description (EAD), see the EAD Help Pages.
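For those wanting to validate finding aids against the new schema, here is one possible approach using the third-party lxml library for Python; the file names are placeholders. A command-line validator such as jing would work equally well for the Relax NG syntax.

```python
# Validate an EAD 2002 instance against the Relax NG syntax with lxml.
# File names are assumptions; the W3C Schema syntax could be checked the
# same way via etree.XMLSchema.
from lxml import etree

schema = etree.RelaxNG(etree.parse("ead.rng"))  # the RNG schema file
doc = etree.parse("finding-aid.xml")            # an EAD instance

if schema.validate(doc):
    print("valid EAD 2002")
else:
    print(schema.error_log)
```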
DLF/NSDL OAI Best Practices Wiki
The OAI Best Practices Wiki of the Digital Library Federation and NSDL OAI and Shareable Metadata Best Practices Working Group offers a number of resources relevant to the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) and related metadata issues.
The Tools and Strategies for Using and Enhancing/Extending the OAI Protocol section is of particular interest. It includes information about OAI-PMH data provider and service provider registries, software solutions and packages, and static repositories and gateways; metadata management and added value tools as well as OAI and character validation tools; and using SRU/W, collection description schema, and NSDL safe transforms.
Is OAI-PMH Too Labor-Intensive?
OAI-PMH permits metadata harvesting from disciplinary archives, institutional repositories, and other digital archives, allowing specialized search services to be built on the harvested metadata. OAI-PMH is a key technology for the open access movement, but does it require too much human intervention?
An interesting message on JISC-REPOSITORIES by Santy Chumbe, Technical Officer of the PerX project, suggests that it may. He says:
We have learned that in spite of its relative simplicity, an OAI-PMH service can be harder to implement and maintain than expected. We have spent a lot of effort harvesting, normalising and maintaining metadata obtained from OAI data providers. In particular the issue of metadata quality is an important factor here. A summary of our experiences dealing with OAI-PMH can be found at http://eprints.rclis.org/archive/00006394. . . . A final report outlining the maintenance issues involved in the project is in progress but the experience gained suggests that successful ongoing maintenance of OAI targets would require a mixture of automated and manual approaches and that the level of ongoing maintenance is high.
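A toy example may make the normalisation burden Chumbe mentions clearer. Harvested Dublin Core values arrive in per-repository variants that a service provider must map onto a single vocabulary; the variant strings and target terms below are invented, but the unmapped case is the point, since every new variant needs a human decision.

```python
# Invented mapping table illustrating why OAI maintenance stays partly
# manual: each repository's dc:type (or dc:date, dc:language, ...) values
# must be crosswalked to one controlled vocabulary.
TYPE_MAP = {
    "journal article": "article",
    "artículo": "article",
    "preprint": "preprint",
    "e-print": "preprint",
}

def normalise_type(raw):
    key = raw.strip().lower()
    # Unmapped values need a human decision before they can be indexed.
    return TYPE_MAP.get(key, "UNMAPPED:" + raw)

assert normalise_type("Journal Article") == "article"
```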
Test Driving the CrossRef Simple-Text Query Tool for Finding DOIs
CrossRef has made a DOI finding tool publicly available. It’s called Simple-Text Query. You can get the details at Barbara Quint’s article "Linking Up Bibliographies: DOI Harvesting Tool Launched by CrossRef."
What caught my eye in Quint’s article was this: "Users can enter whole bibliographies with citations in almost any bibliographic format and receive back the matching Digital Object Identifiers (DOIs) for these references to insert into their final bibliographies."
Well, not exactly. I cut and pasted just the "9 Repositories, E-Prints, and OAI" section of the Scholarly Electronic Publishing Bibliography into Simple-Text Query. Result: an error message. I had exceeded the 15,360-character limit. So, suggestion one: put the limit on the Simple-Text Query page.
So then I counted out 15,360 characters of the section and pasted that. Just kidding. I pasted the first six references. Result?
Alexander, Martha Latika, and J. N. Gautam. “Institutional Repositories for Scholarly Communication: Indian Initiatives.” Serials: The Journal for the Serials Community 19, no. 3 (2006): 195-201.
No doi match found.

Allard, Suzie, Thura R. Mack, and Melanie Feltner-Reichert. “The Librarian’s Role in Institutional Repositories: A Content Analysis of the Literature.” Reference Services Review 33, no. 3 (2005): 325-336.
doi:10.1108/00907320510611357
http://dx.doi.org/10.1108/00907320510611357

Allen, James. “Interdisciplinary Differences in Attitudes towards Deposit in Institutional Repositories.” Manchester Metropolitan University, 2005.
http://eprints.rclis.org/archive/00005180/
Reference not parsed

Allinson, Julie, and Roddy MacLeod. “Building an Information Infrastructure in the UK.” Research Information (October/November 2006).
http://www.researchinformation.info/rioctnov06digital.html
Reference not parsed

Anderson, Greg, Rebecca Lasher, and Vicky Reich. “The Computer Science Technical Report (CS-TR) Project: A Pioneering Digital Library Project Viewed from a Library Perspective.” The Public-Access Computer Systems Review 7, no. 2 (1996): 6-26.
http://epress.lib.uh.edu/pr/v7/n2/ande7n2.html
Reference not parsed

Andreoni, Antonella, Maria Bruna Baldacci, Stefania Biagioni, Carlo Carlesi, Donatella Castelli, Pasquale Pagano, Carol Peters, and Serena Pisani. “The ERCIM Technical Reference Digital Library: Meeting the Requirements of a European Community within an International Federation.” D-Lib Magazine 5 (December 1999).
http://www.dlib.org/dlib/december99/peters/12peters.html
Reference not parsed
Hmmm. According to Quint’s article:
I asked Brand if CrossRef could reach open access material. She assured me it could, but it clearly did not give the free and sometimes underdefined material any preference.
Looks like the open access capabilities may need some fine-tuning. D-Lib Magazine and The Public-Access Computer Systems Review are not exactly obscure e-journals. Since my references are formatted in the Chicago style by EndNote, I don’t think that the reference format is the issue. In fact, Quint’s article says: "The Simple-Text Query can retrieve DOIs for journal articles, books, and chapters in any reference citation style, although it works best with standard styles."
Conclusion: I’ll play with it some more, but Simple-Text Query may be best for conventional, mainstream journal references.
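In the meantime, one workaround for the character cap would be to split a long bibliography into submissions that stay under the limit, breaking on the blank lines between references rather than mid-citation. A rough sketch, using the 15,360 figure reported above:

```python
# Split a bibliography into chunks under the Simple-Text Query limit,
# breaking only on blank lines so no reference is cut in half.
LIMIT = 15360

def chunk_bibliography(text, limit=LIMIT):
    chunks, current = [], ""
    for ref in text.split("\n\n"):  # one reference per paragraph
        candidate = (current + "\n\n" + ref).strip()
        if len(candidate) > limit and current:
            chunks.append(current)
            current = ref
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```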
Collex: Remixable Metadata for Humanists to Create Collections and Exhibits
What is Collex? The project’s About page describes it in part as follows:
Collex is a set of tools designed to aid students and scholars working in networked archives and federated repositories of humanities materials: a sophisticated COLLections and EXhibits mechanism for the semantic web.
Collex allows users to collect, annotate, and tag online objects and to repurpose them in illustrated, interlinked essays or exhibits. It functions within any modern web browser without recourse to plugins or downloads and is fully networked as a server-side application. By saving information about user activity (the construction of annotated collections and exhibits) as ‘remixable’ metadata, the Collex system writes current practice into the scholarly record and permits knowledge discovery based not only on the characteristics or ‘facets’ of digital objects, but also on the contexts in which they are placed by a community of scholars.
A detailed description of the project is available in "COLLEX: Semantic Collections & Exhibits for the Remixable Web."
You can see Collex in action at the NINES (a Networked Interface for Nineteenth-Century Electronic Scholarship) project, which also uses IVANHOE ("a shared, online playspace for readers interested in exploring how acts of interpretation get made and reflecting on what those acts mean or might mean") and Juxta ("a cross-platform tool for collating and analyzing any kind or number of textual objects").
The About 9s page identifies key objectives of the NINES project as follows:
- It will create a robust framework to support the authority of digital scholarship and its relevance in tenure and other scholarly assessment procedures.
- It will help to establish a real, practical publishing alternative to the paper-based academic publishing system, which is in an accelerating state of crisis.
- It will address in a coordinated and practical way the question of how to sustain scholarly and educational projects that have been built in digital forms.
- It will establish a base for promoting new modes of criticism and scholarship promised by digital tools.
People Metadata
A message by Liddy Nevile on DC-GENERAL has spawned an interesting thread about the need for a metadata scheme that describes people. Other participants note related efforts, such as BIO, the FOAF Vocabulary Specification, GEDCOM, the North Carolina Encoded Archival Context (EAC) Project, and the XHTML Friends Network.
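Of the efforts mentioned, FOAF is probably the easiest to try out. Here's a small taste of describing a person with the FOAF vocabulary, built with the third-party rdflib library for Python; the person and URIs are, of course, made up.

```python
# Describe a (fictional) person with the FOAF vocabulary using rdflib.
from rdflib import Graph, Literal, Namespace, URIRef
from rdflib.namespace import RDF

FOAF = Namespace("http://xmlns.com/foaf/0.1/")

g = Graph()
me = URIRef("http://example.org/people/jdoe#me")  # hypothetical URI
g.add((me, RDF.type, FOAF.Person))
g.add((me, FOAF.name, Literal("Jane Doe")))
g.add((me, FOAF.mbox, URIRef("mailto:jdoe@example.org")))

print(g.serialize(format="turtle"))
```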
DOIs for Books Gain Ground
According to CrossRef, an official DOI registration agency, over a half-million DOIs have been assigned to books or book chapters, and twenty of its members are using DOIs in this fashion.
What’s a DOI? Here’s a short description from CrossRef:
The DOI, or digital object identifier, serves as a persistent, actionable identifier for intellectual property online. DOIs can be assigned at any level of granularity, and therefore provide publishers with an extensible platform for a variety of applications. And DOI links don’t break. Even if a publisher needs to migrate publications from one system to another, or if the content moves from one publisher to another, the DOI never changes.
While the use of DOIs for book chapters is especially interesting, DOIs can be utilized for even smaller book sections, as this example of an entry for Ian Fleming in the Oxford Dictionary of National Biography illustrates. (Notice the DOI, "Ian Lancaster Fleming (1908–1964): doi:10.1093/ref:odnb/33168," at the bottom of the entry.)
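Resolution works the same at any granularity, which is easy to verify: the Fleming DOI above resolves through the dx.doi.org proxy just like any book- or article-level DOI. A minimal sketch:

```python
# Resolve the chapter-level DOI quoted above through the DOI proxy.
from urllib.request import urlopen

doi = "10.1093/ref:odnb/33168"  # the Fleming entry
with urlopen("http://dx.doi.org/" + doi) as resp:
    print(resp.geturl())  # wherever the publisher currently hosts the entry
```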
New Digital Image Documentation from TASI
The Technical Advisory Service for Images (TASI) has issued new documentation dealing with digital image issues:
- "Challenges of Describing Images"
- "Controlling Your Language—Links to Metadata Vocabularies"
- "Getting Practical with Metadata"
- "Metadata Overview"
- "Metadata Standards and Interoperability"
- "Putting Things in Order: Links to Metadata Schemas and Related Standards"
TASI has also created new guides to assist users in identifying appropriate materials: