Presentations from the Open Access Collections Workshop

Presentations from the Open Access Collections workshop are now available.

Here are selected presentations:

Muradora Version 1.2.1 Released: Federated Identity and Authorization for Fedora

The DRAMA (Digital Repository Authorization Middleware Architecture) team has released version 1.2.1 of Muradora.

Here's an excerpt from the Muradora home page that describes Muradora:

Muradora is an easy to use repository application that supports federated identity (via Shibboleth authentication) and flexible authorization (using XACML). Muradora leverages the modularity, flexibility and scalability of the well-known Fedora repository.

Muradora's unique vision is one where Fedora forms the core back-end repository, while different front-end applications (such as portlets or standalone web interfaces) can all talk to the same instance of Fedora, and yet maintain a consistent approach to access control.

Read more about it at "Muradora 1.2.1 Release."

VALA 2008 Presentations

Presentations from the VALA 2008 conference are now available.

Here's a selection of presentations:

Michelle McLean has blogged a number of VALA 2008 sessions in Connecting Librarian postings.

E-Print Preservation: SHERPA DP: Final Report of the SHERPA DP Project

JISC has released SHERPA DP: Final Report of the SHERPA DP Project.

Here's an excerpt from the "Executive Summary":

The SHERPA DP project (2005–2007) investigated the preservation of digital resources stored by institutional repositories participating in the SHERPA project. An emphasis was placed on the preservation of e-prints—research papers stored in an electronic format, with some support for other types of content, such as electronic theses and dissertations.

The project began with an investigation of the method that institutional repositories, as Content Providers, may interact with Service Providers. The resulting model, framed around the OAIS, established a Co-operating archive relationship, in which data and metadata is transferred into a preservation repository subsequent to it being made available. . . .

The Arts & Humanities Data Service produced a demonstrator of a Preservation Service, to investigate the operation of the preservation service and accepted responsibility for the preservation of the digital objects for a three-year period (two years of project funding, plus one year).

The most notable development of the Preservation Service demonstrator was the creation of a reusable service framework that allows the integration of a disparate collection of software tools and standards. The project adopted Fedora as the basis for the preservation repository and built a technical infrastructure necessary to harvest metadata, transfer data, and perform relevant preservation activities. Appropriate software tools and standards were selected, including JHOVE and DROID as software tools to validate data objects; METS as a packaging standard; and PREMIS as a basis on which to create preservation metadata. . . .

A number of requirements were identified that were essential for establishing a disaggregated service for preservation, most notably some method of interoperating with partner institutions and he establishment of appropriate preservation policies. . . . In its role as a Preservation Service, the AHDS developed a repository-independent framework to support the EPrints and DSpace-based repositories, using OAI-PMH as common method of connecting to partner institutions and extracting digital objects.

JISC Programme Synthesis Study: Supporting Digital Preservation and Asset Management in Institutions

JISC has published JISC Programme Synthesis Study: Supporting Digital Preservation and Asset Management in Institutions: A Review of the 4-04 Programme on Digital Preservation and Asset Management in Institutions for the JISC Information Environment: Part II: Programme Synthesis.. The report covers a number of projects, including LIFE, MANDATE, PARADIGM, PRESERV, and SHERPA DP.

Here's an excerpt from UKOLN News:

Written by Maureen Pennock, DCC researcher at UKOLN, the study provides a comprehensive and categorised overview of the outputs from the entire programme. Categories include training, costs and business models, life cycles, repositories, case studies, and assessment and surveys. Each category includes detailed information on project outputs and references a number of re-usable project-generated tools that range from software services to checklists and guidance.

REPOMAN-L (Institutional Repository Managers' Mailing List) Launched

Richard Griscom, University of Pennsylvania, and Leah Vanderjagt, University of Alberta, have launched REPOMAN-L (Institutional Repository Managers' Mailing List).

Here's an excerpt from the announcement:

We have created REPOMAN-L (Institutional Repository Managers' Mailing List) as an open forum for the discussion of issues, great and small, that confront repository managers. We hope that you will subscribe and participate enthusiastically, and use this list for problem-solving and sharing of advice; for example:

  • to poll the group on practices at their institutions
  • to ask about any aspect of development from policy to outreach
  • initiatives to software evaluation
  • to share links to useful tools and references
  • to explore rationale around decisions you're making about your repository. . . .

The list is purposefully unaffiliated with any institution, initiative, repository software platform, or conceptual idea such as open access; the list would of course not exclude discussion of these areas, but we ask subscribers to consider initiating these discussions on lists set up specifically for the topics and then bring summaries of relevance to this list.

Podcasts from Clever Collections: A National Showcase of Technical Innovations for Digital Collections

Podcasts of sessions at APSR's Clever Collections: A National Showcase of Technical Innovations for Digital Collections conference are now available.

Here are the titles of some repository-oriented presentations:

  • "Enhancing Research Collections by Harvesting Community Annotations"
  • "Integrating Repositories with Researcher Environments"
  • "Object Re-Use and Exchange (ORE): Practice and Experience in the Open Language Archives Community"
  • "The Repository Interoperability Framework"
  • "Taking Aim and (Mostly) Hitting Our Targets: from DART to ARCHER"

New Mailing Lists: JISC-SHIBBOLETH-LIBRARIES and Sword-app-tech

Two mailing lists have been recently established: JISC-SHIBBOLETH-LIBRARIES and sword-app-tech.

Excerpt from the JISC-SHIBBOLETH announcement:

Many institutions are now at the stage with their implementation of federated access management where issues directly impacting libraries are being considered and managed. This includes discovery processes for end-users, testing and changing access to federated service providers, dealing with different user definitions, managing license and resource information and changing send-user information.

To help support this process we have established a separate mailing list to enable discussion and exchange of views directly relating to library issues.

Excerpt from the Fedora-commons-users announcement:

A new mailing list has been created for discussion, bug reports, implementations questions and development ideas relating to SWORD (Simple Web-service Offering Repository Deposit).

SWORD is a protocol for interoperable deposit between repository platforms. It was developed by a JISC project during 2007, building on earlier work to define a deposit protocol, and is based on the Atom Publishing Protocol.

Citation, Location, and Deposition in Discipline & Institutional Repositories

The JISC CLADDIER project has published Citation, Location, and Deposition in Discipline & Institutional Repositories: CLADDIER Project Report III, Recommendations for Data/Publication Linkage.

Here's an excerpt from the abstract:

A key aim of the CLADDIER project is to investigate the cross-linking and citation of resources (in particular data and their associated publications) held in institutional and subject-based repositories within the research sector. Typically traditional citations are partial in that they are "backward citations", referring to work which influenced the current research, and they only cite other formal publications, ignoring other artefacts which are the output of research, in particular research data. Online repositories storing more dynamic digital objects gives the opportunity to provide a more complete picture of the relationships between them, with backward and forward citations to data and publications being propagated between repositories.

This report motivates the cross-citations of data from the CLADDIER use case example, and considers the approaches which have been implemented to harvest and propagate citation information. Most of these existing approaches depend on centralised services, which were considered unsatisfactory in an environment where independent repositories wish to maintain control of their resources and do not wish to be dependant on third-party services. Criteria are identified for building a Citation Notification Service to propagate citation references and links between repositories, including using a peer-to-peer protocol. A number of different architectures are proposed and evaluated.

The requirement for a light-weight peer-to-peer service which is as widely applicable as possible lead to the selection of Linkback services, in particular Trackback which provides an existing simple specification which can be implemented quickly and adapted to the requirements of citation notification. A detailed description the Trackback protocol is then given, together with the design of the adaptations and extensions identified as required for citation notification. This extended Trackback protocol has been implemented in the STFC ePubs institutional repository; this implementation is described and a use case is described.

Geoffrey Bilder has commented on the report in "CLADDIER Final Report."

Institutional Repositories, Tout de Suite

Institutional Repositories, Tout de Suite, the latest Digital Scholarship publication, is designed to give the reader a very quick introduction to key aspects of institutional repositories and to foster further exploration of this topic through liberal use of relevant references to online documents and links to pertinent websites. It is under a Creative Commons Attribution-Noncommercial 3.0 United States License, and it can be freely used for any noncommercial purpose in accordance with the license.

University of Michigan Libraries Release the UMich OAI Toolkit

The University of Michigan Libraries have released the UMich OAI Toolkit.

Here's an excerpt from the announcement:

This toolkit contains both harvester and data provider, both written in Perl. . . .

UMHarvester is a robust tool using LWP for harvesting nigh on every OAI data provider available. It allows for incremental harvesting, has multiple re-try options, and a batch harvest tool (Batch_UMHarvest) that can automatically perform incremental harvesting.

UMProvider relies heavily on libxml (XML::LibXML) and will store the data in nearly any relational database. It functions by harvesting from a database of records, making rights determinations from a separate database, and providing the resulting set of records.

Originally, only the UMHarvester was available from UM's DLXS software site. The UMProvider tool is newly developed and takes the place of our DLXS data provider tool.

New Release of BioMed Central's Open Repository, a Hosted Institutional Repository Service

BioMed Central has released version 1.4.9 of Open Repository, its DSpace-based, hosted institutional repository service.

Here's an excerpt from the press release:

Open Repository version 1.4.9 has several new features that are designed to enhance the customer experience. The release offers an improved user interface, making it easier for customers to browse and submit their material online. Additionally, institutions can convert their Word, Excel, PowerPoint, Text and RTF documents to PDF format. Customers can also set up RSS feeds, and customize lists and search fields, adding value to the already robust platform.

Stable Version of SPECTRa Released: Software for Depositing Chemical Data into Repositories

A stable version of SPECTRa has been released. SPECTRa is designed to facilitate the deposit of chemical data into digital repositories.

The JISC-funded SPECTRa (Submission, Preservation and Exposure of Chemistry Teaching and Research Data a Digital Repository for the Chemical Community) project's final report is also available.

National Science Digital Library Releases Initial Fedora-based NCore Components

The National Science Digital Library Core Integration team at Cornell University has released a partial version of NCore, a "general platform for building semantic and virtual digital libraries united by a common data model and interoperable applications," which is built upon Fedora.

Here's an excerpt from the NSDL posting:

The NCore platform consists of a central repository built on top of Fedora, a data model, an API, and a number of fundamental services such as full-text search or OAI-PMH. Innovative NSDL services and tools that empower users as content creators are now built on, or transitioning to, the NCore platform. These include: the Expert Voices blogging system (http://expertvoices.nsdl.org/);the NSDL Wiki (http://wiki.nsdl.org/index.php/NSDL_Wiki); the NSDL OAI-PMH metadata ingest aggregation system; the OAI-PMH service for distributing public NSDL metadata; the NSDL Collection System (NCS), derived from the DLESE Collection system (DCS); the NSDL Search service, and the OnRamp content management and distribution system (http://onramp.nsdl.org).

Because NCore is a general Fedora-based open source platform useful beyond NSDL, Core Integration developers at Cornell University have made the repository and API code components of NCore available for download at the NCore project on Sourceforge (http://sourceforge.net/projects/nsdl-core). Over the next six months, NSDL will release the code for major tools and services that comprise the full NCore suite on SourceForge.

For further information, see the NCore presentation.

Towards the Australian Data Commons: A Proposal for an Australian National Data Service

The Australian eResearch Infrastructure Council has released Towards the Australian Data Commons: A Proposal for an Australian National Data Service.

Here's an excerpt from the "Overview":

This paper is designed to encourage, inform and ultimately summarise the discussions around the appropriate strategic and technical descriptions of the Australian National Data Service; to fill in the outline in the Platforms for Collaboration investment plan.

To do so, the paper:

  • introduces the Australian National Data Service (ANDS) and the driving forces behind its creation;
  • provides a rationale for the services that ANDS will provide, and the programs through which the services will be offered; and
  • describes in detail the ANDS programs.

Part One (Background) provides a brief summary of the reasons to focus on data management, as well as an overview of ANDS, and identifies some issues associated with implementation.

Part Two (Rationale) sets out the systemic issues associated with achieving a research data commons, and provides the resultant rationale for the services that ANDS will offer the programs that they will be delivered through.

Part Three (Detailed Descriptions of ANDS Programs) sets out in detail the Aim, Focus, Service Beneficiaries, Products and Community Engagement activities for each of the ANDS Programs.