Institutional Repositories – Page 22

Presentations from the CETIS Metadata and Digital Repository Special Interest Group Meeting

CETIS (Centre for Educational Technology & Interoperability Standards) has released presentations from its Metadata and Digital Repository Special Interest Group's February 12, 2008 meeting, which featured talks about selected projects in the JISC Repositories and Preservation Programme. SlideShare presentations and MP3 files are available.

The projects covered were Becta Vocabulary Management Services, DC-Education application profile, EnTag (Enhanced Tagging for Discovery), FeedForward, OAI-ORE, and SWORD.

The Data Documentation Initiative (DDI) and Institutional Repositories

JISC’s DataShare project has released The Data Documentation Initiative (DDI) and Institutional Repositories.

Here's an excerpt from the "Introduction":

One of the key issues present in IRs is dealing with the description of items held in the repository. Data are not different from other digital materials and need to be described not just for discovery but also for preservation and reuse. The social science data archiving community has been working for many years on a metadata standard to describe datasets, and a new version is about to be published, the Data Documentation Initiative (DDI) 3.0.

This document reports back from the DDI 3 workshop "Using DDI 3.0 to Support Preservation, Management, Access and Dissemination Systems for Social Science Data" held at the Schloss Dagstuhl in Germany in November 2007. It intends to present the DDI standard to repository managers, data librarians and data managers and provide background information to help them to examine how the DDI fits with developments in their institutional repositories for research-generated data. The report discusses the appropriateness of using the different DDI versions to address the requirements of research data in IRs. It brings together some of the key questions of the DataShare project with regards to access management, linking to other materials and versioning of datasets.

Presentations from the Open Access Collections Workshop

Presentations from the Open Access Collections workshop are now available.

Here are selected presentations:

Muradora Version 1.2.1 Released: Federated Identity and Authorization for Fedora

The DRAMA (Digital Repository Authorization Middleware Architecture) team has released version 1.2.1 of Muradora.

Here's an excerpt from the Muradora home page that describes Muradora:

Muradora is an easy to use repository application that supports federated identity (via Shibboleth authentication) and flexible authorization (using XACML). Muradora leverages the modularity, flexibility and scalability of the well-known Fedora repository.

Muradora's unique vision is one where Fedora forms the core back-end repository, while different front-end applications (such as portlets or standalone web interfaces) can all talk to the same instance of Fedora, and yet maintain a consistent approach to access control.

Read more about it at "Muradora 1.2.1 Release."

DSpace 1.5 Beta Now Available

The first beta version of the DSpace 1.5 repository software is now available.

Read more about at "DSpace 1.5 Beta 1 Released."

VALA 2008 Presentations

Presentations from the VALA 2008 conference are now available.

Here's a selection of presentations:

Michelle McLean has blogged a number of VALA 2008 sessions in Connecting Librarian postings.

Caplan's The Preservation of Digital Materials Published

Library Technology Reports has published The Preservation of Digital Materials, written by noted digital preservation expert Priscilla Caplan (Assistant Director for Digital Library Services at the Florida Center for Library Automation), as its volume 4, number 2 issue.

E-Print Preservation: SHERPA DP: Final Report of the SHERPA DP Project

JISC has released SHERPA DP: Final Report of the SHERPA DP Project.

Here's an excerpt from the "Executive Summary":

The SHERPA DP project (2005–2007) investigated the preservation of digital resources stored by institutional repositories participating in the SHERPA project. An emphasis was placed on the preservation of e-prints—research papers stored in an electronic format, with some support for other types of content, such as electronic theses and dissertations.

The project began with an investigation of the method that institutional repositories, as Content Providers, may interact with Service Providers. The resulting model, framed around the OAIS, established a Co-operating archive relationship, in which data and metadata is transferred into a preservation repository subsequent to it being made available. . . .

The Arts & Humanities Data Service produced a demonstrator of a Preservation Service, to investigate the operation of the preservation service and accepted responsibility for the preservation of the digital objects for a three-year period (two years of project funding, plus one year).

The most notable development of the Preservation Service demonstrator was the creation of a reusable service framework that allows the integration of a disparate collection of software tools and standards. The project adopted Fedora as the basis for the preservation repository and built a technical infrastructure necessary to harvest metadata, transfer data, and perform relevant preservation activities. Appropriate software tools and standards were selected, including JHOVE and DROID as software tools to validate data objects; METS as a packaging standard; and PREMIS as a basis on which to create preservation metadata. . . .

A number of requirements were identified that were essential for establishing a disaggregated service for preservation, most notably some method of interoperating with partner institutions and he establishment of appropriate preservation policies. . . . In its role as a Preservation Service, the AHDS developed a repository-independent framework to support the EPrints and DSpace-based repositories, using OAI-PMH as common method of connecting to partner institutions and extracting digital objects.

JISC Programme Synthesis Study: Supporting Digital Preservation and Asset Management in Institutions

JISC has published JISC Programme Synthesis Study: Supporting Digital Preservation and Asset Management in Institutions: A Review of the 4-04 Programme on Digital Preservation and Asset Management in Institutions for the JISC Information Environment: Part II: Programme Synthesis.. The report covers a number of projects, including LIFE, MANDATE, PARADIGM, PRESERV, and SHERPA DP.

Here's an excerpt from UKOLN News:

Written by Maureen Pennock, DCC researcher at UKOLN, the study provides a comprehensive and categorised overview of the outputs from the entire programme. Categories include training, costs and business models, life cycles, repositories, case studies, and assessment and surveys. Each category includes detailed information on project outputs and references a number of re-usable project-generated tools that range from software services to checklists and guidance.

REPOMAN-L (Institutional Repository Managers' Mailing List) Launched

Richard Griscom, University of Pennsylvania, and Leah Vanderjagt, University of Alberta, have launched REPOMAN-L (Institutional Repository Managers' Mailing List).

Here's an excerpt from the announcement:

We have created REPOMAN-L (Institutional Repository Managers' Mailing List) as an open forum for the discussion of issues, great and small, that confront repository managers. We hope that you will subscribe and participate enthusiastically, and use this list for problem-solving and sharing of advice; for example:

to poll the group on practices at their institutions

to ask about any aspect of development from policy to outreach

initiatives to software evaluation

to share links to useful tools and references

to explore rationale around decisions you're making about your repository. . . .

The list is purposefully unaffiliated with any institution, initiative, repository software platform, or conceptual idea such as open access; the list would of course not exclude discussion of these areas, but we ask subscribers to consider initiating these discussions on lists set up specifically for the topics and then bring summaries of relevance to this list.

Presentations from eResearch Australasia 2007

Presentations from eResearch Australasia 2007 are now available.

Here are selected presentations:

Podcasts from Clever Collections: A National Showcase of Technical Innovations for Digital Collections

Podcasts of sessions at APSR's Clever Collections: A National Showcase of Technical Innovations for Digital Collections conference are now available.

Here are the titles of some repository-oriented presentations:

"Enhancing Research Collections by Harvesting Community Annotations"
"Integrating Repositories with Researcher Environments"
"Object Re-Use and Exchange (ORE): Practice and Experience in the Open Language Archives Community"
"The Repository Interoperability Framework"
"Taking Aim and (Mostly) Hitting Our Targets: from DART to ARCHER"

New Mailing Lists: JISC-SHIBBOLETH-LIBRARIES and Sword-app-tech

Two mailing lists have been recently established: JISC-SHIBBOLETH-LIBRARIES and sword-app-tech.

Excerpt from the JISC-SHIBBOLETH announcement:

Many institutions are now at the stage with their implementation of federated access management where issues directly impacting libraries are being considered and managed. This includes discovery processes for end-users, testing and changing access to federated service providers, dealing with different user definitions, managing license and resource information and changing send-user information.

To help support this process we have established a separate mailing list to enable discussion and exchange of views directly relating to library issues.

Excerpt from the Fedora-commons-users announcement:

A new mailing list has been created for discussion, bug reports, implementations questions and development ideas relating to SWORD (Simple Web-service Offering Repository Deposit).

SWORD is a protocol for interoperable deposit between repository platforms. It was developed by a JISC project during 2007, building on earlier work to define a deposit protocol, and is based on the Atom Publishing Protocol.

Citation, Location, and Deposition in Discipline & Institutional Repositories

The JISC CLADDIER project has published Citation, Location, and Deposition in Discipline & Institutional Repositories: CLADDIER Project Report III, Recommendations for Data/Publication Linkage.

Here's an excerpt from the abstract:

A key aim of the CLADDIER project is to investigate the cross-linking and citation of resources (in particular data and their associated publications) held in institutional and subject-based repositories within the research sector. Typically traditional citations are partial in that they are "backward citations", referring to work which influenced the current research, and they only cite other formal publications, ignoring other artefacts which are the output of research, in particular research data. Online repositories storing more dynamic digital objects gives the opportunity to provide a more complete picture of the relationships between them, with backward and forward citations to data and publications being propagated between repositories.

This report motivates the cross-citations of data from the CLADDIER use case example, and considers the approaches which have been implemented to harvest and propagate citation information. Most of these existing approaches depend on centralised services, which were considered unsatisfactory in an environment where independent repositories wish to maintain control of their resources and do not wish to be dependant on third-party services. Criteria are identified for building a Citation Notification Service to propagate citation references and links between repositories, including using a peer-to-peer protocol. A number of different architectures are proposed and evaluated.

The requirement for a light-weight peer-to-peer service which is as widely applicable as possible lead to the selection of Linkback services, in particular Trackback which provides an existing simple specification which can be implemented quickly and adapted to the requirements of citation notification. A detailed description the Trackback protocol is then given, together with the design of the adaptations and extensions identified as required for citation notification. This extended Trackback protocol has been implemented in the STFC ePubs institutional repository; this implementation is described and a use case is described.

Geoffrey Bilder has commented on the report in "CLADDIER Final Report."

DSpace Newsletter Launched

The DSpace Foundation now has a newsletter, NewSpace. The first issue has just been published.

ARL Establishes Task Force on Digital Repository Issues

The Association of Research Libraries has established a Task Force on Digital Repository Issues, chaired by Carole Moore, Chief Librarian for the University of Toronto Libraries.

Machine Services for Metadata Discovery and Aggregation—metadata+ Report

JISC has released Machine Services for Metadata Discovery and Aggregation—metadata+.

Here's an excerpt from the Executive Summary:

The main aim of the project is to develop an interoperability demonstrator to explore the technical aspects of providing a service-oriented infrastructure to facilitate metadata discovery and aggregation. The project developed a test bed that exposes metadata through standard search and linking protocols. Metadata mapping work was undertaken to enable the test bed to provide search response in multiple metadata schemas that are widely used in digital library and e-learning.

The core of the test bed consists of an open source digital repository—Fedora. Off-the-shelf, the repository provides web services for metadata searching and substantial content management and security features particularly suitable for real-life use scenarios. Since the search protocol considered in this project requires additional features that are not available from the repository, modifications to the repository source code were made. The modifications also involve incorporating the metadata mapping requirement such that search responses from different metadata formats can be facilitated.

A basic demonstrator (project website) has been created to exemplify how the search protocol can be used for discovering and aggregating metadata, as well as presenting them in coherent formats relevant to the intended presentation contexts. The metadata sources include publisher and digital libraries providing both bibliographic and user-generated (enrichment) metadata such as reviews and recommendations. In addition, the project demonstrated a novel use of the search protocol to dynamically create e-learning content packages, digital library metadata collection and news feeds.

Several digital libraries initiatives have evaluated the test bed infrastructure for real use scenarios. These libraries are an extended form of the test bed demonstrator and provide relevant facilities such metadata wiki (editor) and annotation services for gauging enrichment metadata (review, rating and recommendation) from users. They will continue the objectives of this project particularly on improving the test bed infrastructure and exploring the aggregated use of enrichment metadata, to enable the academic and research user communities to add values to bibliographic metadata from the publishers and libraries communities.

Web 2.0 Meets EPrints: The SNEEP Project

The SNEEP project has released an alpha version of SNEEP.comment, which adds user comments to the EPrints repository software.

SNEEP is also working on a tagging component for EPrints.

Institutional Repositories, Tout de Suite

Institutional Repositories, Tout de Suite, the latest Digital Scholarship publication, is designed to give the reader a very quick introduction to key aspects of institutional repositories and to foster further exploration of this topic through liberal use of relevant references to online documents and links to pertinent websites. It is under a Creative Commons Attribution-Noncommercial 3.0 United States License, and it can be freely used for any noncommercial purpose in accordance with the license.

University of Michigan Libraries Make over 100,000 Records for Digitized Books Available for Harvesting

The University of Michigan Libraries have made over 100,000 metadata records from its MBooks collection available for OAI-PMH harvesting. The records are for digitized books in the public domain.

Here's an excerpt from the announcement:

The University of Michigan Library is pleased to announce that records from our MBooks collection are available for OAI harvesting. The MBooks collection consists of materials digitized by Google in partnership with the University of Michigan.

http://quod.lib.umich.edu/cgi/o/oai/oai?verb=Identify

Only records for MBooks available in the public domain are exposed. We have split these into sets containing public domain items according to U.S. copyright law, and public domain items worldwide. There are currently over 100,000 records available for harvesting. We anticipate having 1 million records available when the entire U-M collection has been digitized by Google.

University of Michigan Libraries Release the UMich OAI Toolkit

The University of Michigan Libraries have released the UMich OAI Toolkit.

Here's an excerpt from the announcement:

This toolkit contains both harvester and data provider, both written in Perl. . . .

UMHarvester is a robust tool using LWP for harvesting nigh on every OAI data provider available. It allows for incremental harvesting, has multiple re-try options, and a batch harvest tool (Batch_UMHarvest) that can automatically perform incremental harvesting.

UMProvider relies heavily on libxml (XML::LibXML) and will store the data in nearly any relational database. It functions by harvesting from a database of records, making rights determinations from a separate database, and providing the resulting set of records.

Originally, only the UMHarvester was available from UM's DLXS software site. The UMProvider tool is newly developed and takes the place of our DLXS data provider tool.

New Release of BioMed Central's Open Repository, a Hosted Institutional Repository Service

BioMed Central has released version 1.4.9 of Open Repository, its DSpace-based, hosted institutional repository service.

Here's an excerpt from the press release:

Open Repository version 1.4.9 has several new features that are designed to enhance the customer experience. The release offers an improved user interface, making it easier for customers to browse and submit their material online. Additionally, institutions can convert their Word, Excel, PowerPoint, Text and RTF documents to PDF format. Customers can also set up RSS feeds, and customize lists and search fields, adding value to the already robust platform.

Stable Version of SPECTRa Released: Software for Depositing Chemical Data into Repositories

A stable version of SPECTRa has been released. SPECTRa is designed to facilitate the deposit of chemical data into digital repositories.

The JISC-funded SPECTRa (Submission, Preservation and Exposure of Chemistry Teaching and Research Data a Digital Repository for the Chemical Community) project's final report is also available.

National Science Digital Library Releases Initial Fedora-based NCore Components

The National Science Digital Library Core Integration team at Cornell University has released a partial version of NCore, a "general platform for building semantic and virtual digital libraries united by a common data model and interoperable applications," which is built upon Fedora.

Here's an excerpt from the NSDL posting:

The NCore platform consists of a central repository built on top of Fedora, a data model, an API, and a number of fundamental services such as full-text search or OAI-PMH. Innovative NSDL services and tools that empower users as content creators are now built on, or transitioning to, the NCore platform. These include: the Expert Voices blogging system (http://expertvoices.nsdl.org/);the NSDL Wiki (http://wiki.nsdl.org/index.php/NSDL_Wiki); the NSDL OAI-PMH metadata ingest aggregation system; the OAI-PMH service for distributing public NSDL metadata; the NSDL Collection System (NCS), derived from the DLESE Collection system (DCS); the NSDL Search service, and the OnRamp content management and distribution system (http://onramp.nsdl.org).

Because NCore is a general Fedora-based open source platform useful beyond NSDL, Core Integration developers at Cornell University have made the repository and API code components of NCore available for download at the NCore project on Sourceforge (http://sourceforge.net/projects/nsdl-core). Over the next six months, NSDL will release the code for major tools and services that comprise the full NCore suite on SourceForge.

For further information, see the NCore presentation.

Towards the Australian Data Commons: A Proposal for an Australian National Data Service

The Australian eResearch Infrastructure Council has released Towards the Australian Data Commons: A Proposal for an Australian National Data Service.

Here's an excerpt from the "Overview":

This paper is designed to encourage, inform and ultimately summarise the discussions around the appropriate strategic and technical descriptions of the Australian National Data Service; to fill in the outline in the Platforms for Collaboration investment plan.

To do so, the paper:

introduces the Australian National Data Service (ANDS) and the driving forces behind its creation;

provides a rationale for the services that ANDS will provide, and the programs through which the services will be offered; and

describes in detail the ANDS programs.

Part One (Background) provides a brief summary of the reasons to focus on data management, as well as an overview of ANDS, and identifies some issues associated with implementation.

Part Two (Rationale) sets out the systemic issues associated with achieving a research data commons, and provides the resultant rationale for the services that ANDS will offer the programs that they will be delivered through.

Part Three (Detailed Descriptions of ANDS Programs) sets out in detail the Aim, Focus, Service Beneficiaries, Products and Community Engagement activities for each of the ANDS Programs.