Aberystwyth University Launches CADAIR Institutional Repository

Aberystwyth University has launched CADAIR, its DSpace-based institutional repository.

Here's an excerpt from the press release:

The new service has been developed by the Subject Support and E-Library team in Information Services, led by Dr Talat Chaudhri and Stuart Lewis.

A successful two year pilot project, during which the team worked closely with the Departments of Computer Science and Information Studies, and the Institute of Mathematics and Physics, was concluded in early 2008. Currently the site features approximately 500 academic papers and dissertations by taught masters and PhD students.

Second Beta Version of Fedora 3.0 Released

The Fedora Commons has released the second beta version of Fedora 3.0.

Here's an excerpt from the announcement:

Fedora 3.0 features the Content Model Architecture (CMA), an integrated structure for persisting and delivering the essential characteristics of digital objects in Fedora. . . . The Fedora CMA plays a central role in the Fedora architecture, in many ways forming the over-arching conceptual framework for future development of Fedora Repositories.

Like a well-thumbed book on a shelf, digital content is stored with the expectation that intellectual works will be the same each time they are accessed, whether the content was put away yesterday, or many years ago. Fedora is a simple, flexible and evolvable approach to delivering and sharing the "essential characteristics" of enduring digital content. Librarians, archivists, records managers, media producers, authors and publishers use patterns of expression formats such as books, journals, articles, collections to convey the essential characteristics of content. The capabilities of digital tools combined with essential characteristics of digital works result in well-understood patterns of expression for different types of content models.

The software engineering community also utilizes patterns of expression for the development of complex computer systems. The same concepts that satisfy agile IT infrastructures can help provide solutions for creating, accessing and preserving content. The Fedora CMA builds on the Fedora architecture-downloaded more than 18,000 times in the last 12 months—to simplify use while unlocking potential.

Dan Davis explains the CMA in the context of Fedora 3.0, "It's a hybrid. The Fedora CMA handles content models that are used by publishers and others, and is also a computer model that describes an information representation and processing architecture." By combining these viewpoints, Fedora CMA has the potential to provide a way to build an interoperable repository for integrated information access within organizations and to provide durable access to our intellectual works.

UK ETD Support: Updated EThOS Toolkit Released

The EThOSnet Project has released an updated version of the EThOS Toolkit.

Here's an excerpt from the announcement:

In addition to full details of how your institution can participate, the interactive Toolkit provides practical information on how theses can be produced by students at your Institution so they can be accessed via EThOS and from your Institutional Repository. Accessed from its new location at http://ethostoolkit.cranfield.ac.uk the toolkit provides guidance on:

  • Putting forward the case for the importance of electronic theses (Culture Change)
  • Outlining the business case including information on which participation options suit (Business Needs)
  • Clear standards provided on technical requirements (Technical Requirements)
  • Practical materials and templates to be used for authors and supervisors in contributing to EThOS (Training and Guidance)

Presentations from APSR Workshop about Author Identity Management in Scholarly Communication Systems

The Australian Partnership for Sustainable Repositories has released presentations from its Identifying Researchers workshop. Both PDF and MP3 files are available.

Here's an excerpt from the workshop's web page:

The issue of managing researcher and author identities is a significant one that has an impact on a range of situations including, but not limited to, scholarly communications. This is an issue not only for researchers who nowadays interact with multiple identity and security systems but also for scholarly communications where the need to accurately identify authors and describe their scholarly resources is increasing in importance.

Foresite Project OAI-ORE Resource Maps Software

The Foresite Project has released the foresite-toolkit.

Here's an excerpt from the announcement (footnotes removed):

The Foresite project is pleased to announce the initial code of two software libraries for constructing, parsing, manipulating and serialising OAI-ORE Resource Maps. These libraries are being written in Java and Python, and can be used generically to provide advanced functionality to OAI-ORE aware applications, and are compliant with the latest release (0.9) of the specification. The software is open source, released under a BSD licence, and is available from a Google Code repository . . . .

Foresite is a JISC funded project which aims to produce a demonstrator and test of the OAI-ORE standard by creating Resource Maps of journals and their contents held in JSTOR, and delivering them as ATOM documents via the SWORD interface to DSpace. DSpace will ingest these resource maps, and convert them into repository items which reference content which continues to reside in JSTOR. The Python library is being used to generate the resource maps from JSTOR and the Java library is being used to provide all the ingest, transformation and dissemination support required in DSpace.

DSpace Foundation and Fedora Commons Investigate Joint Collaboration

The DSpace Foundation and the Fedora Commons have been recently investigating the possibility of joint collaboration.

Here's an excerpt from a Dspace-General message:

Over the last few weeks, we (Michele Kimpton and Sandy Payette) have been discussing the possibilities of our organizations collaborating. . . .

Over the past couple of weeks, we have had informal discussions with members of our communities, leaders in libraries and higher education, and Board members to get initial feedback as to whether they would support collaboration and the outcomes they would like to see as a result.

This past week, we convened members of both communities during the PASIG conference to get input and ideas regarding a collaboration.

Thus far, all of the stakeholders we have had the opportunity to talk with have been extremely supportive and excited about the possibility of the Fedora and DSpace communities working together in some capacity.

As a result of these discussions, we have agreed to move forward in our exploration of collaborative possibilities. Over the next several weeks our organizations will meet to plan the next steps in the process. Our intent is to bring together the ideas and expertise within both communities to come up with the most compelling issues to work on to best serve our communities.

Sustainability and Revenue Models for Online Academic Resources: An Ithaka Report Released

The Strategic Content Alliance has released Sustainability and Revenue Models for Online Academic Resources: An Ithaka Report.

Here's an excerpt from the announcement:

This paper was commissioned by the Joint Information Systems Committee (JISC) is the first step in a three-stage process aimed at gaining a more systematic understanding of the mechanisms for pursuing sustainability in not-for-profit projects. It focuses on what we call 'online academic resources' (OARs), which are projects whose primary aim is to make content and scholarly discourse available on the web for research, collaboration, and teaching. This includes scholarly journals and monographs as well as a vast array of new formats that are emerging to disseminate scholarship, such as preprint servers and wikis. It also includes digital collections of primary source materials, datasets, and audio-visual materials that universities, libraries, museums, archives and other cultural and educational institutions are putting online.

This work is being done as part of the planning work for the Strategic Content Alliance (SCA), so it emphasises the development and maintenance of digital content useful in the networked world. In this first stage, we have conducted an initial assessment of the relevant literature focused on not-for-profit sustainability, and have compared the processes pursued in the not-for-profit and education sectors with those pursued by commercial organisations, specifically in the newspaper industry. The primary goal of this initial report is to determine to what extent it would make sense to conduct a more in-depth study of the issues surrounding sustainability.

Public Beta of Object Reuse and Exchange Specifications (OAI-ORE) Released

The Open Archives Initiative has released the public beta of Object Reuse and Exchange Specifications.

Here's an excerpt from the press release:

Over the past eighteen months the Open Archives Initiative (OAI), in a project called Object Reuse and Exchange (OAI-ORE), has gathered international experts from the publishing, web, library, and eScience community to develop standards for the identification and description of aggregations of online information resources. These aggregations, sometimes called compound digital objects, may combine distributed resources with multiple media types including text, images, data, and video. The goal of these standards is to expose the rich content in these aggregations to applications that support authoring, deposit, exchange, visualization, reuse, and preservation. Although a motivating use case for the work is the changing nature of scholarship and scholarly communication, and the need for cyberinfrastructure to support that scholarship, the intent of the effort is to develop standards that generalize across all web-based information including the increasing popular social networks of “web 2.0”. The beta version of the OAI-ORE specifications and implementation documents are released to the public on June 2, 2008. These documents describe a data model to introduce aggregations as resources with URIs on the web. They also detail the machine-readable descriptions of aggregations expressed in the popular Atom syndication format, in RDF/XML, and RDFa.

Muradora 1.3 Released: Web-Based GUI for Fedora

The DRAMA team at Macquarie University has released version 1.3 release of Muradora.

Here's an excerpt from the announcement:

Muradora is a web-based GUI for the popular Fedora repository, built using enterprise Java Spring and Struts 2 frameworks. Amongst the common features found in a typical repository such as search, browse, self-submission, and versioning supports, Muradora enables flexible access control for end users (based on the XACML standard), inter-domain authentication and federated identity (using Shibboleth implementation of the SAML standard), and multiple metadata schema management (via W3C XForms standard).

Notable features in 1.3 release:

  • Faceted Search: By incorporating GSearch 2.0 with Solr support, users can perform faceted searches, i.e. one can now narrow down search results based on other categories.
  • All-in-one installation: There is now an installation script for Unix/Linux systems which will install all the necessary components for Muradora. The complete package is called "muradora-allinone".
  • RSS/Atom Feeds: Users can subscribe to collections (even non-public collections) and get notifications of new objects added to those collections.
  • Thumbnail preview and gallery view: Thumbnails are now generated automatically for images. Thanks to the work by the MediaShelf team, one can browse and search using either the traditional listing view or with the gallery view.

OAI2LODServer Version 0.2 Released

MediaSpaces has released Version 0.2 of the OAI2LODServer.

Here's a description from the software's home page:

The OAI2LOD Server exposes any OAI-PMH compliant metadata repository according to the Linked Data guidelines. This makes things and media objects accessible via HTTP URIs and query able via the SPARQL protocol. Parts of the OAI2LOD architecture, especially the front-end, are based on the D2R Server implementation.

Further, it provides a configurable linking mechanism based on string similarity metrics. This allows the automatic linking of OAI-PMH data with other open data sets such as DBPedia or any other OAI-PMH repository exposed via the OAI2LOD Server.

Repositories Support Project Briefings Released

The Repositories Support Project has released several new or updated briefings:

Key Services [ Paper ]

This briefing paper gives an overview of some of the
key services currently available to repository managers and provides further details on how to access and use them.

Metadata [ Paper ]

This paper explores the topic of metadata in the repository and includes advice and information on metadata schemas and application profiles.

Making Effective Use of Your Repository [ Paper ]

Repositories are both part of an institution’s local information provision and part of the developing global open access information environment. This briefing paper discusses these contexts, helping the repository to serve the institution’s business needs effectively.

Repository Policy Framework – Updated [Paper]

Updated information about giving structure to your repository planning through the implementation of a policy framework.

"Institutional Repository Checklist for Serving Institutional Management"

Leslie Carr, Wendy White, Susan Miles, and Bill Mortimer have deposited a for-discussion draft of the "Institutional Repository Checklist for Serving Institutional Management" in OR08 Publications repository.

Here's the abstract:

This document is a For Discussion draft that came from the Research Assessment Experience session of the EPrints User Group at Open Repositories 2008. Comment is invited from managers of all repository platforms to share experience of the demands of the processes involved in supporting research assessment at an institutional level. The aim of institutional repositories has focused on serving the interests of faculty—researchers and teachers—by collecting their intellectual outputs for long-term access, preservation and management. However, experience shows that in order to attain the engagement of the faculty, it is necessary to obtain the support of the institutional management. But in order to gain management support, a repository has to demonstrate a measureable and effective contribution to current management agendas and concerns—e.g. research management or research assessment. Such contributions are achievable, but only if the repository fulfils a number of criteria that are in addition to its usual library-faculty roles. This document lists the success criteria as distilled from the authors' recent experience.

Repositories Support Project Releases Briefing Papers: Open Archives Initiative-Protocol for Metadata Harvesting and Workflows

The Repositories Support Project has released two briefing papers: Open Archives Initiative-Protocol for Metadata Harvesting and Workflows (i.e., digital repository submission workflows). Both briefing papers provide succinct introductions to the topic at hand.

Two JISC Open Archives Initiative Object Reuse and Exchange Projects

JISC is funding two projects to do small-scale OAI-ORE tests:

TheOREM (Theses with ORE Metadata), at the University of Cambridge, aims to:

  • Test the applicability of the ORE standard in a realistic scholarly setting—thesis description, submission and publication.
  • Demonstrate the advantages of the ORE approach in complex object publication, by combining it with existing web-standards compliant technologies.
  • Provide examples to fully exercise the ORE specifications in order to provide validation and future direction.

FORESITE (Functional Object Reuse and Exchange: Supporting Information Topology Experiments) will create Resource Map descriptions of JSTOR's holdings, and then ingest them into the DSpace institutional repository system via the SWORD protocol, creating external references back to the original files. The description work will be automated, and the system for achieving this implemented at the University of Liverpool. The SWORD protocol will be implemented within DSpace by HP Labs along with other extensions necessary.

For further information, see the FORESITE proposal, A Preview of the TheOREM Project, and the TheOREM proposal.

Isilon's IQ Clustered Storage System Chosen by Michigan and Rice for Digital Repository Storage

Isilon Systems has announced that its IQ Clustered Storage System will be used to support the Michigan Digitization Project and the Rice Digital Scholarship Archive.

Here's an excerpt from the press release about Michigan:

Isilon Systems . . . today announced that the University of Michigan (U-M) has selected Isilon's IQ clustered storage system as the primary repository for its Michigan Digitization Project. In partnership with Google, the University of Michigan and its Michigan Digitization Project are digitizing more than 7.5 million books, ensuring these valuable resources are available to the public into perpetuity. This enormous undertaking includes the storage of digital copies of all unique books within the libraries of the entire Big-Ten Conference and directly supports Google Book Search, which aims to create a single, comprehensive, searchable, virtual card catalog of all books in all languages. The University of Michigan, in partnership with Indiana University (IU), is leveraging Isilon's IQ clustered storage system to create a Shared Digital Repository (SDR) of the universities' published library materials. Using Isilon IQ, U-M and IU are able consolidate digital copies of millions of books into one, single, shared pool of storage to meet the rapidly growing storage demand of its massive book digitization project. . . .

In conjunction with the Committee for Institutional Cooperation (CIC), an academic partnership formed by the universities of the Big-Ten Conference and the University of Chicago, the University of Michigan and Indiana University are working to create a Shared Digital Repository (SDR) which will mirror the content from U-M and the CIC libraries found in Google Book Search. Using Isilon IQ clustered storage, featuring its OneFS® operating system software, U-M has eliminated disparate data silos to create a shared pool of storage for the digitization efforts of these partner institutions. Each digitized book is approximately 55 MB in size, downloading at a rate of 3 MB/second, 24 hours a day, 7 days a week, for the entire six year duration of the project. Isilon IQ reduces storage management time, enabling U-M to accelerate the book scanning process, preserve valuable materials, and ultimately expand the research and learning capabilities for millions of users across the globe.

Here's an excerpt from the press release about Rice:

Isilon . . . today announced that Rice University has selected Isilon's IQ clustered storage system as its central repository for digital multimedia, including video of selected speeches by international dignitaries and musical performances from the Shepherd School of Music. In an effort to preserve the many historic events held at these prestigious venues and ensure the productions are available to the public into perpetuity, Rice has deployed Isilon clustered storage to consolidate hundreds of recorded musical performances and keynote speeches into a single, highly scalable and reliable shared pool of storage for the Rice Digital Scholarship Archive, an institutional repository based on the DSpace software platform. . . .

Through a cooperative effort between Rice University's Digital Library Initiative, Fondren Library and Central IT department, the university has created a central repository for all its critical multi-media content, enabling a variety of departments to execute on vital, content-driven projects simultaneously, activity that was impossible with traditional storage. Prior to using Isilon IQ, Rice's storage management for the Digital Scholarship archiving system was unable to effectively support management of large digital video and audio files that required streaming for delivery. These assets, therefore, were stored on a variety of streaming servers by various groups across campus, creating multiple access bottlenecks that led to inefficient storage management and undue IT cost and complexity. By unifying all of its digital content onto one, easy to use, "pay as you grow" clustered storage system, Rice University has removed costly data access and management barriers and dramatically simplified its storage architecture. Additionally, using Isilon's SmartQuotas provisioning and quota management software application, Rice is also storing its Language Center's multi-media course work and its Central IT department's webcasts on Isilon IQ, delivering immediate, concurrent data access to multiple users and user groups, further reducing storage management costs to maximize system efficiency.

Rice University will stream its collection of musical performances from the Shepherd School, as well as its video library of the many world leaders and dignitaries that have spoken at the Baker Institute, to thousands of users online. This operation necessitates the use of multiple media servers, using Windows, Quicktime and Real Player formats. Isilon clustered storage communicates natively over CIFS, NFS FTP, and HTTP, as well as interoperating with Windows, Mac and Linux environments, enabling seamless integration with Rice's variety of server formats and enabling all content to be streamed from one, central, easily and immediately accessible storage system. With Isilon IQ, Rice's entire collection of multi-media is accessible to all its servers 24x7x365, ensuring that the media streaming operations are not only efficient and cost-effective, but prepared to meet high user demand.

Summary of Experiences with E-Journal Publishing Software and Institutional Repositories

Sunny Yoon, Digital Resources Coordinator at the City University of New York, posted a query on the CODE4LIB list about the use of e-journal publishing software and its integration into institutional repositories.

She has now posted an interesting summary of responses to her query.

You can also read the replies that were posted to the list under the heading "e-journal publishing software."

Open Repositories 2008 Presentations

Presentations from the Open Repositories 2008 conference are available in the OR08 Publications repository.

The easiest way to find presentations is to use the Browse by Subject capability; however, both simple and advanced search functions are available as well.

Currently, the repository holds over 90 documents. You can track new additions at the Latest Additions to OR08 Publications page (RSS feed). It's anticipated that all documents will be available by 4/13/08.

Here's a brief selection of available presentations:

Project Reports from the Andrew W. Mellon Foundation's 2008 Research in Information Technology Retreat

Project reports from the Andrew W. Mellon Foundation's 2008 Research in Information Technology retreat are now available.

Here are selected project briefing reports:

Weblog Reports from Open Repositories 2008

Below are selected Weblog reports from Open Repositories 2008.