Aberystwyth University Launches CADAIR Institutional Repository

Aberystwyth University has launched CADAIR, its DSpace-based institutional repository.

Here's an excerpt from the press release:

The new service has been developed by the Subject Support and E-Library team in Information Services, led by Dr Talat Chaudhri and Stuart Lewis.

A successful two year pilot project, during which the team worked closely with the Departments of Computer Science and Information Studies, and the Institute of Mathematics and Physics, was concluded in early 2008. Currently the site features approximately 500 academic papers and dissertations by taught masters and PhD students.

Second Beta Version of Fedora 3.0 Released

The Fedora Commons has released the second beta version of Fedora 3.0.

Here's an excerpt from the announcement:

Fedora 3.0 features the Content Model Architecture (CMA), an integrated structure for persisting and delivering the essential characteristics of digital objects in Fedora. . . . The Fedora CMA plays a central role in the Fedora architecture, in many ways forming the over-arching conceptual framework for future development of Fedora Repositories.

Like a well-thumbed book on a shelf, digital content is stored with the expectation that intellectual works will be the same each time they are accessed, whether the content was put away yesterday, or many years ago. Fedora is a simple, flexible and evolvable approach to delivering and sharing the "essential characteristics" of enduring digital content. Librarians, archivists, records managers, media producers, authors and publishers use patterns of expression formats such as books, journals, articles, collections to convey the essential characteristics of content. The capabilities of digital tools combined with essential characteristics of digital works result in well-understood patterns of expression for different types of content models.

The software engineering community also utilizes patterns of expression for the development of complex computer systems. The same concepts that satisfy agile IT infrastructures can help provide solutions for creating, accessing and preserving content. The Fedora CMA builds on the Fedora architecture-downloaded more than 18,000 times in the last 12 months—to simplify use while unlocking potential.

Dan Davis explains the CMA in the context of Fedora 3.0, "It's a hybrid. The Fedora CMA handles content models that are used by publishers and others, and is also a computer model that describes an information representation and processing architecture." By combining these viewpoints, Fedora CMA has the potential to provide a way to build an interoperable repository for integrated information access within organizations and to provide durable access to our intellectual works.

UK ETD Support: Updated EThOS Toolkit Released

The EThOSnet Project has released an updated version of the EThOS Toolkit.

Here's an excerpt from the announcement:

In addition to full details of how your institution can participate, the interactive Toolkit provides practical information on how theses can be produced by students at your Institution so they can be accessed via EThOS and from your Institutional Repository. Accessed from its new location at http://ethostoolkit.cranfield.ac.uk the toolkit provides guidance on:

  • Putting forward the case for the importance of electronic theses (Culture Change)
  • Outlining the business case including information on which participation options suit (Business Needs)
  • Clear standards provided on technical requirements (Technical Requirements)
  • Practical materials and templates to be used for authors and supervisors in contributing to EThOS (Training and Guidance)

Presentations from APSR Workshop about Author Identity Management in Scholarly Communication Systems

The Australian Partnership for Sustainable Repositories has released presentations from its Identifying Researchers workshop. Both PDF and MP3 files are available.

Here's an excerpt from the workshop's web page:

The issue of managing researcher and author identities is a significant one that has an impact on a range of situations including, but not limited to, scholarly communications. This is an issue not only for researchers who nowadays interact with multiple identity and security systems but also for scholarly communications where the need to accurately identify authors and describe their scholarly resources is increasing in importance.

BagIt: New LC/CDL Format for Transferring Digital Content between Cultural Institutions

The Library of Congress and the California Digital Library have established a new format called BagIt for transferring large data collections between cultural institutions.

Read more about it at "The BagIt File Package Format (V0.94)" and "Library Develops Format for Transferring Digital Content."

Foresite Project OAI-ORE Resource Maps Software

The Foresite Project has released the foresite-toolkit.

Here's an excerpt from the announcement (footnotes removed):

The Foresite project is pleased to announce the initial code of two software libraries for constructing, parsing, manipulating and serialising OAI-ORE Resource Maps. These libraries are being written in Java and Python, and can be used generically to provide advanced functionality to OAI-ORE aware applications, and are compliant with the latest release (0.9) of the specification. The software is open source, released under a BSD licence, and is available from a Google Code repository . . . .

Foresite is a JISC funded project which aims to produce a demonstrator and test of the OAI-ORE standard by creating Resource Maps of journals and their contents held in JSTOR, and delivering them as ATOM documents via the SWORD interface to DSpace. DSpace will ingest these resource maps, and convert them into repository items which reference content which continues to reside in JSTOR. The Python library is being used to generate the resource maps from JSTOR and the Java library is being used to provide all the ingest, transformation and dissemination support required in DSpace.

Version 72, Scholarly Electronic Publishing Bibliography

Version 72 of the Scholarly Electronic Publishing Bibliography is now available from Digital Scholarship. This selective bibliography presents over 3,250 articles, books, and other digital and printed sources that are useful in understanding scholarly electronic publishing efforts on the Internet.

This version adds hundreds of links to freely available journal articles from publishers as well as to e-prints of published articles housed in disciplinary archives and institutional repositories. All article references were checked for the availability of such free content.

These links have also been added to a revised version of the Scholarly Electronic Publishing Bibliography: 2007 Annual Edition. Annual editions of the Scholarly Electronic Publishing Bibliography are PDF files designed for printing.

The bibliography has the following sections (revised sections are in italics):

1 Economic Issues
2 Electronic Books and Texts
2.1 Case Studies and History
2.2 General Works
2.3 Library Issues
3 Electronic Serials
3.1 Case Studies and History
3.2 Critiques
3.3 Electronic Distribution of Printed Journals
3.4 General Works
3.5 Library Issues
3.6 Research
4 General Works
5 Legal Issues
5.1 Intellectual Property Rights
5.2 License Agreements
6 Library Issues
6.1 Cataloging, Identifiers, Linking, and Metadata
6.2 Digital Libraries
6.3 General Works
6.4 Information Integrity and Preservation
7 New Publishing Models
8 Publisher Issues
8.1 Digital Rights Management
9 Repositories, E-Prints, and OAI
Appendix A. Related Bibliographies
Appendix B. About the Author
Appendix C. SEPB Use Statistics

Scholarly Electronic Publishing Resources includes the following sections:

Cataloging, Identifiers, Linking, and Metadata
Digital Libraries
Electronic Books and Texts
Electronic Serials
General Electronic Publishing
Images
Legal
Preservation
Publishers
Repositories, E-Prints, and OAI
SGML and Related Standards

An article about the bibliography ("Evolution of an Electronic Book: The Scholarly Electronic Publishing Bibliography") has been published in The Journal of Electronic Publishing.

DSpace Foundation and Fedora Commons Investigate Joint Collaboration

The DSpace Foundation and the Fedora Commons have been recently investigating the possibility of joint collaboration.

Here's an excerpt from a Dspace-General message:

Over the last few weeks, we (Michele Kimpton and Sandy Payette) have been discussing the possibilities of our organizations collaborating. . . .

Over the past couple of weeks, we have had informal discussions with members of our communities, leaders in libraries and higher education, and Board members to get initial feedback as to whether they would support collaboration and the outcomes they would like to see as a result.

This past week, we convened members of both communities during the PASIG conference to get input and ideas regarding a collaboration.

Thus far, all of the stakeholders we have had the opportunity to talk with have been extremely supportive and excited about the possibility of the Fedora and DSpace communities working together in some capacity.

As a result of these discussions, we have agreed to move forward in our exploration of collaborative possibilities. Over the next several weeks our organizations will meet to plan the next steps in the process. Our intent is to bring together the ideas and expertise within both communities to come up with the most compelling issues to work on to best serve our communities.

Sustainability and Revenue Models for Online Academic Resources: An Ithaka Report Released

The Strategic Content Alliance has released Sustainability and Revenue Models for Online Academic Resources: An Ithaka Report.

Here's an excerpt from the announcement:

This paper was commissioned by the Joint Information Systems Committee (JISC) is the first step in a three-stage process aimed at gaining a more systematic understanding of the mechanisms for pursuing sustainability in not-for-profit projects. It focuses on what we call 'online academic resources' (OARs), which are projects whose primary aim is to make content and scholarly discourse available on the web for research, collaboration, and teaching. This includes scholarly journals and monographs as well as a vast array of new formats that are emerging to disseminate scholarship, such as preprint servers and wikis. It also includes digital collections of primary source materials, datasets, and audio-visual materials that universities, libraries, museums, archives and other cultural and educational institutions are putting online.

This work is being done as part of the planning work for the Strategic Content Alliance (SCA), so it emphasises the development and maintenance of digital content useful in the networked world. In this first stage, we have conducted an initial assessment of the relevant literature focused on not-for-profit sustainability, and have compared the processes pursued in the not-for-profit and education sectors with those pursued by commercial organisations, specifically in the newspaper industry. The primary goal of this initial report is to determine to what extent it would make sense to conduct a more in-depth study of the issues surrounding sustainability.

Public Beta of Object Reuse and Exchange Specifications (OAI-ORE) Released

The Open Archives Initiative has released the public beta of Object Reuse and Exchange Specifications.

Here's an excerpt from the press release:

Over the past eighteen months the Open Archives Initiative (OAI), in a project called Object Reuse and Exchange (OAI-ORE), has gathered international experts from the publishing, web, library, and eScience community to develop standards for the identification and description of aggregations of online information resources. These aggregations, sometimes called compound digital objects, may combine distributed resources with multiple media types including text, images, data, and video. The goal of these standards is to expose the rich content in these aggregations to applications that support authoring, deposit, exchange, visualization, reuse, and preservation. Although a motivating use case for the work is the changing nature of scholarship and scholarly communication, and the need for cyberinfrastructure to support that scholarship, the intent of the effort is to develop standards that generalize across all web-based information including the increasing popular social networks of “web 2.0”. The beta version of the OAI-ORE specifications and implementation documents are released to the public on June 2, 2008. These documents describe a data model to introduce aggregations as resources with URIs on the web. They also detail the machine-readable descriptions of aggregations expressed in the popular Atom syndication format, in RDF/XML, and RDFa.

Muradora 1.3 Released: Web-Based GUI for Fedora

The DRAMA team at Macquarie University has released version 1.3 release of Muradora.

Here's an excerpt from the announcement:

Muradora is a web-based GUI for the popular Fedora repository, built using enterprise Java Spring and Struts 2 frameworks. Amongst the common features found in a typical repository such as search, browse, self-submission, and versioning supports, Muradora enables flexible access control for end users (based on the XACML standard), inter-domain authentication and federated identity (using Shibboleth implementation of the SAML standard), and multiple metadata schema management (via W3C XForms standard).

Notable features in 1.3 release:

  • Faceted Search: By incorporating GSearch 2.0 with Solr support, users can perform faceted searches, i.e. one can now narrow down search results based on other categories.
  • All-in-one installation: There is now an installation script for Unix/Linux systems which will install all the necessary components for Muradora. The complete package is called "muradora-allinone".
  • RSS/Atom Feeds: Users can subscribe to collections (even non-public collections) and get notifications of new objects added to those collections.
  • Thumbnail preview and gallery view: Thumbnails are now generated automatically for images. Thanks to the work by the MediaShelf team, one can browse and search using either the traditional listing view or with the gallery view.

OAI2LODServer Version 0.2 Released

MediaSpaces has released Version 0.2 of the OAI2LODServer.

Here's a description from the software's home page:

The OAI2LOD Server exposes any OAI-PMH compliant metadata repository according to the Linked Data guidelines. This makes things and media objects accessible via HTTP URIs and query able via the SPARQL protocol. Parts of the OAI2LOD architecture, especially the front-end, are based on the D2R Server implementation.

Further, it provides a configurable linking mechanism based on string similarity metrics. This allows the automatic linking of OAI-PMH data with other open data sets such as DBPedia or any other OAI-PMH repository exposed via the OAI2LOD Server.

Repositories Support Project Briefings Released

The Repositories Support Project has released several new or updated briefings:

Key Services [ Paper ]

This briefing paper gives an overview of some of the
key services currently available to repository managers and provides further details on how to access and use them.

Metadata [ Paper ]

This paper explores the topic of metadata in the repository and includes advice and information on metadata schemas and application profiles.

Making Effective Use of Your Repository [ Paper ]

Repositories are both part of an institution’s local information provision and part of the developing global open access information environment. This briefing paper discusses these contexts, helping the repository to serve the institution’s business needs effectively.

Repository Policy Framework – Updated [Paper]

Updated information about giving structure to your repository planning through the implementation of a policy framework.

University of Florida Has Digitized 1.7 Million Pages, over 100,000 in Last Month Alone

The University of Florida Digital Library Center has announced that it has digitized over 1.7 million pages, with about 100,000 pages being added in the last month alone. Their digitization statistics are available online. (Thanks to Open Access News.)

Read more about it "100,000 Pages a Month."

Interview with Microsoft's Pablo Fernicola about Article Authoring Add-in for Microsoft Office Word 2007

Jon Udell has posted an interview ("Word for Scientific Publishing") with Pablo Fernicola, a Microsoft Group Manager, about the Article Authoring Add-in for Microsoft Office Word 2007 (see my prior posting "Microsoft Developing Authoring Add-in for Microsoft Office Word 2007 with NLM DTD Support"). (Warning: there is a very annoying Silverlight download pop-up that obscures part of the post.)

Udell has also posted a screencast of Fernicola demonstrating the add-in ("Pablo Fernicola Demonstrates the Word Add-In for Scientific Authors").

JorumOpen, UK Repository for Creative Commons Licensed Educational Materials, Announced

JISC has announced JorumOpen, a national repository of open access educational materials under Creative Commons licenses.

Here's an excerpt from the announcement:

It was announced today that Jorum, the UK national repository for learning and teaching materials funded by JISC, is to offer open educational resources. This will make it easier for lecturers and teaching staff to share and re-use each other's teaching resources. JorumOpen—as it will be called—will also provide a showcase for UK universities and colleges on the international stage. . . .

Jorum is managed jointly by EDINA and Mimas, the two National Academic Data Centres funded by JISC at the Universities of Edinburgh and Manchester. During the first phase of Jorum's development, the focus has been on building a system that safeguards investment in digital learning resources and offers controlled access to licensed materials. The result is a service that supports access to over 2,500 learning resources for download for direct use in the classroom and within virtual learning environments (VLEs).

Through the development of JorumOpen, lecturers and teachers will be able to share materials under the Creative Commons licence framework: this makes sharing easier, granting users greater rights for use and re-use of online content and easier to understand. Importantly, it does not require prior registration. As a result availability is global as well as across UK universities and colleges. JorumOpen will run alongside a 'members only' facility, JorumEducationUK, that will support sharing of material just within the UK educational sector; this will be available only to registered users and contributors, as is currently the case.

OCLC Announces Digital Archive Service

OCLC has announced the availability of a Digital Archive service.

Here's an excerpt from the press release:

The service provides a secure storage environment for libraries to easily manage and monitor master files and digital originals. The importance of preserving master files grows as a library's digital collections grow. Libraries need a workflow for capturing and managing master files that finds a balance between the acquisition of both digitized and born-digital content while not outpacing a library's capability to manage these large files. . . .

The Digital Archive service is a specially designed system in a controlled operating environment dedicated to the ongoing managed storage of digital content. OCLC has developed specific systems processes and procedures for the service tuned to the management of data for the long term.

From the time content arrives, the Digital Archive systems begin inspecting it to ensure continuity. OCLC systems perform quality checks and record the results in a "health record" for each file. Automated systems revisit these quality checks periodically so libraries receive up-to-date reports on the health of the collection. OCLC provides monthly updated information for all collections on the personal archive report portal.

For users of CONTENTdm, OCLC's digital collection management software for libraries and other cultural heritage institutions, the Digital Archive service is an optional capability integrated with various workflows for building collections. Master files are secured for ingest to the Digital Archive service using the CONTENTdm Acquisition Station, the Connexion digital import capability and the Web Harvesting service.

For users of other content management systems, the Digital Archive service provides a low-overhead mechanism for safely storing master files.

Repositories Support Project Releases Briefing Papers: Open Archives Initiative-Protocol for Metadata Harvesting and Workflows

The Repositories Support Project has released two briefing papers: Open Archives Initiative-Protocol for Metadata Harvesting and Workflows (i.e., digital repository submission workflows). Both briefing papers provide succinct introductions to the topic at hand.