JPEG 2000—A Practical Digital Preservation Standard?

The Digital Preservation Coalition has published JPEG 2000—A Practical Digital Preservation Standard?.

Here's an excerpt from the "Executive Summary":

With JPEG 2000, an application can access and decode only as much of the compressed image as needed to perform the task at hand. This means a viewer, for example, can open a gigapixel image almost instantly by retrieving and decompressing a low resolution, display-sized image from the JPEG 2000 codestream.

JPEG 2000 also improves a user’s ability to interact with an image. The zoom, pan, and rotate operations that users increasingly expect in networked image systems are performed dynamically by accessing and decompressing just those parts of the JPEG 2000 codestream containing the compressed image data for the region of interest. The JPEG 2000 data can be either converted to JPEG and delivered for viewing with a standard image browser or delivered to a native JPEG 2000 viewer using the JPIP client-server protocol, developed to support the JPEG 2000 feature set.

Using a single JPEG 2000 master to satisfy user requests for dynamic viewing reduces storage costs and management overhead by eliminating the need to maintain multiple derivatives in a repository.

Beyond image access and distribution, JPEG 2000 is being used increasingly as a repository and archival image format. What is remarkable is that many repositories are storing “visually lossless” JPEG 2000 files: the compression is lossy and irreversible but the artefacts are not noticeable and do not interfere with the performance of applications. Compared to uncompressed TIFF, visually lossless JPEG 2000 compression can reduce the amount of storage by an order of magnitude or more.

JISC E-Journal Archive Registry Study

JISC has released "Scoping Study for a Registry of Electronic Journals That Indicates Where They Are Archived."

Here's an excerpt from the "Executive Summary":

The research and especially the interviews have confirmed the assumption behind the project that there is a need for more information, and more easily accessible information, about where e-journals are archived. However, what has also emerged strongly is that this issue cannot be considered in isolation, either from the overall context of relationships within the scholarly communication system, nor from other initiatives being undertaken to improve information flows e.g. in relation to the transfer of journal titles between publishers. . . .

Librarians felt that they were most likely to consult a registry in situations where they were considering taking out or renewing a subscription; considering cancellation of a print subscription in favour of an e-only subscription; contemplating relocating or discarding print holdings. The vast majority of potential users of such a registry would be library staff in university and national libraries, though organisations licensing e-journals on behalf of the library community would also be likely to use the registry to check compliance with licence conditions.

One of the key benefits of a registry is perceived to be the exposure of gaps in archive provision. This was identified by all types of stakeholder: librarians would want to be alerted to risks to any of their holdings; publishers who are making provision would like to see their efforts recognised and pressure placed on publishers who are not making satisfactory arrangements; archive organisations would also benefit as that effect fed through to more demand for their services.

The drawbacks to a registry as a solution to the acknowledged information gap were mainly seen as ones of practicality (keeping the information accurate and up to date), trust (especially whether a national solution is appropriate, and conversely whether an international solution is feasible) and sustainability of the funding model. Other solutions were suggested, mainly involving either WorldCat or ERM vendors such as Serials Solutions. The latter were also suggested as a complementary part of a solution involving, but not limited to, a registry.

Preservation in the Age of Large-Scale Digitization: A White Paper

The Council on Library and Information Resources has published Preservation in the Age of Large-Scale Digitization: A White Paper by Oya Rieger.

Here's an excerpt from the "Preface":

This paper examines large-scale initiatives to identify issues that will influence the availability and usability, over time, of the digital books that these projects create. As an introduction, the paper describes four key large-scale projects and their digitization strategies. Issues range from the quality of image capture to the commitment and viability of archiving institutions, as well as those institutions' willingness to collaborate. The paper also attempts to foresee the likely impacts of large-scale digitization on book collections. It offers a set of recommendations for rethinking a preservation strategy. It concludes with a plea for collaboration among cultural institutions. No single library can afford to undertake a project on the scale of Google Book Search; it can, however, collaborate with others to address the common challenges that such large projects pose.

Although this paper covers preservation administration, digital preservation, and digital imaging, it does not attempt to present a comprehensive discussion of any of these distinct specialty areas. Deliberately broad in scope, the paper is designed to be of interest to a wide range of stakeholders. These stakeholders include scholars; staff at institutions that are currently providing content for large-scale digital initiatives, are in a position to do so in the future, or are otherwise influenced by the outcomes of such projects; and leaders of foundations and government agencies that support, or have supported, large digitization projects. The paper recommends that Google and Microsoft, as well as other commercial leaders, also be brought into this conversation.

Full Scholarships Available for Online Graduate Digital Information Management Certificate Program

Full scholarships are available for students interested in obtaining a graduate certificate in Digital Information Management from the University of Arizona's School of Information Resources and Library Science. Recently, the Library of Congress honored Richard Pearce-Moses, one of the key figures in the development of the program, by naming him as a digital preservation pioneer.

Here's the announcement:

The University of Arizona School of Information Resources and Library Science is pleased to announce that a number of full scholarships are still available in the school's graduate certificate program in Digital Information Management. The program is scheduled to begin a new series of courses starting this summer. Prospects have until April 1, 2008 to apply for one of the openings and available financial aid.

DigIn, as the program is known, provides hands-on experience and focused instruction supporting careers in libraries and archives, cultural heritage institutions and digital collections, information repositories in government and the private sector and similar institutions. The certificate is comprised of six courses covering diverse topics including digital collections, applied technology, technology planning and leadership, policy and ethics, digital preservation and curation, and other subjects relevant to today's digital information environments.

For people just starting in the field or considering career changes, the DigIn certificate program offers an alternative path to graduate studies that helps prepare students for success in traditional graduate programs or the workplace. The certificate also provides a means for working professionals and those who already have advanced graduate degrees in the library and information sciences to broaden their knowledge and skills in today's rapidly evolving digital information landscape.

The program is delivered in a 100% virtual environment and has no residency requirements. Students may choose to complete the certificate in fifteen or twenty-seven months.

The certificate program has been developed in cooperation with the Arizona State Library, Archives and Public Records and the University of Arizona Office of Continuing Education and Academic Outreach. Major funding for program development comes from the federal government's Institute of Museum and Library Services (IMLS), which has also provided funding for a number of scholarships.

Additional details on the program including course descriptions, admissions requirements and application forms may be found on the program website at http://sir.arizona.edu/digin. Or, contact the UA School of Information Resources and Library Science by phone at 520-621-3565 or email at sirls@email.arizona.edu.

E-Print Preservation: SHERPA DP: Final Report of the SHERPA DP Project

JISC has released SHERPA DP: Final Report of the SHERPA DP Project.

Here's an excerpt from the "Executive Summary":

The SHERPA DP project (2005–2007) investigated the preservation of digital resources stored by institutional repositories participating in the SHERPA project. An emphasis was placed on the preservation of e-prints—research papers stored in an electronic format, with some support for other types of content, such as electronic theses and dissertations.

The project began with an investigation of the method that institutional repositories, as Content Providers, may interact with Service Providers. The resulting model, framed around the OAIS, established a Co-operating archive relationship, in which data and metadata is transferred into a preservation repository subsequent to it being made available. . . .

The Arts & Humanities Data Service produced a demonstrator of a Preservation Service, to investigate the operation of the preservation service and accepted responsibility for the preservation of the digital objects for a three-year period (two years of project funding, plus one year).

The most notable development of the Preservation Service demonstrator was the creation of a reusable service framework that allows the integration of a disparate collection of software tools and standards. The project adopted Fedora as the basis for the preservation repository and built a technical infrastructure necessary to harvest metadata, transfer data, and perform relevant preservation activities. Appropriate software tools and standards were selected, including JHOVE and DROID as software tools to validate data objects; METS as a packaging standard; and PREMIS as a basis on which to create preservation metadata. . . .

A number of requirements were identified that were essential for establishing a disaggregated service for preservation, most notably some method of interoperating with partner institutions and he establishment of appropriate preservation policies. . . . In its role as a Preservation Service, the AHDS developed a repository-independent framework to support the EPrints and DSpace-based repositories, using OAI-PMH as common method of connecting to partner institutions and extracting digital objects.

JISC Programme Synthesis Study: Supporting Digital Preservation and Asset Management in Institutions

JISC has published JISC Programme Synthesis Study: Supporting Digital Preservation and Asset Management in Institutions: A Review of the 4-04 Programme on Digital Preservation and Asset Management in Institutions for the JISC Information Environment: Part II: Programme Synthesis.. The report covers a number of projects, including LIFE, MANDATE, PARADIGM, PRESERV, and SHERPA DP.

Here's an excerpt from UKOLN News:

Written by Maureen Pennock, DCC researcher at UKOLN, the study provides a comprehensive and categorised overview of the outputs from the entire programme. Categories include training, costs and business models, life cycles, repositories, case studies, and assessment and surveys. Each category includes detailed information on project outputs and references a number of re-usable project-generated tools that range from software services to checklists and guidance.

Stewardship of Digital Research Data: A Framework of Principles and Guidelines

The Research Information Network (RIN) has published Stewardship of Digital Research Data: A Framework of Principles and Guidelines: Responsibilities of Research Institutions and Funders, Data Managers, Learned Societies and Publishers.

Here's an excerpt from the Web page describing the document:

Research data are an increasingly important and expensive output of the scholarly research process, across all disciplines. . . . But we shall realise the value of data only if we move beyond research policies, practices and support systems developed in a different era. We need new approaches to managing and providing access to research data.

In order to address these issues, the RIN established a group to produce a framework of key principles and guidelines, and we consulted on a draft document in 2007. The framework is founded on the fundamental policy objective that ideas and knowledge, including data, derived from publicly-funded research should be made available for public use, interrogation, and scrutiny, as widely, rapidly and effectively as practicable. . . .

The framework is structured around five broad principles which provide a guide to the development of policy and practice for a range of key players: universities, research institutions, libraries and other information providers, publishers, and research funders as well as researchers themselves. Each of these principles serves as a basis for a series of questions which serve a practical purpose by pointing to how the various players might address the challenges of effective data stewardship.

Draft Curation Lifecycle Model Released by the Digital Curation Centre

The Digital Curation Centre has released the Draft DCC Curation Lifecycle Model for comment by February 29, 2008. Further information about the model can be found in "Draft DCC Curation Lifecycle Model" from the latest issue of the International Journal of Digital Curation.

Here's an excerpt from the announcement:

The model provides a generic graphical high-level overview of the stages required for successful curation and preservation of digital material from initial conceptualisation. The Digital Curation Centre will shortly start to use this draft model to ensure that information, services and advisory material cover all areas of the lifecycle.

Library of Congress Office of Strategic Initiatives' Strategic Plan FY 2008-2013

The Library of Congress Office of Strategic Initiatives has published its Strategic Plan FY 2008-2013. This is a large color PDF file (about 34 MB).

Here are three brief excerpts:

By 2013 we expect to have in place:

  • Increased digital content holdings
  • Capability to produce and receive sustainable digital content from multiple sources
  • A stewardship network of collaborative partners
  • Recommendations on digital content information architecture, preservation and access
  • Recommendations on public policy for digital content preservation and access

Through 2013, we expect to:

  • Increase use and awareness of content and services by target user communities
  • Improve integrated search and discovery
  • Secure delivery of digital content and services
  • Enable multiple ways/methods of access to digital content and services
  • Facilitate integration of the Library’s primary sources into K–12 educational settings
  • and networks

Key outcomes associated with our technology infrastructure objective include:

  • Secured, available and scalable technology infrastructure
  • Defined Library of Congress technical infrastructure for shared tools and services
  • among networked entities
  • Defined future institution-wide architecture and support for a national networked
  • digital information architectural framework
  • Specialized institutional digital media repository services
  • Preserved authentic digital content over time.

Report on Library of Congress/San Diego Supercomputer Center Data Transfer and Storage Tests

The Library of Congress has published Data Center for Library of Congress Digital Holdings: A Pilot Project; Final Report.

Here an excerpt from the "Introduction":

Between May 2006 and October 2007, the Library of Congress (LC) and the San Diego Supercomputer Center (SDSC) conducted data-transfer and storage tests. At the heart of the project was the issue of trust, specifically how the LC could trust SDSC to reliably store several terabytes of the LC’s data. By what means could SDSC prove to the LC that the data was intact, preserved, and well-cared for? What tests could the LC devise, and what metrics could SDSC produce, to guarantee the integrity of their remotely stored data?

The two main objectives of the project were:

  • For SDSC to host LC content reliably and return it intact at the end of the project
  • For LC to be able to remotely access, process, analyze, and manage that content . . . .

Inspired by SDSC’s staggering technological potential, the LC had devised several scenarios for the data tests. But ultimately, as the project progressed, the LC opted to keep its goals simple: data transfer, storage, and file manipulation. In the end, both partners were happy with the project’s success. The project also produced lessons and unexpected results, some of which will have deep implications for all cultural institutions regarding transfer and storage of their digital assets.

Is the End of the Print Journal Near?: New ARL Report Examines This Issue

The Association of Research Libraries has published The E-only Tipping Point for Journals: What’s Ahead in the Print-to-Electronic Transition Zone.

Here's an excerpt from the "Executive Summary":

The role of the printed journal in the institutional marketplace faces a steep decline in the coming 5 to 10 years. Print journals will exist mainly to address specialized needs, users, or business opportunities. Financial imperatives will draw libraries first—and ultimately publishers also—toward a tipping point where it no longer makes sense to subscribe to or publish printed versions of most journals.

Publishers will be driven to rationalize their investments in declining print revenue streams and to finance investments in e-publishing infrastructure and emerging opportunities. Some will be faster to do so, such as those already straining from the cost burden. Others will be slower, such as those with a self-supporting base of individual subscribers or significant advertising revenue from print.

A new focus will emerge on productivity in scholarly communication. Experiments will explore new business models and new ways of conducting and facilitating research. Along the way, vexing issues such as those surrounding assurance of long-term access to the scholarly record will continue to be sorted out and perhaps even solved.

Digital Preservation Report from RAND Europe

RAND Europe has released Addressing the Uncertain Future of Preserving the Past. Towards a Robust Strategy for Digital Archiving and Preservation. The report "examines key determinants of the sustainable digital preservation of scholarly records, with specific reference to developing a robust approach to the archiving of such records at the Koninklijke Bibliotheek in the Netherlands."

National Archives in the UK Releases New Versions of PRONOM and DROID

The National Archives in the UK has released new versions of PRONOM, an online registry of file formats, software, and other technical information used for digital preservation purposes, and DROID (Digital Record Object Identification), software that is used to identify file formats in batch mode. DROID requires the Java Runtime Environment (version 1.5.0 or later), and it runs under Windows (Windows 2000, XP, and Vista), OS X, and Linux.

DigitalPreservationEurope Publishes Report on Copyright and Privacy Issues for Cooperating Repositories

DigitalPreservationEurope has published PO3.4: Report on the Legal Framework on Repository Infrastructure Impacting on Cooperation Across Member States.

Here's excerpt from the "Introduction."

The focus of this paper is the legal framework for the management of content of cooperating repositories. The focus will be on the regulation of copyright and protection of personal data. That copyright is important when managing data repositories is common knowledge. However, there is an increasing tendency among authors not only to deposit their published scientific work, scientific articles, dissertations or books, but also the underlying data. In addition to this ordinary publicly available sources like internet web pages contain personal data, often of a sensitive nature. Due to this emergent trend repositories will have to comply with the rules governing the use and protection of personal data, especially in the medical and social sciences.

The scenario is the following:

  • National repositories acquire material from different sources and in different formats.
  • The repositories cooperate with repositories in other countries in the preservation of data.
  • There is some degree of specialisation, some repositories specialise on preserving certain formats and other repositories on the preservation of other formats.

This paper describes the legal framework regulating the two decisive actions which have to take place if this scenario is to become a reality:

  1. The reproduction of data
  2. The transfer of data to other repositories

Other copyright issues like the rules concerning communication with the public and the protection of databases will also be touched upon.

International Coalition of Library Consortia Protests AAAS Decision to Drop JSTOR

The International Coalition of Library Consortia, which represents 72 consortia, has issued a statement regarding the American Association for the Advancement of Science decision to sever its relationship to JSTOR.

Here's an excerpt from the statement:

The ICOLC strongly objects to the recent decision by the American Association for the Advancement of Science (AAAS) to discontinue its participation in JSTOR, including withholding future issues of its premier publication, Science, from the JSTOR archive and prohibiting JSTOR from making issues of Science currently held in the archive available to new JSTOR participants.

JSTOR has been a singular success in meeting the needs of students, scholars, librarians, and publishers. JSTOR offers a robust platform for cross-disciplinary discovery and integration of content that extends the multi-disciplinary reach of Science to students and faculty, including those in non-scientific disciplines. In addition, JSTOR offers to publishers a moving wall policy that protects their ability to obtain current subscription revenue to support ongoing publication.

Science is an outstanding source of high-quality, vetted information covering all areas of science, the inclusion of which enhances the value, breadth, and quality of the JSTOR archive. The decision to discontinue participation in JSTOR is in conflict with AAAS' mission, as a non-profit, membership-based organization, of advancing science and serving society. Withholding future issues of Science from JSTOR, and prohibiting JSTOR from making previously archived Science content available to future JSTOR participants, is an action which diminishes the value and contribution of both AAAS and JSTOR to the international community of researchers, the academy, and society.

Boston Public Library/Open Content Alliance Contract Made Public

Boston Public Library has made public its digitization contract with the Open Content Alliance.

Some of the most interesting provisions include the intent of the Internet Archive to provide perpetual free and open access to the works, the digitization cost arrangements (BPL pays for transport and provides bibliographic metadata, the Internet Archive pays for digitization-related costs), the specification of file formats (e.g., JPEG 2000, color PDF, and various XML files), the provision of digital copies to BPL (copies are available immediately after digitization for BPL to download via FTP or HTTP within 3 months), and use of copies (any use by either party as long as provenance metadata and/or bookplate data is not removed).

A Study of Curation and Preservation Issues in the eCrystals Data Repository and Proposed Federation

JISC's eBank UK project, which is now in phase three, has released A Study of Curation and Preservation Issues in the eCrystals Data Repository and Proposed Federation, which addresses key issues related to the establishment of the eCrystals Federation.

Here's an excerpt from "eBank Phase 3: Transitioning to the eCrystals Federation" that explains the overall project:

This project will progress the establishment of a global Federation of data repositories for crystallography by performing a scoping study into the feasibility of constructing a network of data repositories: the eCrystals Federation. The Federation approach is presented as an innovative domain model to promote Open Access to data more widely and to facilitate take-up.

It builds on the work of the eBank project, and has links to Repository for the Laboratory (R4L), SPECTRa and SMART Tea projects in chemistry. The Federation will contribute to the development of a digital repository e-infrastructure for research and will inform the Repository Support Project. . . .

In Phase 3, partners will assess organisational issues and promote advocacy, examine interoperability associated with research workflow and data deposit, harmonise the metadata application profiles from repositories operating on different platforms (EPrints, DSpace & ReciprocalNet), investigate aggregation issues arising from harvesting metadata from repositories situated within the information environments developed in other countries (EU, USA & Australia) and scope the issues of the Federation of institutional archives interoperating with an international subject archive (IUCr).

Digital Lives Research Project

Funded by the Arts and Humanities Research Council, the Digital Lives project is investigating personal digital collections.

Here's an excerpt from the About Digital Lives page:

Digital Lives is a major research project focusing on personal digital collections and their relationship with research repositories. It brings together expert curators and practitioners in digital preservation, digital manuscripts, literary collections, web-archiving, history of science, and oral history from within the British Library (one of the world’s leading research libraries) with researchers in the School of Library, Archive and Information Studies at University College London, and The Centre for Information Technology and Law at the University of Bristol.

The Project's proposal provides detailed information about it's research methods.