Boston Public Library/Open Content Alliance Contract Made Public

Boston Public Library has made public its digitization contract with the Open Content Alliance.

Some of the most interesting provisions include the intent of the Internet Archive to provide perpetual free and open access to the works, the digitization cost arrangements (BPL pays for transport and provides bibliographic metadata, the Internet Archive pays for digitization-related costs), the specification of file formats (e.g., JPEG 2000, color PDF, and various XML files), the provision of digital copies to BPL (copies are available immediately after digitization for BPL to download via FTP or HTTP within 3 months), and use of copies (any use by either party as long as provenance metadata and/or bookplate data is not removed).

Yale Will Work with Microsoft to Digitize 100,000 Books

The Yale University Library and Microsoft will work together to digitize 100,000 English-language out-of-copyright books, which will be made available via Microsoft’s Live Search Books.

Here’s an excerpt from the press release:

The Library and Microsoft have selected Kirtas Technologies to carry out the process based on their proven excellence and state-of-the art equipment. The Library has successfully worked with Kirtas previously, and the company will establish a digitization center in the New Haven area. . . .

The project will maintain rigorous standards established by the Yale Library and Microsoft for the quality and usability of the digital content, and for the safe and careful handling of the physical books. Yale and Microsoft will work together to identify which of the approximately 13 million volumes held by Yale’s 22 libraries will be digitized. Books selected for digitization will remain available for use by students and researchers in their physical form. Digital copies of the books will also be preserved by the Yale Library for use in future academic initiatives and in collaborative scholarly ventures.

Brewster Kahle on Libraries Going Open

Brewster Kahle's "Libraries Going Open" document provides some details on where the Internet Archive and the Open Content Alliance are going with projects involving mass digitization of microfilm, mass digitization of journals, ILL of scanned out-of-print books, scanning books on demand, and other areas.

DLF and if:book Ponder Mass Digitization Issues

The Digital Library Federation and if:book are seeking comments on a series of questions about mass digitization issues that they will raise in invited brainstorming sessions as part of a project they are calling "The Really Modern Library."

Here's a suggestion: use CommentPress or a wiki to further refine ideas as the project evolves.

Source: Vershbow, Ben. "The Really Modern Library." if:book, 8 October 2007.

Digital Assets Factory Version 2.0 Released Under GPL

Bibliotheca Alexandrina has released version 2.0 of its open-source Digital Assets Factory software.

Here’s an excerpt from the project home page:

DAF v2.0 provides all the necessary tools required to manage the whole process of a digitization workflow, including its various Phases, User management, file movement and archiving. It provides the flexibility to manage multiple simultaneous projects with a diversity of materials, covering books, journals, newspapers, manuscripts, unbound materials, audio, video, and slides.

Advancing Knowledge: The IMLS/NEH Digital Partnership Grants Awarded

The Institute of Museum and Library Services and the National Endowment for the Humanities have announced the award of three grants under their Advancing Knowledge: The IMLS/NEH Digital Partnership program.

Here's an excerpt from the press release:

  • $347,520 to Historical Society of Pennsylvania for its project: PhilaPlace: A Neighborhood History and Culture Project. The Historical Society of Pennsylvania in collaboration with the Philadelphia Department of Records and the University of Pennsylvania’s School of Design will develop PhilaPlace, an interactive Web resource chronicling the history, culture, and architecture of Philadelphia's neighborhoods. Complete with maps, historical records, photographs, and digital models of select neighborhoods, PhilaPlace will serve as a prototype website for communities wishing to digitize their cultural heritage.
  • $349,939 to Tufts University, Medford for its project: Scalable Named Entity Identification in Classical Studies. The Perseus Project and the Collections and Archives of Tufts University will construct a testing database of scholarly and cultural documents about the ancient world. In the second part of the project, Tufts will develop a digital reference tool allowing researchers and librarians to conduct context-based “smart searches” of un-indexed words from existing databases in the Tufts Digital Library. By developing this database, and allowing for much shorter and complete context-based searches, Tufts hopes to lead scholars and students to the next generation of digital tools.
  • $349,996 to University of California, Berkeley for its project: Context and Relationships: Ireland and Irish Studies. The University of California, Berkeley in collaboration with the Queen’s University, Belfast, will develop a digital database of Irish studies materials to test three open-source digital tools. The Context Finder, Context Builder, and Context Provider tools will be aimed at establishing scholarly context. Using a common word search feature in digital collections, these tools will allow users to access the ideas that are associated with the words, thereby creating context through maps, primary texts and secondary works.

National Archives Seeks Comments on Draft Digitization Plan

The National Archives and Records Administration is soliciting comments on its draft Plan for Digitizing Archival Materials for Public Access, 2007-2016.

Here's an excerpt from the press release:

The document is divided into several sections. The first section, INTRODUCTION AND BACKGROUND, provides information on NARA's mission, our archival holdings, and our past experience with digitization, to give you the context of the draft Plan for Digitizing Archival Materials for Public Access, 2007-2016. Section II, PLAN OVERVIEW, describes our planned goals, activities, and priorities for digitization. Sections III through V provide listings of current digitization activities being carried out by NARA and through partnerships to digitize and make available archival materials. Appendix A contains draft operating principles that we are using as we enter into partnerships and Appendix B references relevant NARA guidance that applies to handling of archival materials being digitized and the technical guidelines for image creation and description.

Google Scholar Digitization Program

According to the article "Changes at Google Scholar: A Conversation with Anurag Acharya," Google Scholar has begun a small-scale, targeted journal digitization effort.

Here's a quote from the article:

Representing another effort to reach currently inaccessible content, Google Scholar now has its own digitization program. “It’s a small program,” said Acharya. “We mainly look for journals that would otherwise never get digitized. Under our proposal, we will digitize and host journal articles with the provision that they must be openly reachable in collaboration with publishers, fully downloadable, and fully readable. Once you get out of the U.S. and Western European space into the rest of the world, the opportunities to get and digitize research are very limited. They are often grateful for the help. It gives us the opportunity to get that country’s material or make that scholarly society more visible.”

Source: Quint, Barbara. "Changes at Google Scholar: A Conversation with Anurag Acharya." NewsBreaks 27 August 2007.

Cornell Joins Google Books Library Project

The Cornell University Library has joined the Google Books Library Project.

Here's an excerpt from the press release:

Google will digitize up to 500,000 works from Cornell University Library and make them available online using Google Book Search. As a result, materials from the library’s exceptional collections will be easily accessible to students, scholars and people worldwide, supporting the library’s long-standing commitment to make its collections broadly available.

“Research libraries today are integral partners in the academic enterprise through their support of research, teaching and learning. They also serve a public good by enhancing access to the works of the world's best minds,” said Interim University Librarian Anne R. Kenney. “As a major research library, Cornell University Library is pleased to join its peer institutions in this partnership with Google. The outcome of this relationship is a significant reduction in the time and effort associated with providing scholarly full-text resources online.”

Materials from Mann Library, one of 20 member libraries that comprise Cornell University Library, will be digitized as part of the agreement. Mann’s collections include some of the following subject areas: biological sciences, natural resources, plant, animal and environmental sciences, applied economics, management and public policy, human development, textiles and apparel, nutrition and food science.. . .

Cornell is the 27th institution to join the Google Book Search Library Project, which digitizes books from major libraries and makes it possible for Internet users to search their collections online. Over the next six years, Cornell will provide Google with public domain and copyrighted holdings from its collections. If a work has no copyright restrictions, the full text will be available for online viewing. For books protected by copyright, users will just get the basic background (such as the book’s title and the author’s name), at most a few lines of text related to their search and information about where they can buy or borrow a book. Cornell University Library will work with Google to choose materials that complement the contributions of the project’s other partners. In addition to making the materials available through its online search service, Google will also provide Cornell with a digital copy of all the materials scanned, which will eventually be incorporated into the university’s own digital library.

Australian Framework and Action Plan for Digital Heritage Collections

The Collections Council of Australia Ltd. has released Australian Framework and Action Plan for Digital Heritage Collections, Version 0.C3 for comment.

Here's an excerpt from the document:

This is the Collections Council of Australia's plan to prepare an Australian framework for digital heritage collections. It brings together information shared by people working in archives, galleries, libraries and museums at a Summit on Digital Collections held in 2006. It proposes an Action Plan to address issues shared by the Australian collections sector in relation to current and future management of digital heritage collections.

British Library Licenses Turning the Pages Toolkit

The British Library has announced that it is now licensing its Turning the Pages Toolkit to libraries and museums. You can see the software in action at their Turning the Pages Web site.

Here’s an excerpt from the press release:

From today, libraries around the World will be able to license the award-winning Turning the Pages software used by the British Library to bring some of the world’s most rare and valuable books online.

Since its launch in 2004, Turning the Pages has grown to become one of the most popular resources at the British Library, allowing the Library to bring iconic treasures such as the Lindisfarne Gospels, Leonardo da Vinci’s Notebooks and Mercator’s Atlas of Europe online for everyone to see. With the launch of Turning the Pages 2.0, and a completely re-built software platform developed by Armadillo Systems, May 2007 also sees launch of a new "toolkit" that allows other libraries and museums around the World to create their own Turning the Pages gallery. . . .

Michael Stocking, Managing Director of Armadillo Systems and developer of the Turning the Pages software said "As well as making it easy for our customers to create their own collections, we also wanted to enhance the Turning the Pages experience. We have migrated the software to a new platform that places the book in a 3-D environment so, as well as being able to examine the book as a piece of text, users can now also examine it as an object. They can now look at the book from different angles, zoom in and even look at two books, side-by-side."

CLIR Receives Mellon Grant to Study Mass Digitization

According to a O’Reilly Radar posting, the Council on Library and Information Resources has been awarded a grant from the Mellon Foundation to study mass digitization efforts.

Here’s an excerpt from the posting that describes the grant’s objectives:

  1. Assess selected large scale digitization programs by exploring their efficacy and utility for conducting scholarship, in multiple fields or disciplines (humanities, sciences, etc.).
  2. Write and issue a report with findings and recommendations for improving the design of mass digitization projects.
  3. Create a Collegium that can serve in the long-term as an advisory group to mass digitization efforts, helping to assure and obtain the highest possible data quality and utility.
  4. Convene a series of meetings amongst scholars, libraries, publishers, and digitizing organizations to discuss ways of achieving these quality and design improvements.

The University of Maine and Two Public Libraries Adopt Emory’s Digitization Plan

Library Journal Academic Newswire reports that the University of Maine, the Toronto Public Library, and the Cincinnati Public Library will follow Emory University’s lead and digitize public domain works utilizing Kirtas scanners with print-on-demand copies being made available via BookSurge. (Also see the press release: "BookSurge, an Amazon Group, and Kirtas Collaborate to Preserve and Distribute Historic Archival Books.")

Source: "University of Maine, plus Toronto and Cincinnati Public Libraries Join Emory in Scan Alternative." Library Journal Academic Newswire, 21 June 2007.

CIC’s Digitization Contract with Google

Library Journal Academic Newswire has published a must-read article ("Questions Emerge as Terms of the CIC/Google Deal Become Public") about the Committee on Institutional Cooperation’s Google Book Search Library Project contract.

The article includes quotes from Peter Brantley, Digital Library Federation Executive Director, from his "Monetizing Libraries" posting about the contract (another must-read piece).

Here’s an excerpt from Brantley’s posting:

In other words—pretty much, unless Google ceases business operations, or there is a legal ruling or agreement with publishers that expressly permits these institutions (excepting Michigan and Wisconsin which have contracts of precedence) to receive digitized copies of In-Copyright material, it will be held in escrow until such time as it becomes public domain.

That could be a long wait. . . .

In an article early this year in The New Yorker, "Google’s Moon Shot," Jeffrey Toobin discusses possible outcomes of the antagonism this project has generated between Google and publishers. Paramount among them, in his mind, is a settlement. . . .

A settlement between Google and publishers would create a barrier to entry in part because the current litigation would not be resolved through court decision; any new entrant would be faced with the unresolved legal issues and required to re-enter the settlement process on their own terms. That, beyond the costs of mass digitization itself, is likely to deter almost any other actor in the market.

Google Library Project Adds Committee on Institutional Cooperation (CIC)

The Google Book Search Library Project has an important new participant—the Committee on Institutional Cooperation (CIC). The CIC members are the University of Chicago, the University of Illinois, Indiana University, the University of Iowa, the University of Michigan, Michigan State University, the University of Minnesota, Northwestern University, Ohio State University, Pennsylvania State University, Purdue University, and the University of Wisconsin-Madison. As many as 10 million volumes will be digitized from the collections of these major research libraries.

Here’s an excerpt from the CIC press release:

This partnership between our 12 member universities and Google is unprecedented. What makes this work so exciting is that we will literally open the pages of millions of books that have been assembled on our library shelves over more than a century. In literally seconds, we’ll be able browse across the content of thousands of volumes, searching for words or phrases, and making links across those texts that would have taken weeks or months or years of dedicated and scrupulous analysis. It is an extraordinary effort, blending the efforts and aspirations of librarians, university administrators, and scholars from across 12 world-class research universities. And our corporate partner possesses unparalleled expertise in creating and opening the digital world to coherent and comprehensive searching.

The effort is not entirely without controversy—no great undertaking ever is. But our universities believe strongly in the power of information to change the world, and in preserving, protecting and extending access to information. We have carefully weighed and considered the intellectual property issues and believe that our effort is firmly within the guidelines of current copyright law, while providing some flexibility as those laws are tested in the new digital environment in the coming years.