Publisher Author Agreements

According to today's SHERPA/RoMEO statistics, 36% of the 308 included publishers are green ("can archive pre-print and post-print"), 24% are blue ("can archive post-print (i.e. final draft post-refereeing)"), 11% are yellow ("can archive pre-print (i.e. pre-refereeing)"), and 28% are white ("archiving not formally supported"). Looked at another way, 72% of the publishers permit some form of self-archiving.

These are certainly encouraging statistics, and publishers who permit any form of self-archiving should be applauded; however, leaving aside Creative Commons licenses and author agreements that have been crafted by SPARC and others to promote rights retention, publishers recently liberalized author agreements still raise issues that librarians and scholars should be aware of.

Looking deeper, there are publisher variations in terms of where e-prints can be self-archived. Typically, this might be some combination of the author's Website, institutional repository or Website, funding agency's server, or disciplinary archive. Some agreements allow deposit on any noncommercial or open access server. Restricting deposit to open access or noncommercial servers is perfectly legitimate in my view; more specific restrictions are, well, too restrictive. The problem arises when the agreement limits the author's deposit options to ones he or she doesn't have, such as only allowing deposit in an institutional repository when the author's institution doesn't have one or only allowing posting on an author's Website when the author doesn't have one.

Another issue is publisher requirements for authors to remove e-prints on publication, to modify e-prints after publication to reflect citation and publisher contact information, to replace e-prints with published versions, or to create their own versions of postprints. Low deposit rates in institutional repositories without institutional mandates suggest that anything that involves extra effort by authors is a deterrent to deposit. The above kinds of publisher requirements are likely to have equally low rates on compliance, resulting in deposited e-prints that do not conform to author agreements. To be effective, such requirements would have to be policed by publishers or digital repositories. Otherwise, they are meaningless and are best deleted from author agreements.

A final issue is retrospective deposit. We can think of the journal literature as an inverted pyramid, with the broad top being currently published articles and the bottom being the first published journal articles. The papers published since the emergence of author agreements that permit self-archiving are a significant resource; however, much of the literature precedes such agreements. The vast majority of these articles are under standard copyright transfer agreements, with publishers holding all rights. Consequently, it is very important that publishers clarify whether their relatively new self-archiving policies can be applied retroactively. Elsevier has done so:

When Elsevier changes its policies to enable greater academic use of journal materials (such as the changes several years ago in our web-posting policies) or to clarify the rights retained by journal authors, Elsevier is prepared to extend those rights retroactively with respect to articles published in journal issues produced prior to the policy change.

Elsevier is pleased to confirm that, unless explicitly noted to the contrary, all policies apply retrospectively to previously published journal content. If, after reviewing the material noted above, you have any questions about such rights, please contact Global Rights.

Unfortunately, many publishers have not clarified this issue. Under these conditions, whether authors can deposit preprints or author-created postprints hinges on whether these works are viewed as being different works from the publisher version, and, hence, owned by the authors. Although some open access advocates believe this to be the case, to my knowledge this has never been decided in a court of law. Michael Carroll, who is a professor at the Villanova University School of Law and a member of the Board of the Creative Commons, has said in an analysis of whether authors can put preprints of articles published using standard author agreements under Creative Commons licenses:

Although technically distinct, the copyrights in the pre-print and the post-print overlap. The important point to understand is that copyright grants the owner the right to control exact duplicates and versions that are "substantially similar" to the copyrighted work. (This is under U.S. law, but most other jurisdictions similarly define the scope of copyright).

A pre-print will normally be substantially similar to the post-print. Therefore, when an author transfers the exclusive rights in the work to a publisher, the author precludes herself from making copies or distributing copies of any substantially similar versions of the work as well.

Much progress has been made in the area of author agreements, but authors must still pay careful attention to the details of agreements, which vary considerably by publisher. The SHERPA/RoMEO—Publisher Copyright Policies & Self-Archiving database is a very useful and important tool and users should actively participate in refining this database; however, authors are well advised not to stop at the summary information presented here and to go to the agreement itself (if available). It would be very helpful if a set of standard author agreements that covered the major variations could be developed and put into use by the publishing industry.

Michael Keller Appointed CLIR Senior Presidential Fellow

The Council on Library and Information Resources (CLIR) has announced the appointment of Michael Keller, Stanford’s University Librarian, as CLIR Senior Presidential Fellow. Keller is also Director of Academic Information Resources, founder and publisher of HighWire Press, and publisher of the Stanford University Press.

Here’s an excerpt from the press release:

During the two-year appointment, which begins August 1, Mr. Keller will undertake a series of studies and reports for CLIR publication. His research will include examining the recommendations of recent cyberinfrastructure reports and exploring how our communities can respond to the complex environment these reports envision, including the role and function of institutional repositories, digital archives, and digital libraries. He will also compose white papers that elucidate new and emerging research methodologies, new models of scholarly publishing, the role of supercomputer
centers in the evolving concept of cyberinfrastructure, and topics specific to rethinking aspects of libraries and academic life. During his tenure as fellow, he will continue to work from Stanford.

Update on the DSpace Foundation

Michele Kimpton, Executive Director of the DSpace Foundation, gave gave a talk about the foundation at the DSpace UK & Ireland User Group meeting in early July.

Her PowerPoint presentation is now available.

Source: Lewis, Stuart. "Presentations from Recent DSpace UK & Ireland User Group Meeting," Unilever Centre for Molecular Informatics, Cambridge—Jim Downing, 11 July 2007.

An Ecological Approach to Repository and Service Interactions

UKOLN and JISC CETIS have released An Ecological Approach to Repository and Service Interactions, Draft Version 0.9 for comment.

Here’s an excerpt from the "Not the Executive Summary" section:

This work began with the need to express something of how and why repositories and services interact. As a community we have well understood technical models and architectures that provide mechanisms for interoperability. The actual interactions that occur, however, are not widely understood and knowledge about them is not often shared. This is in part because we tend to share in the abstract through architectures and use cases, articulating interactions or connections requires an engagement with specific details. . . .

Ecology is the study of systems that are complex, dynamic, and full of interacting entities and processes. Although the nature of these interactions and processes may be highly detailed, a higher level view of them is accessible and intuitive. We think that ecology and the ecosystems it studies may offer a useful analogy to inform the task of understanding and articulating the interactions between users, repositories, and services and the information environments in which they take place. This report outlines some concepts from ecology that may be useful and suggests some definitions for a common conversation about the use of this metaphor.

We hope that this report suggests an additional way to conceptualise and analyse interactions and provide a common vocabulary for an ecological approach. It should as a minimum provoke and support some useful discussions about networks and communities.

Curation of Scientific Data: Challenges for Institutions and Their Repositories Podcast

A podcast of Chris Rusbridge’s "Curation of Scientific Data: Challenges for Institutions and their Repositories" presentation at The Adaptable Repository conference is now available. Rusbridge is Director of the Digital Curation Centre in the UK.

The PowerPoint for the presentation is also available.

DSpace How-To Guide

Tim Donohue, Scott Phillips, and Dorothea Salo have published DSpace How-To Guide: Tips and Tricks for Managing Common DSpace Chores (Now Serving DSpace 1.4.2 and Manakin 1.1).

This 55-page booklet, which is under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 License, will be a welcome addition to the virtual bookshelves of institutional repository managers struggling with the mysteries of DSpace.

DRAMA Releases Fedora Front-End Beta for Authentication/Full-Text Search

DRAMA (Digital Repository Authorization Middleware Architecture) has released Fiddler, a beta version of its mura Fedora front-end that provides access control, authentication, full-text searching and a variety of other functions. DRAMA is a sub-project of RAMP (Research Activityflow and Middleware Priorities Project).

Here’s an excerpt from the news item that describes Fiddler’s features:

  • Hierarchical access control enforcement: Policies can be applied at the collection level, object level or datastream level. . . .
  • Improved access control interface: One can now view existing access control of a particular user or group for a given datastream, object or collection. . . .
  • User-centric GUI: mura only presents users with operations for which they have permissions.
  • XForms Metadata Input: We employ an XForms engine (Orbeon) for metadata input. XForms allow better user interaction, validation and supports any XML-based metadata schemas (such as MARC or MODS).
  • LDAP Filter for Fedora: The current Fedora LDAP filter (in version 2.2) does not authenticate properly, so we have developed a new LDAP filter to fix this problem.
  • Local authentication for DAR and ASM: In addition to Shibboleth authentication, the DAR and ASM can be configured to use a local authentication source (eg. via a local LDAP).
  • Generic XACML Vocabulary: XACML policies are now expressed in a generic vocabulary rather than Fedora specific ones. . . .
  • XACML Optimization: We have optimized of the evaluation engine by employing a cache with user configurable time-to-live. We have also greatly reduced the time for policies matching with DB XML, through the use of bind parameters in our queries.
  • Flexible mapping of Fedora actions to new Apache Axis handlers: Axis is the SOAP engine that Fedora employs to provide its web services. The new flexibility allows new handlers to be easily plugged into Fedora to support new features that follow the same Interceptor pattern as our authorization framework.
  • Version control: mura now supports version control.
  • Full-text search: We enabled full-text search by incorporating Fedoragsearch package.

Dealing with Data: Roles, Rights, Responsibilities and Relationships

JISC has released its Dealing with Data: Roles, Rights, Responsibilities and Relationships: Consultancy Report, which was written as part of its Digital Repositories Programme’s Data Cluster Consultancy.

Here’s an excerpt from the Executive Summary:

This Report explores the roles, rights, responsibilities and relationships of institutions, data centres and other key stakeholders who work with data. It concentrates primarily on the UK scene with some reference to other relevant experience and opinion, and is framed as "a snapshot" of a relatively fast-moving field. . . .

The Report is largely based on two methodological approaches: a consultation workshop and a number of semi-structured interviews with stakeholder representatives.

It is set within the context of the burgeoning "data deluge" emanating from e-Science applications, increasing momentum behind open access policy drivers for data, and developments to define requirements for a co-ordinated e-infrastructure for the UK. The diversity and complexity of data are acknowledged, and developing typologies are referenced.

Council of Australian University Librarians ETD Survey Report

The Council of Australian University Librarians has released Australasian Digital Theses Program: Membership Survey 2006.

Here’s an excerpt from the "Key Findings" section:

1. The average percentage of records for digital theses added to ADT is 95% when digital submission is mandatory and 17% when it is not mandatory. . . .

2. 59% of respondents will have mandatory digital submission in place in 2007.

3. With this level of mandatory submission it is predicted that 60% of all theses produced in Australia and New Zealand in 2007 will have a digital copy recorded in ADT. . . .

5. The overwhelming majority of respondents offer a mediated submission service, either only having a mediated service or offering both mediated and self-submission services. When mediated and self-submission are both available, the percentage self-submitted is polarised with some achieving over a 75% self-submission rate.

6. Over half the respondents have a repository already and most are using it to manage digital theses.

7. 87% will have a repository by the end of this year, and the rest are in the initial planning stage.

Report on Chemistry Teaching/Research Data and Institutional Repositories

The JISC-funded SPECTRa project has released Project SPECTRa (Submission, Preservation and Exposure of Chemistry Teaching and Research Data): JISC Final Report, March 2007.

Here’s an excerpt from the Executive Summary:

Project SPECTRa’s principal aim was to facilitate the high-volume ingest and subsequent reuse of experimental data via institutional repositories, using the DSpace platform, by developing Open Source software tools which could easily be incorporated within chemists’ workflows. It focussed on three distinct areas of chemistry research—synthetic organic chemistry, crystallography and computational chemistry.

SPECTRa was funded by JISC’s Digital Repositories Programme as a joint project between the libraries and chemistry departments of the University of Cambridge and Imperial College London, in collaboration with the eBank UK project. . . .

Surveys of chemists at Imperial and Cambridge investigated their current use of computers and the Internet and identified specific data needs. The survey’s main conclusions were:

  • Much data is not stored electronically (e.g. lab books, paper copies of spectra)
  • A complex list of data file formats (particularly proprietary binary formats) being used
  • A significant ignorance of digital repositories
  • A requirement for restricted access to deposited experimental data

Distributable software tool development using Open Source code was undertaken to facilitate deposition into a repository, guided by interviews with key researchers. The project has provided tools which allow for the preservation aspects of data reuse. All legacy chemical file formats are converted to the appropriate Chemical Markup Language scheme to enable automatic data validation, metadata creation and long-term preservation needs. . . .

The deposition process adopted the concept of an "embargo repository" allowing unpublished or commercially sensitive material, identified through metadata, to be retained in a closed access environment until the data owner approved its release. . . .

Among the project’s findings were the following:

  • it has integrated the need for long-term management of experimental chemistry data with the maturing technology and organisational capability of digital repositories;
  • scientific data repositories are more complex to build and maintain than are those designed primarily for text-based materials;
  • the specific needs of individual scientific disciplines are best met by discipline-specific tools, though this is a resource-intensive process;
  • institutional repository managers need to understand the working practices of researchers in order to develop repository services that meet their requirements;
  • IPR issues relating to the ownership and reuse of scientific data are complex, and would benefit from authoritative guidance based on UK and EU law.

Repositories as Platforms for Researchers e-Portfolios Podcast

The Australian Partnership for Sustainable Repositories (APSR) has made a podcast of Susan Gibbons’s "Repositories as Platforms for Researchers e-Portfolios" presentation at the Adaptable Repository workshop at the University of Sydney.

Powerpoints from the workshop’s presentations are also available.

The REMAP Project: Record Management and Preservation in Digital Repositories

The REMAP Project at the University of Hull has been funded by JISC investigate how record management and digital preservation functions can be best supported in digital repositories. It utilizes the Fedora system.

Here’s an except from the Project Aims page (I have added the links in this excerpt):

The REMAP project has the following aims:

  • To develop Records Management and Digital Preservation (RMDP) workflow(s) in order to understand how a digital repository can support these activities
  • To embed digital repository interaction within working practices for RMDP purposes
  • To further develop the use of a WSBPEL orchestration tool to work with external Web services, including the PRONOM Web services, to provide appropriate metadata and file information for RMDP
  • To develop and test a notification layer that can interact with the orchestration tool and allow RSS
    syndication to individuals alerting them to RMDP tasks
  • To develop and test an intermediate persistence layer to underpin the notification layer and interact
    with the WSBPEL orchestration tool to allow orchestrated workflows to take place over time
  • To test and validate the use of the enhanced WSBPEL tool with institutional staff involved in RMDP activities

SWORD (Simple Web-service Offering Repository Deposit) Project

Led by UKOLN, The JISC SWORD (Simple Web-service Offering Repository Deposit) Project is developing "a prototype ‘smart deposit’ tool" to "facilitate easier and more effective population of repositories."

Here’s an excerpt from the project plan:

The effective and efficient population of repositories is a key concern for the repositories community. Deposit is a crucial step in the repository workflow; without it a repository has no content and can fulfill no further function. Currently most repositories exist in a fairly linear context, accepting deposits from a single interface and putting them into a single repository. Further deployment of repositories, encouraged by JISC and other funders, means that this situation is changing and we are beginning to see an increasingly complex and dynamic ecology of interactions between repositories and other services and systems. By and large developers are not creating repository systems and software from scratch, rather they are considering how repositories interface with other applications within institutions and the wider information landscape. A single repository, or multiple repositories, might interact with other components, such as VLEs, authoring tools, packaging tools, name authority services, classification services and research systems. In terms of content, resources may be deposited in a repository by both human and software agents, e.g. packaging tools that push content into repositories or a drag-and-drop desktop tool. The type of resource being deposited will also influence the choice of deposit mechanism. If the resources are complex packaged objects then a web service will need to support the ingest of multiple packaging standards.

There is currently no standard mechanism for accepting content into repositories, yet there already exists a stable and widely implemented service for harvesting metadata from repositories (OAI-PMH—Open Archives Initiative Protocol for Metadata Harvesting). This project will implement a similarly open protocol or specification for deposit. By taking a similar approach, the project and the resulting protocol and implementations will gain easier acceptance by a community already familiar with the OAI-PMH.

This project aims to develop a Simple Web-service Offering Repository Deposit (SWORD)—a lightweight deposit protocol that will be implemented as a simple web service within EPrints, DSpace, Fedora and IntraLibrary and tested against a prototype ‘smart deposit’ tool. The project plans to take forward the lightweight protocol originally formulated by a small group working within the Digital Repositories Programme (the ‘Deposit API’ work) . The project is aligned with the Object Reuse and Exchange (ORE) Mellon-funded two-year project by the Open Archives Initiative, which commenced in October 2006. Members of the SWORD project team are represented on its Technical and Liaison Committees. . . . . The SWORD project is not attempting to duplicate work being done being done by ORE, but seeks to build on existing work to support UK-specific requirements whilst feeding into the ongoing ORE project.

DSpace Executive Director Appointed

Michele Kimpton, formerly of the Internet Archive, has been appointed the Executive Director of the newly formed DSpace nonprofit organization.

Here’e an excerpt from the announcement:

I am happy to report that we are making good progress on establishing the new non-profit organization, and I would like to take this opportunity to announce that Michele Kimpton has accepted the position as Executive Director for the organization. The DSpace non-profit corporation will initially provide organizational, legal and financial support for the DSpace open source software project. Prior to joining DSpace, Michele Kimpton was one of the founding Directors at Internet Archive, in charge of Web archiving technology and services. . . .

Michele developed an organization within Internet Archive to help support and fund open source software and web archiving programs, so she comes to us with a lot of experience in both open source software and long-term digital curation. Her organization worked primarily with National Libraries and Archives around the world, so she is familiar with large, widely diverse and distributed communities. Michele was one of the co-founders of the IIPC ( International Internet Preservation Consortium, netpreserve.org), whose mission is to work collaboratively to develop tools, standards and processes for archiving and preservation of web material.

The DSpace non-profit corporation is in the final stages of completing filing status as a not-for-profit corporation of Massachusetts. By summer 2007 we expect to have this legal entity in place, and a complete Board of Directors. Both MIT and Hewlett Packard have provided the start up funding to establish the organization over the next several years. . . .

Position Papers from the NSF/JISC Repositories Workshop

Position papers from the NSF/JISC Repositories Workshop are now available.

Here’s an excerpt from the Workshop’s Welcome and Themes page:

Here is some background information. A series of recent studies and reports have highlighted the ever-growing importance for all academic fields of data and information in digital formats. Studies have looked at digital information in science and in the humanities; at the role of data in Cyberinfrastructure; at repositories for large-scale digital libraries; and at the challenges of archiving and preservation of digital information. The goal of this workshop is to unite these separate studies. The NSF and JISC share two principal objectives: to develop a road map for research over the next ten years and what to support in the near term.

Here are the position papers:

EDUCAUSE 2006 Podcast on Penn’s Institutional Repository

In this podcast ("Content Recruitment and Development: A Proactive Approach to Building an Institutional Repository"), Marjorie Hassen describes the University of Pennsylvania Libraries’ strategy for developing and supporting the ScholarlyCommons@Penn, an institutional repository based on the Digital Commons platform.

Thursday’s OAI5 Presentations

Presentations from Thursday’s sessions of the 5th Workshop on Innovations in Scholarly Communication in Geneva are now available.

Here are a few highlights from this major conference:

  • Business Models for Digital Repositories (PowerPoint): "Those setting up, or planning to set up, a digital repository may be interested to know more about what has gone before them. What is involved, what is the cost, how many people are needed, how have others made the case to their institution, and how do you get anything into it once it is built? I have recently undertaken a study of European repository business models for the DRIVER project and will present an overview of the findings."
  • DRIVER: Building a Sustainable Infrastructure of European Scientific Repositories (PowerPoint): "Ten partners from eight countries have entered into an international partnership, to connect and network as a first step more than 50 physically distributed institutional repositories to one, large-scale, virtual Knowledge Base of European research."
  • On the Golden Road : Open Access Publishing in Particle Physics (RealVideo): "A working party works now to bring together funding agencies, laboratories and libraries into a single consortium, called SCOAP3 (Sponsoring Consortium for Open access Publishing in Particle Physics). This consortium will engage with publishers towards building a sustainable model for open access publishing. In this model, subscription fees from multiple institutions are replaced with contracts with publishers of open access journals where the SCOAP3 consortium is a single financial partner."
  • Open Access Forever—Or Five Years, Whichever Comes First: Progress on Preserving the Digital Scholarly Record (RealVideo): "The current state of the curation and preservation of digital scholarship over its entire lifecycle will be reviewed, and progress on problems of specific interest to scholarly communication will be examined. The difficulty of curating the digital scholarly record and preserving it for future generations has important implications for the movement to make that record more open and accessible to the world, so this a timely topic for those who are interested in the future of scholarly communication."

(You may want to download PowerPoint Viewer 2007 if you don’t have PowerPoint 2007).

OpenDOAR API

The OpenDOAR project has announced the availability of an API for accessing digital repository data in their database.

Here’s an excerpt from the press release:

OpenDOAR, as a SHERPA project, is pleased to announce the release of an API that lets developers use OpenDOAR data in their applications. It is a machine-to-machine interface that can run a wide variety of queries against the OpenDOAR Database and get back XML data. Developers can choose to receive just repository titles & URLs, all the available OpenDOAR data, or intermediate levels of detail. They can then incorporate the output into their own applications and ‘mash-ups’, or use it to control processes such as OAI-PMH harvesting. . . .

OpenDOAR is a continuing project hosted at the University of Nottingham under the SHERPA Partnership. OpenDOAR maintains and builds on a quality-assured list of the world’s Open Access Repositories. OpenDOAR acts as a bridge between repository administrators and the service providers who make use of information held in repositories to offer search and other services to researchers and scholars worldwide.

A key feature of OpenDOAR is that all of the repositories we list have been visited by project staff, tested and assessed by hand. We currently decline about a quarter of candidate sites as being broken, empty, out of scope, etc. This gives a far higher quality assurance to the listings we hold than results gathered by just automatic harvesting. OpenDOAR has now surveyed over 1,100 repositories, producing a classified Directory of over 800 freely available archives of academic information.

Wednesday’s OAI5 Presentations

Presentations from Wednesday’s sessions of the 5th Workshop on Innovations in Scholarly Communication in Geneva are now available.

Here are a few highlights from this major conference:

  • MESUR: Metrics from Scholarly Usage of Resources (PowerPoint): "The two-year MESUR project, funded by the Andrew W. Mellon Foundation, aims to define and validate a range of usage-based impact metrics, and issue guidelines with regards to their characteristics and proper application. The MESUR project is constructing a large-scale semantic model of the scholarly community that seamlessly integrates a wide range of bibliographic, citation and usage data."
  • OAI Object Re-Use and Exchange (PowerPoint): "In this presentation, we will give an overview of the current activities, including: defining the problem of compound documents within the web architecture, enumerating and exploring several use cases, and identifying likely adopters of OAI-ORE."
  • OpenDOAR Policy Tools and Applications (RealVideo): "OpenDOAR has developed a set of policy generator tools for repository administrators and is contacting administrators to advocate policy development."
  • State of OAI-PMH (PowerPoint): "The OAI-PMH was released in 2001 and stabilized at v2.0 in 2002. Since then there has been steady growth in adoption of the protocol. Support for the OAI-PMH is assumed for base-level interoperability between institutional repositories, and is also provided for many other collections of scholarly material. I will review the current landscape and reflect on some milestones and issues."

(You may want to download PowerPoint Viewer 2007 if you don’t have PowerPoint 2007).

Repository 66: OA Digital Repository Map Mashup

Stuart Lewis of the University of Wales Aberystwyth has created a Google Map mashup called Repository 66 that shows worldwide open access digital repositories using data from ROAR and OpenDOAR. (Route 66 was a famous highway in the US.)