Washington DC Principles for Free Access to Science Coalition Statement

Hard on the heels of the "Brussels Declaration on STM Publishing" by STM publishers, the Washington DC Principles for Free Access to Science Coalition has issued a statement condemning the Federal Research Public Access Act and similar measures.

An excerpt from the press release follows:

"The long tradition of methodical scientific inquiry and information sharing through publication in scholarly journals has helped advance medicine to where it is today," said Martin Frank of the American Physiological Society and coordinator of the coalition. "We as independent publishers must determine when it is appropriate to make content freely available, and we believe strongly it should not be determined by government mandate."

The Coalition also reaffirmed its ongoing practice of making millions of scientific journal articles available free of charge, without an additional financial burden on the scientific community or on funding agencies. More than 1.6 million free articles are already available to the public free of charge on HighWire Press.

"The scholarly publishing system is a delicate balance between the need to sustain journals financially and the goal of disseminating scientific knowledge as widely as possible. Publishers have voluntarily made more journal articles available free worldwide than at any time in history — without government intervention," noted Kathleen Case of the American Association for Cancer Research.

The Coalition expressed concern that a mandatory timetable for free access to all federally funded research could harm journals, scientists, and ultimately the public. Subscriptions to journals with a high percentage of federally funded research would decline rapidly. Subscription revenues support the quality control system known as peer review and also support the educational work of scientific societies that publish journals.

Undermining subscriptions would shift the cost of publication from the publisher who receives subscription revenue to the researcher who receives grants. Such a shift could:

Divert scarce dollars from research. Publishers now pay the cost of publication out of subscription revenue; if the authors have to pay, the funds will come from their research grants. Nonprofit journals without subscription revenue would have to rely on the authors’ grant funds to cover publication costs, which would divert funding from research.

Result in only well-funded scientists being able to publish their work. The ability to publish in scientific journals should be available equally to all.

Reduce the ability of journals to fund peer review. Most journals spend 40% or more of their revenue on quality control through the peer review system; without subscription income and with limitations on author fees, peer review would suffer.

Harm those scientific societies that rely on income from journals to fund the professional development of scientists. Revenues from scholarly publications fund research, fellowships to junior scientists, continuing education, and mentoring programs to increase the number of women and under-represented groups in science, among many other activities.

Prominent open access advocates Stevan Harnad and Peter Suber have both critiqued the statement in detail.

The Brussels Declaration: You Don’t Need a Weatherman to Know Which Way the Wind Blows

The recent "Brussels Declaration on STM Publishing" by major scholarly publishers, such as Elsevier and Wiley, can be boiled down to: the scholarly publishing system ain’t broke, so don’t try to fix it. It provides an interesting contrast to the 2004 "Washington DC Principles for Free Access to Science" by not-for-profit publishers, which outlined a variety of strategies for making content freely available.

Sadly, it suggests that the "Brussels Declaration" publishers fail to fully understand that the decades-old serials crisis has deeply alienated several generations of librarians, who are their primary customers. Publishers count on libraries being captive customers because scholarly publishing is monopolistic in nature (e.g., one journal article does not substitute for another article) and, consequently, demand is relatively inelastic, regardless of price. However, it is a rare business that thrives by alienating its customers. As the SPARC initiative and similar efforts illustrate, many librarians want to dramatically change the existing scholarly publishing system.

Driven by endless library serials cuts for journals in their disciplines, a growing belief that scholarly literature needs to be freely available for global scholarship to flourish, and excitement over the new potentials of digital publishing, scholars increasingly want to change the system as well. As has often been noted, the open access movement is not anti-publisher, but it is publisher-neutral, meaning that, as long as certain critical functions (such as peer review) are adequately performed, it does not matter how freely available scholarly works are published.

In my view, publishers add significant value to scholarly journals and other works. Some of these value-added functions are currently difficult to replicate; however, given technological advances in open-source digital publishing software, the number of these functions has been dwindling. A key question is: How long will it be before the most difficult production-oriented functions can be easily replicated, leaving non-technical functions, such as branding and prestige, to be dealt with? Another is: How long will it be before viable new modes of scholarly publishing, supported by open source software, are developed that compete with existing publishing models?

The clock is ticking. The more intransigent publishers are, the stronger the incentive for those who want change to improve open source publishing tools, to fund low-cost or open access publishing alternatives, to seek remedies from governments and other organizations that fund research, and to develop new modes of scholarly publishing.

Dialog, openness to new funding strategies and publishing practices, compromise, and imagination may serve publishers better in the long run than denial, rigidity, and attack. A more flexible outlook may reveal opportunities, not just dangers, in a scholarly publishing system in flux.

Open Access Petition Given to European Commission

A petition that asks the European Commission to endorse the recommendations of the 2006 Study on the Economic and Technical Evolution of the Scientific Publication Markets of Europe was delivered to Janez Potocnik, EU Commissioner for Science and Research.

The petition has over 20,000 signatures, including those of senior officials from 750 education, research, and cultural organizations and several Nobel laureates.

See the JISC press release for further details.

Economists’ Self-Archiving Behavior

Ted C. Bergstrom and Rosemarie Lavaty have deposited an eprint in eScholarship that studies the self-archiving behavior of economists ("How Often Do Economists Self-Archive?").

They summarize their findings in the paper’s abstract:

To answer the question of the paper’s title, we looked at the tables of contents from two recent issues of 33 economics journals and attempted to find a freely available online version of each article. We found that about 90 percent of articles in the most-cited economics journals and about 50 percent of articles in less-cited journals are available. We conduct a similar exercise for political science and find that only about 30 percent of the articles are freely available. The paper reports a regression analysis of the effects of author and article characteristics on likelihood of posing and it discusses the implications of self-archiving for the pricing of subscription-based academic journals.

Their conclusion suggests that significant changes in journal pricing could result from self-archiving:

As more content becomes available in open access archives, publishers are faced with greater availability of close substitutes for their products and library demand for journals is likely to become more price-elastic. The increased price-responsiveness means that profit-maximizing prices will fall. As a result, it can be hoped that commercial publishers will no longer be able to charge subscription prices greatly in excess of average cost. Thus the benefits of self-archiving to the academic community are twofold. There is the direct effect of making a greater portion of the body of research available to scholars everywhere and the secondary effect of reducing the prices charged by publishers who exploit their monopoly power.

American Society for Cell Biology Issues Open Access Position Paper

The American Society for Cell Biology (ASCB) has issued an open access position paper ("ASCB Position on Public Access to Scientific Literature").

Here is an excerpt:

The ASCB believes strongly that barriers to scientific communication slow scientific progress. The more widely scientific results are disseminated, the more readily they can be understood, applied, and built upon. The sooner findings are shared, the faster they will lead to new scientific insights and breakthroughs. This conviction has motivated the ASCB to provide free access to all of the research articles in Molecular Biology of the Cell two months after publication, which it has done since 2001. . . .

Some publishers argue that providing free access to their journal’s content will catastrophically erode their revenue base. The experience of many successful research journals demonstrates otherwise; these journals make their online content freely available after a short embargo period that protects subscription revenue. For example, as noted above, the content of Molecular Biology of the Cell is free to all after only two months, yet the journal remains not only financially sound, but profitable. The data clearly show that free access and profitability are not mutually exclusive.

Our goal should be to make research articles freely available as soon as feasible so that science and the public benefit from their expanded use and application. At the same time, it is important that nonprofit societies and other publishers generate sufficient revenues to sustain the costs of reviewing and publishing articles. We believe that a six-month embargo period represents a reasonable compromise between the financial requirements of supporting a journal and the need for access to current research.

Recent Object Reuse and Exchange (ORE) Documents

In a previous posting, I discussed the Open Archives Initiative’s Object Reuse and Exchange (ORE) project. ORE is worth watching closely.

Two new documents were released this January:

  • "Report of the January 2007 ORE-TC Meeting," which is: "A detailed report of the results of the meeting of OAI-ORE Technical Committee describing features and requirements of the ORE model and its context in the Web Architecture."
  • "Open Repositories 2007," which is: "A presentation describing OAI-ORE and progress based on the January 2007 ORE Technical Committee Meeting."

Petition to European Commission to Support Open Access Tops 10,000 Signatures

A petition to the European Commission asking it to support the European Union’s "Study on the Economic and Technical Evolution of the Scientific Publication Markets of Europe" has been signed by more than 10,000 people.

From the press release:

Nobel laureates Harold Varmus and Rich Roberts are among the more than ten thousand concerned researchers, senior academics, lecturers, librarians, and citizens from across Europe and around the world who are signing an internet petition calling on the European Commission to adopt polices to guarantee free public access to research results and maximise the worldwide visibility of European research.

Organisations too are lending their support, with the most senior representatives from over 500 education, research and cultural organisations in the world adding their weight to the petition, including CERN, the UK’s Medical Research Council, the Wellcome Trust, the Italian Rector’s Conference, the Royal Netherlands Academy for Arts & Sciences (KNAW) and the Swiss Academy for the Humanities and Social Sciences (SAGW), alongside the petition’s sponsors, SPARC Europe, JISC, the SURF Foundation, the German Research Foundation (DFG) and the Danish Electronic Research Library (DEFF).

From the petition

Research funding agencies have a central role in determining researchers’ publishing practices. Following the lead of the NIH and other institutions, they should promote and support the archiving of publications in open repositories, after a (possibly domain-specific) time period to be discussed with publishers. This archiving could become a condition for funding.

The following actions could be taken at the European level: (i) Establish a European policy mandating published articles arising from EC-funded research to be available after a given time period in open access archives, and (ii) Explore with Member States and with European research and academic associations whether and how such policies and open repositories could be implemented.

More signatures are needed, especially from EU organizations and individuals.

DOE and British Library to Develop Science.world Portal

The U.S. Department of Energy (DOE) and the British Library have signed an agreement to develop an portal to international science resources called Science.world.

From the press release:

Called ‘Science.world,’ the planned resource would be available for use by scientists in all nations and by anyone interested in science. The approach will capitalise on existing technology to search vast collections of science information distributed across the globe, enabling much-needed access to smaller, less well-known sources of highly valuable science. Following the model of Science.gov, the U.S. interagency science portal that relies on content published by each participating agency, ‘Science.world’ will rely on scientific resources published by each participating nation. Other countries have been invited to participate in this international effort. . . .

Objectives of the ‘Science.world’ initiative are to:

  • Search dispersed, electronic collections in various science disciplines;
  • Provide direct, seamless and free searching of open-source collections and portals;
  • Build upon existing and already successful national models for searching;
  • Complement existing information collections and systems; and
  • Raise the visibility and usage of individual sources of quality science information.

OAIster Hits 10,000,000 Records

Excerpt from the press release:

We live in an information-driven world—one in which access to good information defines success. OAIster’s growth to 10 million records takes us one step closer to that goal.

Developed at the University of Michigan’s Library, OAIster is a collection of digital scholarly resources. OAIster is also a service that continually gathers these digital resources to remain complete and fresh. As global digital repositories grow, so do OAIster’s holdings.

Popular search engines don’t have the holdings OAIster does. They crawl web pages and index the words on those pages. It’s an outstanding technique for fast, broad information from public websites. But scholarly information, the kind researchers use to enrich their work, is generally hidden from these search engines.

OAIster retrieves these otherwise elusive resources by tapping directly into the collections of a variety of institutions using harvesting technology based on the Open Archives Initiative (OAI) Protocol for Metadata Harvesting. These can be images, academic papers, movies and audio files, technical reports, books, as well as preprints (unpublished works that have not yet been peer reviewed). By aggregating these resources, OAIster makes it possible to search across all of them and return the results of a thorough investigation of complete, up-to-date resources. . . .

OAIster is good news for the digital archives that contribute material to open-access repositories. "[OAIster has demonstrated that]. . . OAI interoperability can scale. This is good news for the technology, since the proliferation is bound to continue and even accelerate," says Peter Suber, author of the SPARC Open Access Newsletter. As open-access repositories proliferate, they will be supported by a single, well-managed, comprehensive, and useful tool.

Scholars will find that searching in OAIster can provide better results than searching in web search engines. Roy Tennant, User Services Architect at the California Digital Library, offers an example: "In OAIster I searched ‘roma’ and ‘world war,’ then sorted by weighted relevance. The first hit nailed my topic—the persecution of the Roma in World War II. Trying ‘roma world war’ in Google fails miserably because Google apparently searches ‘Rome’ as well as ‘Roma.’ The ranking then makes anything about the Roma people drop significantly, and there is nothing in the first few screens of results that includes the word in the title, unlike the OAIster hit."

OAIster currently harvests 730 repositories from 49 countries on 6 continents. In three years, it has more than quadrupled in size and increased from 6.2 million to 10 million in the past year. OAIster is a project of the University of Michigan Digital Library Production Service.

2006 PACS Review Use Statistics

The Public-Access Computer Systems Review (PACS Review) was a freely available e-journal, which I founded in 1989. It allowed authors to retain their copyrights, and it had a liberal copyright policy for noncommercial use. It’s last issue was published in 1998.

In 2006, there were 763,228 successful requests for PACS Review files, 2,091 average successful requests per day, 751,264 successful requests for pages, and 2,058 average successful requests for pages per day. (A request is for any type of file; a page request is for a content file, such as an HTML, PDF, or Word file). These requests came from 41,865 distinct host computers.

The requests came from 134 Internet domains. Leaving aside requests from unresolved numerical addresses, the top 15 domains were: .com (Commercial), .net (Networks), .edu (USA Higher Education), .cz (Czech Republic), .jp (Japan), .ca (Canada), .uk (United Kingdom), .au (Australia), .de (Germany), .nl (Netherlands), .org (Non Profit Making Organizations), .in (India), .my (Malaysia), .it (Italy), and .mx (Mexico). At the bottom were domains such as .ms (Montserrat), .fm (Micronesia), .nu (Niue), .ad (Andorra), and .az (Azerbaijan).

Rounded to the nearest thousand, there had previously been 3.5 million successful requests for PACS Review files.

This is the last time that use statistics will be reported for the PACS Review.

Fedora 2.2 Released

The Fedora Project has released version 2.2 of Fedora.

From the announcement:

This is a significant release of Fedora that includes a complete repackaging of the Fedora source and binary distribution so that Fedora can now be installed as a standalone web application (.war) in any web container. This is a first step in positioning Fedora to fit within a standard "enterprise system" environment. A new installer application makes it easy to setup and run Fedora. Fedora now uses Servlet Filters for authentication. To support digital object integrity, the Fedora repository can now be configured to calculate and store checksums for datastream content. This can be done globally, or on selected datastreams. The Fedora API also provides the ability to check content integrity based on checksums. The RDF-based Resource Index has been tuned for better performance. Also, a new high-performing triplestore, backed by Postgres, has been developed that can be plugged into the Resource Index. Fedora contains many other enhancements and bug fixes.

ScientificCommons.org: Access to Over 13 Million Digital Documents

ScientificCommons.org is an initiative of the Institute for Media and Communications Management at the University of St. Gallen. It indexes both metadata and full-text from global digital repositories. It uses OAI-PMH to identify relevant documents. The full-text documents are in PDF, PowerPoint, RTF, Microsoft Word, and Postscript formats. After being retrieved from their original repository, the documents are cached locally at ScientificCommons.org. It has indexed about 13 million documents from over 800 repositories.

Here are some additional features from the About ScientificCommons.org page:

Identification of authors across institutions and archives: ScientificCommons.org identifies authors and assigns them their scientific publications across various archives. Additionally the social relations between the authors will be extracted and displayed. . . .

Semantic combination of scientific information: ScientificCommons.org structures and combines the scientific data to knowledge areas with Ontology’s. Lexical and statistical methods are used to identify, extract and analyze keywords. Based on this processes ScientificCommons.org classifies the scientific data and uses it e.g. for navigational and weighting purposes.

Personalization services: ScientificCommons.org offers the researchers the possibilities to inform themselves about new publications via our RSS Feed service. They can customize the RSS Feed to a special discipline or even to personalized list of keywords. Furthermore ScientificCommons.org will provide an upload service. Every researcher can upload his publication directly to ScientificCommons.org and assign already existing publications at ScientificCommons.org to his own researcher profile.

Notre Dame Institutional Digital Repository Phase I Final Report

The University of Notre Dame Libraries have issued a report about their year-long institutional repository pilot project. There is an abbreviated HTML version and a complete PDF version.

From the Executive Summary:

Here is the briefest of summaries regarding what we did, what we learned, and where we think future directions should go:

  1. What we did—In a nutshell we established relationships with a number of content groups across campus: the Kellogg Institute, the Institute for Latino Studies, Art History, Electrical Engineering, Computer Science, Life Science, the Nanovic Institute, the Kaneb Center, the School of Architecture, FTT (Film, Television, and Theater), the Gigot Center for Entrepreneurial Studies, the Institute for Scholarship in the Liberal Arts, the Graduate School, the University Intellectual Property Committee, the Provost’s Office, and General Counsel. Next, we collected content from many of these groups, "cataloged" it, and saved it into three different computer systems: DigiTool, ETD-db, and DSpace. Finally, we aggregated this content into a centralized cache to provide enhanced browsing, searching, and syndication services against the content.
  2. What we learned—We essentially learned four things: 1) metadata matters, 2) preservation now, not later, 3) the IDR requires dedicated people with specific skills, 4) copyright raises the largest number of questions regarding the fulfillment of the goals of the IDR.
  3. Where we are leaning in regards to recommendations—The recommendations take the form of a "Chinese menu" of options, and the options are be grouped into "meals." We recommend the IDR continue and include: 1) continuing to do the Electronic Theses & Dissertations, 2) writing and implementing metadata and preservation policies and procedures, 3) taking the Excellent Undergraduate Research to the next level, and 4) continuing to implement DigiTool. There are quite a number of other options, but they may be deemed too expensive to implement.

digitalculturebooks

The University of Michigan Press and the Scholarly Publishing Office of the University of Michigan Library, working together as the Michigan Digital Publishing Initiative, have established digitalculturebooks, which offers free access to digital versions of its published works (print works are fee-based). The imprint focuses on "the social, cultural, and political impact of new media."

The objectives of the imprint are to:

  • develop an open and participatory publishing model that adheres to the highest scholarly standards of review and documentation;
  • study the economics of Open Access publishing;
  • collect data about how reading habits and preferences vary across communities and genres;
  • build community around our content by fostering new modes of collaboration in which the traditional relationship between reader and writer breaks down in creative and productive ways.

Library Journal Academic Newswire notes in its article about digitalculturebooks:

While press officials use the term "open access," the venture is actually more "free access" than open at this stage. Open access typically does not require permission for reuse, only a proper attribution. UM director Phil Pochoda told the LJ Academic Newswire that, while no final decision has been made, the press’s "inclination is to ask authors to request the most restrictive Creative Commons license" for their projects. That license, he noted, requires attribution and would not permit commercial use, such as using it in a subsequent for-sale product, without permission. The Digital Culture Books web site currently reads that "permission must be received for any subsequent distribution."

The imprint’s first publication is The Best of Technology Writing 2006.

(Prior postings about digital presses.)

Has Authorama.com "Set Free" 100 Public Domain Books from Google Book Search?

In a posting on Google Blogoscoped, Philipp Lenssen has announced that he has put up 100 public domain books from Google Book Search on Authorama.

Regarding his action, Lenssen says:

In other words, Google imposes restrictions on these books which the public domain does not impose*. I’m no lawyer, and maybe Google can print whatever guidelines they want onto those books. . . and being no lawyer, most people won’t know if the guidelines are a polite request, or legally enforceable terms**. But as a proof of concept—the concept of the public domain—I’ve now ‘set free’ 100 books I downloaded from Google Book Search by republishing them on my public domain books site, Authorama. I’m not doing this out of disrespect for the Google Books program (which I think is cool, and I’ll credit Google on Authorama) but out of respect for the public domain (which I think is even cooler).

Since Lenssen has retained Google’s usage guidelines in the e-books, it’s unclear how they have been "set free," in spite of the following statement on Authorama’s Books from Google Book Search page:

The following books were downloaded from Google Book Search and are made available here as public domain. You can download, republish, mix and mash these books, for private or public, commercial or non-commercial use.

Leaving aside the above statement, Lenssen’s action appears to violate the following Google usage guideline, where Google asks that users:

Make non-commercial use of the files We designed Google Book Search for use by individuals, and we request that you use these files for personal, non-commercial purposes.

However, in the above guideline, Google uses the word "request," which suggests voluntary, rather than mandatory, compliance. Google also requests attribution and watermark retention.

Maintain attribution The Google ‘watermark’ you see on each file is essential for informing people about this project and helping them find additional materials through Google Book Search. Please do not remove it.

Note the use of the word "please."

It’s not clear how to determine if Google’s watermark remains in the Authorama files, but, given the retention of the usage guidelines, it likely does.

So, do Google’s public domain books really need to be "set free"? In its usage guidelines, Google appears to make compliance requests, not compliance requirements. Are such requests binding or not? If so, the language could be clearer. For example, here’s a possible rewording:

Make non-commercial use of the files Google Book Search is for individual use only, and its files can only be used for personal, non-commercial purposes. All other use is prohibited.

Will Self-Archiving Cause Libraries to Cancel Journal Subscriptions?

There has been a great deal of discussion of late about the impact of self-archiving on library journal subscriptions. Obviously, this is of great interest to journal publishers who do not want to wake up one morning, rub the sleep from their eyes, and find out over their first cup of coffee at work that libraries have en masse canceled subscriptions because a "tipping point" has been reached. Likewise, open access advocates do not want journal publishers to panic at the prospect of cancellations and try to turn back the clock on liberal self-archiving policies. So, this is not a scenario that any one wants, except those who would like to simply scrap the existing journal publishing system and start over with a digital tabula rosa.

So, deep breath: Is the end near?

This question hinges on another: Will libraries accept any substitute for a journal that does not provide access to the full, edited, and peer-reviewed contents of that journal?

If the answer is "yes," publishers better get out their survival kits and hunker down for the digital nuclear winter or else change business practices to embrace the new reality. Attempts to fight back by rolling back the clock may just make the situation worse: the genie is out of the bottle.

If the answer is "no," preprints pose no threat, but postprints may under some difficult to attain circumstances.

It is unlikely that a critical mass of author created postprints (i.e., author makes the preprint look like the postprint) will ever emerge. Authors would have to be extremely motivated to have this occur. If you don’t believe me, take a Word file that you submitted to a publisher and make it look exactly like the published article (don’t forget the pagination because that might be a sticking point for libraries). That leaves publisher postprints (generally PDF files).

For the worst to happen, every author of every paper published in a journal would have to self-archive the final publisher PDF file (or the publishers themselves would have to do it for the authors under mandates).

But would that be enough? Wouldn’t the permanence and stability of the digital repositories housing these postprints be of significant concern to libraries? If such repositories could not be trusted, then libraries would have to attempt to archive the postprints in question themselves; however, since postprints are not by default under copyright terms that would allow this to happen (e.g., they are not under Creative Commons Licenses), libraries may be barred from doing so. There are other issues as well: journal and issue browsing capabilities, the value-added services of indexing and abstracting services, and so on. For now, let’s wave our hands briskly and say that these are all tractable issues.

If the above problems were overcome, a significant one remains: publishers add value in many ways to scholarly articles. Would libraries let the existing system of journal publishing collapse because of self-archiving without a viable substitute for these value-added functions being in place?

There have been proposals for and experiments with overlay journals for some time, as well other ideas for new quality control strategies, but, to date, none have caught fire. Old-fashioned peer review, copy editing and fact checking, and publisher-based journal design and production still reign, even among the vast majority of e-journals that are not published by conventional publishers. In the Internet age, nothing technological stops tens of thousands of new e-journals using open source journal management software from blooming, but they haven’t so far, have they? Rather, if you use a liberal definition of open access, there are about 2,500 OA journals—a significant achievement; however, there are questions about the longevity of such journals if they are published by small non-conventional publishers such as groups of scholars (e.g., see "Free Electronic Refereed Journals: Getting Past the Arc of Enthusiasm"). Let’s face it—producing a journal is a lot of work, even a small journal that only publishes less than a hundred papers a year.

Bottom line: a perfect storm is not impossible, but it is unlikely.

Journal 2.0: PLoS ONE Beta Goes Live

The Public Library of Science has released a beta version of its innovative PLoS ONE journal.

Why innovative? First, it’s a multidisciplinary scientific journal, with published articles covering subjects that range from Biochemistry to Virology. Second, it’s a participative journal that allows registered users to annotate and initiate discussions about articles. Open commentary and peer-review have been previously implemented in some e-journals (e.g, see JIME: An Interactive Journal for Interactive Media), but PLoS ONE is the most visible of these efforts and, given PLoS’s reputation for excellence, it lends credibility to a concept that has yet to catch fire in the journal publishing world. A nice feature is the “Most Annotated” tab on the home page that highlights articles that have garnered reader commentary. Third, it’s an open access journal in the full sense of the term, with all articles under the least restrictive Creative Commons license, the Creative Commons Attribution License.

The beta site is a bit slow, probably due to significant interest, so expect some initial browsing delays.

Congratulations to PLoS on PLoS ONE. It’s journal worth keeping an eye on.

Certifying Digital Repositories: DINI Draft

The Electronic Publishing Working Group of the Deutsche Initiative für Netzwerkinformation (DINI) has released an English draft of its DINI-Certificate Document and Publication Services 2007.

It outlines criteria for repository author support; indexing; legal aspects; long-term availability; logs and statistics; policies; security, authenticity and data integrity; and service visibility. It also provides examples.

STARGATE Final Report and Tools

The STARGATE project has issued its final report. Here’s a brief summary of the project from the Executive Summary:

STARGATE (Static Repository Gateway and Toolkit) was funded by the Joint Information Systems Committee (JISC) and is intended to demonstrate the ease of use of the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) Static Repository technology, and the potential benefits offered to publishers in making their metadata available in this way This technology offers a simpler method of participating in many information discovery services than creating fully-fledged OAI-compliant repositories. It does this by allowing the infrastructure and technical support required to participate in OAI-based services to be shifted from the data provider (the journal) to a third party and allows a single third party gateway provider to provide intermediation for many data providers (journals).

To support the its work, the project developed tools and supporting documentation, which can be found below:

Details on Open Repositories 2007 Talks

Details about the Open Repositories 2007 conference sessions are now available, including keynotes, poster sessions, presentations, and user groups. For DSpace, EPrints, and Fedora techies, the user group sessions look like a don’t miss with talks by luminaries such as John Ockerbloom and MacKenzie Smith. The presentations sessions include talks by Andrew Treloar, Carl Lagoze and Herbert Van de Sompel, Leslie Johnston, Simeon Warner among other notables. Open Repositories 2007 will be held in San Antonio, January 23-26.

Hopefully, the conference organizers plan to make streaming audio and/or video files available post-conference, but PowerPoints, as was the case for Open Repositories 2006, would also be useful.

Under the Hood of PLoS ONE: The Open Source TOPAZ E-Publishing System

PLoS is building its innovative PLoS ONE e-journal, which will incorporate both traditional and open peer review, using the open source TOPAZ software. (For a detailed description of the PLoS ONE peer review process, check out "ONE for All: The Next Step for PLoS.")

What is TOPAZ? It’s Web site doesn’t provide specifics, but "PLoS ONE—Technical Background" by Richard Cave does:

The core of TOPAZ is a digital information repository called Fedora (Flexible Extensible Digital Object Repository Architecture). Fedora is an Open Source content management application that supports the creation and management of digital objects. The digital objects contain metadata to express internal and external relationships in the repository, like articles in a journal or the text, images and video of an article. This relationship metadata can also be search using a semantic web query languages. Fedora is jointly developed by Cornell University’s computer science department and the University of Virginia Libraries.

The metastore Kowari will be used with Fedora to support Resource Description Framework (RDF) http://en.wikipedia.org/wiki/Resource_Description_Framework metadata within the repository.

The PLoS ONE web interface will be built with AJAX. Client-side APIs will create the community features (e.g. annotations, discussion threads, ratings, etc.) for the website. As more new features are available on the TOPAZ architecture, we will launch them on PLoS ONE.

There was a TOPAZ Wiki at PLoS. It’s gone, but it’s pages are still cached by Google. The Wiki suggests that TOPAZ is likely to support Atom/RSS feeds, full-text search, and OAI-PMH among other possible features.

For information about other open source e-journal publishing systems, see "Open Source Software for Publishing E-Journals."