"10 Simple Rules for the Care and Feeding of Scientific Data"

Alyssa Goodman et al. have self-archived "10 Simple Rules for the Care and Feeding of Scientific Data" in arXiv.org.

Here's an excerpt:

This article offers a short guide to the steps scientists can take to ensure that their data and associated analyses continue to be of value and to be recognized. In just the past few years, hundreds of scholarly papers and reports have been written on questions of data sharing, data provenance, research reproducibility, licensing, attribution, privacy, and more, but our goal here is not to review that literature. Instead, we present a short guide intended for researchers who want to know why it is important to "care for and feed" data, with some practical advice on how to do that.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

From Bitstreams to Heritage: Putting Digital Forensics into Practice in Collecting Institutions

The BitCurator Project has released From Bitstreams to Heritage: Putting Digital Forensics into Practice in Collecting Institutions.

Here's an excerpt:

The application of forensics tools and methods to the curation of born-digital collections in LAMs has advanced significantly over the past several years. It would have been quite surprising, for example, to hear an archivist talking about write blockers or disk images ten years ago, but such terms are now used frequently at archival conferences and increasingly in the professional literature. The BitCurator project is actively working to construct both the tools and the necessary documentation to help LAMs integrate digital forensics into their workflows. The BitCurator software environment is freely available for download and installation, and we continue to add associated documentation. The next phase of BitCurator will focus on continued software development, professional engagement activities, and further uptake of the software as we work to resolve real-world challenges facing LAMs.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"A Risk Analysis of File Formats for Preservation Planning"

SCAPE has released "A Risk Analysis of File Formats for Preservation Planning."

Here's an excerpt:

This paper presents an approach for automatic estimation of preservation risk for file formats. The main contribution of this work is a definition of the risk factors with associated severity level and its automatic computation. Our goal is to apply a solid knowledge base automatically extracted from linked open data repositories as the basis of the risk analysis system for digital preservation. This method is meant to facilitate decision making with regard to preservation of digital content in libraries and archives.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Digitizing Video for Long-Term Preservation: An RFP Guide and Template

The PrestoCentre Foundation has released Digitizing Video for Long-Term Preservation: An RFP Guide and Template.

Here's an excerpt:

Digitizing Video for Long-Term Preservation: An RFP Guide and Template is intended to take an institution step-by-step through the process of drafting a Request for Proposals (RFP) for the transfer of analog video formats to digital carriers for preservation. This template can be used by libraries, archives, and other cultural heritage institutions and submitted to qualified transfer vendors.

Funded by The Andrew W. Mellon Foundation, the Digitizing Video for Long-Term Preservation publication is part of the Video at Risk project undertaken by New York University and two partner institutions, Loyola University New Orleans and the University of California, Berkeley. The authors of this publication set out to create a template that would identify the key elements integral to the transfer of the video and audio signal from Standard Definition VHS to a preservation-quality digital file.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Digital Curation: D3.1—Evaluation of Cost Models and Needs & Gaps Analysis (MS12 Draft)

The 4C Project has released D3.1—Evaluation of Cost Models and Needs & Gaps Analysis (MS12 Draft).

Here's an excerpt from the announcement:

This draft report. . . provides an analysis of existing research related to the economics of digital curation and reports upon the investigation of how well current cost and benefit models meet stakeholders' needs for calculating and comparing financial information. It aims to point out gaps that need to be bridged between the capabilities of currently available models and tools, and stakeholders' needs for financial information.

To facilitate and clarify the model evaluation the report first outlines a basic terminology and a general description of the components of cost and benefit models.

The report then describes how the ten current and emerging cost models and benefit models included in the evaluation were identified and provides a summary of each of the models. This is followed by an in depth analysis of stakeholders' needs for financial information derived from the 4C project stakeholder consultation.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

A Survey of Preservation Activities in Cultural Heritage Institutions: FY2012

The Association for Library Collections and Technical Services has released A Survey of Preservation Activities in Cultural Heritage Institutions: FY2012.

Here's an excerpt from the announcement:

This survey is based on the Preservation Statistics survey program coordinated by the Association of Research Libraries (ARL) from 1984 through 2008. Following the collection of the 2007-2008 Preservation Statistics data, ARL discontinued surveying its members about their preservation activities; this decision left the preservation community without a way to document, assess, and analyze its collective current practices, staff and budget resources and strategic direction.

The report examines how respondents are organizing and administering traditional and digital preservation programs and chronicles their preventive preservation (disaster planning, environmental monitoring, outreach and education), conservation and reformatting/digitization activities. The report assesses trends in the preservation programs of academic and research libraries in the five-year span between the last ARL Preservation Statistics Survey in 2007 and this 2012 survey based on the responses provided by the 34 ARL members who participated in this pilot survey.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Speaking the Same Language: Building a Data Governance Program for Institutional Impact"

Michael J. Chapple has published "Speaking the Same Language: Building a Data Governance Program for Institutional Impact" in EDUCAUSE Review.

Here's an excerpt:

The need for consistent, reliable data across business and academic units is creating an unprecedented push toward strong data governance practices on college and university campuses. Working together, leaders from the central IT organization, the institutional research division, central administrative offices, and the academy can build a valuable platform to support data-driven decision-making across the institution. The tools used to create this platform will vary from institution to institution, but all should build toward the common goals of creating a data environment that embraces the five pillars of Quality & Consistency, Policies & Standards, Security & Privacy, Compliance, and Retention & Archiving.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Starting the Conversation: University-wide Research Data Management Policy

OCLC Research has released Starting the Conversation: University-wide Research Data Management Policy.

Here's an excerpt:

Starting the Conversation: University-wide Research Data Management Policy is a call for action that summarizes the benefits of systemic data management planning and identifies the stakeholders and their concerns. It also suggests that the library director proactively initiate a conversation among these stakeholders to get buy-in for a high-level, responsible data planning and management policy that is proactive, rather than reactive.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"The Political Economy of Federally Sponsored Data"

Bart Ragon has published "The Political Economy of Federally Sponsored Data" in the latest issue of the Journal of eScience Librarianship.

Here's an excerpt:

Librarian involvement in the Open Access (OA) movement has traditionally focused on access to scholarly publications. Recent actions by the White House have focused attention on access on the data produced from federally sponsored research. Questions have emerged concerning access to the output of federally sponsored research and whether it is a public or private good. Understanding the political battle over access to federally funded research is closely tied to the ownership of the peer review process in higher education and associated revenue streams, and as a result, interest groups seeking to influence government regulation have politicized the issues. As a major funder of research in higher education, policies from the federal government are likely to drive change in research practices at higher education institutions and impact library services. The political economy of federally sponsored research data will shape research enterprises in higher education and inspire a number of new services distributed throughout the research life cycle.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Checking In With Google Books, HathiTrust, and the DPLA"

Naomi Eichenlaub has published "Checking In With Google Books, HathiTrust, and the DPLA" in Computers in Libraries.

Here's an excerpt:

Google Books and HathiTrust have been making headlines in the library world and beyond for years now, while a new player, the Digital Public Library of America (DPLA), has only recently entered the scene. This article will provide a "state of the environment" update for these digital library projects including project history and background. It will also examine some challenges common to all three projects including copyright, orphan works, metadata, and quality issues.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Where Have All the Scientific Data Gone? LIS Perspective on the Data-At-Risk Predicament"

College & Research Libraries has released an e-print of "Where Have All the Scientific Data Gone? LIS Perspective on the Data-At-Risk Predicament."

Here's an excerpt:

Scientists produce vast amounts of data that often are not preserved properly or do not have inventories, placing them at risk. As part of an effort to more fully understand the data-at-risk predicament, researchers engaged in the DARI project at UNC's Metadata Research Center surveyed information custodians working in a range of settings. The survey collected information on the data characteristics and preservation plans. Forty-three information custodians completed the survey. The results indicate that at-risk data include a variety of formats, subject areas, and ownership status, as well as compliance with a variety of standards. Although a majority of respondents agree data preservation is important, they caution that time is the greatest barrier to sharing these data. The study has implications for data rescue and for training information custodians.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Report on Data Management and Data-Management Plans for the History of Science Society Committee on Research and the Profession (September 6, 2013)"

Daniel Goldstein et al. have published "Report on Data Management and Data-Management Plans for the History of Science Society Committee on Research and the Profession (September 6, 2013)" in the History of Science Society Newsletter.

Here's an excerpt:

The growing attention paid to data management by funding agencies reflects two additional trends. First is the idea that agencies that fund research ought to have some say in how that research is disseminated. Second is a growing trend in academia toward the increased sharing and broader dissemination of research data. In addition to these contexts, our report is also informed by the recognition that some historians of science are employing computational methods in their research for which the accumulation of and shared access to data is increasingly central. This report therefore considers what data mean in the context of history of science, when they should and should not be shared, and what mechanisms exist or could be developed for their access and preservation. In the process, the report raises pertinent questions that should be considered and, perhaps, addressed by applicants for NSF grants, but it does not provide a template for grant application plans. This report is intended to serve as a starting point for discussion, to identify issues and suggest possibilities.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Born Digital: Guidance for Donors, Dealers, and Archival Repositories

The Council on Library and Information Resources has released Born Digital: Guidance for Donors, Dealers, and Archival Repositories.

Here's an excerpt from the announcement:

This report offers recommendations to help ensure the physical and intellectual well-being of born-digital materials transferred from donors to archival repositories. The report surveys the primary issues and concerns related to born-digital acquisitions and is intended for a broad audience with varying levels of interest and expertise, including donors, dealers, and repository staff.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Preservation, Trust and Continuing Access for e-Journals

The Digital Preservation Coalition has released Preservation, Trust and Continuing Access for e-Journals.

Here's an excerpt:

This report discusses current developments and issues which libraries, publishers, intermediaries and service providers are facing in the area of digital preservation, trust and continuing access for e-journals. It also includes generic lessons and recommendations on outsourcing and trust learnt in this field of interest to the wider digital preservation community. It is not solely focused on technology, and covers relevant legal, economic and service issues.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"The Librarian as Research Informationist: A Case Study"

Lisa Federer has published "The Librarian as Research Informationist: A Case Study" in the Journal of the Medical Library Association.

Here's an excerpt:

Whereas traditional library services have generally focused on the "last mile" or finished product of the research process—the peer-reviewed literature—librarians have expertise that can help researchers create better research output in the form of more useful data. In the last several years, new policies from major funding bodies (such as the National Science Foundation's data management plan requirement and the NIH's public access policy) indicate that funders expect researchers to demonstrate the highest possible return on investment for their grant dollars. The need for better research data management has given rise to a new role for librarians: the "research informationist." Research informationists work with research teams at each step of the research process, from project inception and grant seeking to final publication, providing expert guidance on data management and preservation, bibliometric analysis, expert searching, compliance with grant funder policies regarding data management and open access, and other information-related areas.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Issues in the Appraisal and Selection of Geospatial Data

The National Digital Stewardship Alliance has released Issues in the Appraisal and Selection of Geospatial Data.

Here's an excerpt from the announcement:

The report provides an illuminating background on the problem area, then suggests ways to establish criteria for appraisal and selection decisions for geospatial data. It then proposes some models and processes for appraisal and selection, including tools for the identification and evaluation of data resources and triggers for appraisal and selection, and finishes with further questions for the community to explore.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Report of the European Commission Public Consultation on Open Research Data

The European Commission has released the Report of the European Commission Public Consultation on Open Research Data.

Here's an excerpt:

The European Commission held a public consultation on open research data on 2 July 2013 in Brussels, which was attended by a variety of stakeholders from the research community, industry, funders, libraries, publishers, infrastructure developers and others. The debate focused on five questions posed by the Commission to structure the debate and can be summarized as follows. Information on the consultation, including the agenda, the list of participants, the list of contributions and the final report are available here: http://ec.europa.eu/digital-agenda/node/67533.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

PRESERVING.EXE: Toward a National Strategy for Software Preservation

The National Digital Information Infrastructure and Preservation Program has released PRESERVING.EXE: Toward a National Strategy for Software Preservation.

Here's an excerpt:

A report from the National Digital Information Infrastructure and Preservation Program of the Library of Congress, focused on identifying valuable and at-risk software. Topics covered include executable software preservation, game preservation, electronic literature and ideas for approaches to ensure long-term access.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Data Curation Issues in the Chemical Sciences"

Colin L. Bird, Cerys Willoughby, Simon J. Coles, and Jeremy G. Frey have published "Data Curation Issues in the Chemical Sciences" in a special issue of Information Standards Quarterly on data curation.

Here's an excerpt:

In this article, the authors investigate the extent to which chemists respect the importance of curation in their day-to-day activities in the laboratory or, nowadays, frequently at the computer. They emphasize that an essential ingredient in the curation process is metadata, especially for capturing context, which critical for reproducibility. They also consider the potential roles for librarians and information specialists in assisting with scientific curation, both directly and in training scientists. They assert that curation practice will increasingly emphasize the capture of metadata at the time data and information are created, described as curation at source. Innovative and automated methods for curation will be needed to overcome the perceived burden of curation.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Research Data Management at the University of Warwick: Recent Steps towards a Joined-up Approach at a UK University"

Jenny Delasalle has published "Research Data Management at the University of Warwick: Recent Steps towards a Joined-up Approach at a UK University" in LIBREAS. Library Ideas.

Here's an excerpt:

This paper charts the steps taken and possible ways forward for the University of Warwick in its approach to research data management, providing a typical example of a UK research university's approach in two strands: requirements and support. The UK government approach and funding landscape in relation to research data management provided drivers for the University of Warwick to set requirements and provide support, and examples of good practice at other institutions, support from a central national body (the UK Digital Curation Centre) and learning from other universities' experiences all proved valuable to the University of Warwick. Through interviews with researchers at Warwick, various issues and challenges are revealed: perhaps the biggest immediate challenges for Warwick going forward are overcoming scepticism amongst researchers, overcoming costs, and understanding the implications of involving third party companies in research data management. Building technical infrastructure could sit alongside and beyond those immediate steps and beyond the challenges that face one University are those that affect academia as a whole.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"A Cross Disciplinary Study of Link Decay and the Effectiveness of Mitigation Techniques"

Jason Hennessey and Steven Xijin Ge have published "A Cross Disciplinary Study of Link Decay and the Effectiveness of Mitigation Techniques" in BMC Bioinformatics.

Here's an excerp:

We accessed 14,489 unique web pages found in the abstracts within Thomson Reuters' Web of Science citation index that were published between 1996 and 2010 and found that the median lifespan of these web pages was 9.3 years with 62% of them being archived. Survival analysis and logistic regression were used to find significant predictors of URL lifespan. The availability of a web page is most dependent on the time it is published and the top-level domain names. Similar statistical analysis revealed biases in current solutions: the Internet Archive favors web pages with fewer layers in the Universal Resource Locator (URL) while WebCite is significantly influenced by the source of publication. We also created a prototype for a process to submit web pages to the archives and increased coverage of our list of scientific webpages in the Internet Archive and WebCite by 22% and 255%, respectively.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Metadata is a Love Note to the Future—UK Higher Education Research Data Management (RDM) Survey"

Martin Hamilton has released "Metadata is a Love Note to the Future—UK Higher Education Research Data Management (RDM) Survey."

Here's an excerpt:

I'm delighted to be able to present here the results of our recent survey of the UK Higher Education community's plans for Research Data Management, along with a little initial analysis and an executive summary. To stay true to the spirit of openness, we have made a redacted version of the raw data available, along with our analysis, using the figshare cloud RDM service.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

European Landscape Study of Research Data Management

SURF has released the European Landscape Study of Research Data Management.

Here's an excerpt:

This report presents the results of an online survey to establish which interventions are already being used by funding agencies, research institutions, national bodies and publishers across the European Union member states and a number of countries outside Europe in order to improve the capacity and skills of researchers in making effective use of research data infrastructures. It also makes recommendations that organisations can adopt to help their researchers. . . .

Interviews with researchers indicate that the main drivers for writing a data management plan are requirements by the funder or the publisher. Nearly half of the research funders who took part in the survey have a policy covering research data management, whilst a quarter of the funders require data management plans as part of the grant application. Data management plans should address data acquisition, use, re-use, storage and protection and the rights of ownership. Just over one third of the responding funding organisations designate a specific organisation for preservation, although no term has been identified.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Perma: Scoping and Addressing the Problem of Link and Reference Rot in Legal Citations"

Jonathan Zittrain, Kendra Albert, and Lawrence Lessig have self-archived "Perma: Scoping and Addressing the Problem of Link and Reference Rot in Legal Citations" in SSRN.

Here's an excerpt:

We document a serious problem of reference rot: more than 70% of the URLs within the Harvard Law Review and other journals, and 50% of the URLs found within U.S. Supreme Court opinions do not link to the originally cited information.

Given that, we propose a solution for authors and editors of new scholarship that involves libraries undertaking the distributed, long-term preservation of link contents.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Presentations from Research Data Management Forum 10

Presentations from the Research Data Management Forum 10 are now available.

Here are some representative presentations:

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap