"Peer Review of Datasets: When, Why, and How"

Matthew S. Mayernik et al. have published "Peer Review of Datasets: When, Why, and How" in the Bulletin of the American Meteorological Society.

Here's an excerpt:

This paper discusses issues related to data peer review, in particular the peer review processes, needs, and challenges related to the following scenarios: 1) Data analyzed in traditional scientific articles, 2) Data articles published in traditional scientific journals, 3) Data submitted to open access data repositories, and 4) Datasets published via articles in data journals.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

U.S. Open Data Action Plan

The White House has released the U.S. Open Data Action Plan.

Here's an excerpt:

The Smithsonian Cooper-Hewitt National Design Museum Collection plans to make all digitized collections metadata public domain, and digitized collection images without copyright or other restriction publicly available at the highest available resolution for non-commercial, educational use. . . .

The Smithsonian Freer Gallery of Art and Arthur M. Sackler Gallery plans to make all digitized collections metadata public domain, and digitized collection images without copyright or other restriction publicly available at the highest available resolution for non-commercial, educational use. . . .

After a successful limited release of an API of the Smithsonian American Art Museum collection and hackathon that resulted in a number of working prototypes, the Smithsonian American Art Museum is planning a staged release, from open metadata, like artist or medium, to an open API of digitized collections images without copyright or other restriction available for non- commercial, educational use.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Big Data: Seizing Opportunities, Preserving Values

The Executive Office of the President has released Big Data: Seizing Opportunities, Preserving Values.

Here's an excerpt:

On January 17, in a speech at the Justice Department about reforming the United States' signals intelligence practices, President Obama tasked his Counselor John Podesta with leading a comprehensive review of the impact big data technologies are having, and will have, on a range of economic, social, and government activities. Podesta was joined in this effort by Secretary of Commerce Penny Pritzker, Secretary of Energy Ernest Moniz, the President's Science Advisor John Holdren, the President's Economic Advisor Jeffrey Zients, and other senior government officials. The President's Council of Advisors for Science & Technology conducted a parallel report to take measure of the underlying technologies. Their findings underpin many of the technological assertions in this report.

This review was conceived as fundamentally a scoping exercise. Over 90 days, the review group engaged with academic experts, industry representatives, privacy advocates, civil rights groups, law enforcement agents, and other government agencies. The White House Office of Science and Technology Policy jointly organized three university conferences, at the Massachusetts Institute of Technology, New York University, and the University of California, Berkeley. The White House Office of Science & Technology Policy also issued a "Request for Information" seeking public comment on issues of big data and privacy and received more than 70 responses. In addition, the WhiteHouse.gov platform was used to conduct an unscientific survey of public attitudes about different uses of big data and various big data technologies. A list of the working group's activities can be found in the Appendix.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

What Drives Academic Data Sharing?

RatSWD has released What Drives Academic Data Sharing?.

Here's an excerpt:

Based on a systematic review of 98 scholarly papers and an empirical survey among 603 secondary data users, we develop a conceptual framework that explains the process of data sharing from the primary researcher’s point of view. We show that this process can be divided into six descriptive categories: Data donor, research organization, research community, norms, data infrastructure, and data recipients. Drawing from our findings, we discuss theoretical implications regarding knowledge creation and dissemination as well as research policy measures to foster academic collaboration.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Data Publication Consensus and Controversies"

F1000Research has released an eprint of "Data Publication Consensus and Controversies."

Here's an excerpt:

As data publication venues proliferate, significant debate continues over formats, processes, and terminology. Here, we present an overview of data publication initiatives underway and the current conversation, highlighting points of consensus and issues still in contention.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

How to Discover Requirements for Research Data Management Services

The DCC and DataONE have released How to Discover Requirements for Research Data Management Services.

Here's an excerpt:

This guide is meant for people whose role involves developing services or tools to support research data management (RDM) and digital curation, whether in a Higher Education Institution or a project working across institutions. Your RDM development role might be embedded with the research groups concerned, or at a more centralised level, such as a library or computing service. You will need a methodical approach to plan, elicit, analyse, document and prioritise a range of users' requirements.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

The Value and Impact of Data Sharing and Curation: A Synthesis of Three Recent Studies of UK Research Data Centres

JISC has released The Value and Impact of Data Sharing and Curation: A Synthesis of Three Recent Studies of UK Research Data Centres.

Here's an excerpt from the announcement:

The data centre studies combined quantitative and qualitative approaches in order to quantify value in economic terms and present other, non-economic, impacts and benefits. Uniquely, the studies cover both users and depositors of data, and we believe the surveys of depositors undertaken are the first of their kind. All three studies show a similar pattern of findings, with data sharing via the data centres having a large measurable impact on research efficiency and on return on investment in the data and services. These findings are important for funders, both for making the economic case for investment in data curation and sharing and research data infrastructure, and for ensuring the sustainability of such research data centres.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Measuring the Value of Research Data: A Citation Analysis of Oceanographic Data Sets"

Christopher W. Belter has published "Measuring the Value of Research Data: A Citation Analysis of Oceanographic Data Sets" in PLOS ONE.

Here's an excerpt:

Evaluation of scientific research is becoming increasingly reliant on publication-based bibliometric indicators, which may result in the devaluation of other scientific activities—such as data curation—that do not necessarily result in the production of scientific publications. This issue may undermine the movement to openly share and cite data sets in scientific publications because researchers are unlikely to devote the effort necessary to curate their research data if they are unlikely to receive credit for doing so. This analysis attempts to demonstrate the bibliometric impact of properly curated and openly accessible data sets by attempting to generate citation counts for three data sets archived at the National Oceanographic Data Center. My findings suggest that all three data sets are highly cited, with estimated citation counts in most cases higher than 99% of all the journal articles published in Oceanography during the same years. I also find that methods of citing and referring to these data sets in scientific publications are highly inconsistent, despite the fact that a formal citation format is suggested for each data set. These findings have important implications for developing a data citation format, encouraging researchers to properly curate their research data, and evaluating the bibliometric impact of individuals and institutions.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Response to Elsevier’s Text and Data Mining Policy: A LIBER Discussion Paper"

LIBER has released "Response to Elsevier's Text and Data Mining Policy: A LIBER Discussion Paper."

Here's an excerpt from the announcement:

LIBER believes that the right to read is the right to mine and that licensing will never bridge the gap in the current copyright framework as it is unscalable and resource intensive. Furthermore, as this discussion paper highlights, licensing has the potential to limit the innovative potential of digital research methods by:

  1. restricting the tools that researchers can use
  2. limiting the way in which research results can be made available
  3. impacting on the transparency and reproducibility of research results.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Exemplar Good Governance Structures and Data Policies

APARSEN has released Exemplar Good Governance Structures and Data Policies.

Here's an excerpt:

This report summarises the level of preparedness for interoperable governance and data policies based on both desktop research on selected data policies and online survey conducted during this study. It is important to understand what current data policies address and if they miss out on important topics, such as specific requirements for data preservation. This will give an indication on the possible impact of such data policies on the individual communities and allows recommendations to be drawn up to guide forthcoming policies. This report concludes with selected recommendations that should be taken into account when drawing up data policies concerning digital preservation.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

PLOS Clarifies Open Data Policy

PLOS has clarified its open data policy.

Here's an excerpt:

In the previous post, and also on our site for PLOS ONE Academic Editors, an attempt to simplify our policy did not represent the policy correctly and we sincerely apologize for that and for the confusion it has caused. We are today correcting that post and hoping it provides the clarity many have been seeking. . . .

Two key things to summarize about the policy are:

  1. The policy does not aim to say anything new about what data types, forms and amounts should be shared.
  2. The policy does aim to make transparent where the data can be found, and says that it shouldn't be just on the authors' own hard drive.

Correction

We have struck out the paragraph in the original PLOS ONE blog post headed "What do we mean by data", as we think it led to much of the confusion. Instead we offer this guidance to authors planning to submit to a PLOS journal.

What data do I need to make available?

We ask you to make available the data underlying the findings in the paper, which would be needed by someone wishing to understand, validate or replicate the work. Our policy has not changed in this regard. What has changed is that we now ask you to say where the data can be found.

As the PLOS data policy applies to all fields in which we publish, we recognize that we'll need to work closely with authors in some subject areas to ensure adherence to the new policy. Some fields have very well established standards and practices around data, while others are still evolving, and we would like to work with any field that is developing data standards. We are aiming to ensure transparency about data availability.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Geospatial Data Stewardship: Key Online Resources

The National Digital Stewardship Alliance has released Geospatial Data Stewardship: Key Online Resources.

Here's an excerpt:

This document lists online resources that highlight key concepts and practices supporting the preservation and stewardship of digital geospatial data and information. GIS practitioners take the initial preservation actions in the decisions they make regarding data creation and management. Librarians, archivists and museum professionals are often called on to support access and the long-term historical and temporal analysis of these same materials. The resources below offer a starting point to methods, tools and approaches across the information lifecycle to assist in understanding current best practices in the stewardship of geospatial data.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"An Introduction to the Coverage of the Data Citation Index (Thomson-Reuters): Disciplines, Document Types and Repositories"

Daniel Torres-Salinas, Alberto Martín-Martín, Enrique Fuente-Gutiérrez have self-archived "An Introduction to the Coverage of the Data Citation Index (Thomson-Reuters): Disciplines, Document Types and Repositories" in arXiv.org.

Here's an excerpt:

In the past years, the movement of data sharing has been enjoying great popularity. Within this context, Thomson Reuters launched at the end of 2012 a new product inside the Web of Knowledge family: the Data Citation Index. The aim of this tool is to enable discovery and access, from a single place, to data from a variety of data repositories from different subject areas and from around the world. In this short note we present some preliminary results from the analysis of the Data Citation Index. Specifically, we address the following issues: discipline coverage, data types present in the database, and repositories that were included at the time of the study.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

PLOS Mandates Immediate Open Access to Article-Related Data

PLOS has mandated that author's provide immediate open access to article-related data upon publication.

Here's an excerpt from the announcement:

In an effort to increase access to this data, we are now revising our data-sharing policy for all PLOS journals: authors must make all data publicly available, without restriction, immediately upon publication of the article. Beginning March 3rd, 2014, all authors who submit to a PLOS journal will be asked to provide a Data Availability Statement, describing where and how others can access each dataset that underlies the findings. This Data Availability Statement will be published on the first page of each article.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Feet on the Ground: A Practical Approach to the Cloud—Nine Things to Consider When Assessing Cloud Storage

AudioVisual Preservation Solutions, has released Feet on the Ground: A Practical Approach to the Cloud—Nine Things to Consider When Assessing Cloud Storage.

Here's an excerpt:

There is no all-in-one solution that will fulfill every archives' needs for preservation storage. Often, cloud storage services fulfill a portion of an organization's larger preservation infrastructure, providing secure back up for preservation copies or supporting delivery of access files from low-latency storage. Vetting and selection is therefore the alignment of organizational and collection needs with the offerings and functionality of a service. This means defining your acceptance criteria for optimal functionality and understanding how a service will fit in your preservation environment.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

APA/C-DAC International Conference on Digital Preservation and Development of Trusted Digital Repositories 2014 Proceedings

The APA/C-DAC International Conference on Digital Preservation and Development of Trusted Digital Repositories 2014 proceedings have been released.

Presentations and session videos are also available.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"E-Science as a Catalyst for Transformational Change in University Research Libraries"

Mary E. Piorun has self-archived her dissertaion "E-Science as a Catalyst for Transformational Change in University Research Libraries."

Here's an excerpt:

Changes in how research is conducted, from the growth of e-science to the emergence of big data, have lead to new opportunities for librarians to become involved in the creation and management of research data, at the same time the duties and responsibilities of university libraries continue to evolve. This study examines those roles related to e-science while exploring the concept of transformational change and leadership issues in bringing about such a change. Using the framework established by Levy and Merry for first- and second-order change, four case studies of libraries whose institutions are members in the Association of Research Libraries (ARL) are developed.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Open Science Win: Johnson & Johnson Clinical Trial Data Sharing Agreement

Johnson & Johnson has announced a clinical trial data sharing agreement with the Yale School of Medicine.

Here's an excerpt from the announcement:

Johnson & Johnson today announced that its subsidiary, Janssen Research and Development, LLC, has entered into a novel agreement with Yale School of Medicine's Open Data Access (YODA) Project that will extend its commitment to sharing clinical trials data to enhance public health and advance science and medicine. Under the agreement, YODA will serve as an independent body to review requests from investigators and physicians seeking access to anonymized clinical trials data from Janssen, the pharmaceutical companies of Johnson & Johnson, and make final decisions on data sharing. This is the first time any company has collaborated with a completely independent third party to review and make decisions regarding every request for clinical data.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Troubleshooting Public Data Archiving: Suggestions to Increase Participation"

Dominique G. Roche et al. have published "Troubleshooting Public Data Archiving: Suggestions to Increase Participation" in PLOS Biology.

Here's an excerpt:

An increasing number of publishers and funding agencies require public data archiving (PDA) in open-access databases. PDA has obvious group benefits for the scientific community, but many researchers are reluctant to share their data publicly because of real or perceived individual costs. Improving participation in PDA will require lowering costs and/or increasing benefits for primary data collectors. Small, simple changes can enhance existing measures to ensure that more scientific data are properly archived and made publicly available: (1) facilitate more flexible embargoes on archived data, (2) encourage communication between data generators and re-users, (3) disclose data re-use ethics, and (4) encourage increased recognition of publicly archived data.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

SJSU School of Library and Information Science Offers Digital Curation Post-Master’s Certificate

The San Jose State University School of Library and Information Science now offers a Digital Curation Post-Master's Certificate option.

Here's an excerpt from the announcement:

Students at the School of Library and Information Science at San José State University (SJSU) can now take courses that prepare them for a career in digital curation. The school recently added a new career pathway in digital curation for its Post-Master's Certificate program students. A similar career pathway will be available starting in fall 2014 for students enrolled in the school's Master of Library and Information Science (MLIS) program.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"It’s the Neoliberalism, Stupid: Why Instrumentalist Arguments for Open Access, Open Data, and Open Science Are Not Enough"

The Impact of Social Science has republished Eric Kansa's "It's the Neoliberalism, Stupid: Why Instrumentalist Arguments for Open Access, Open Data, and Open Science Are Not Enough."

Here's an excerpt:

Neoliberal universities primarily serve the needs of commerce. They need to churn out technically skilled human resources (made desperate for any work by high loads of debt) and easily monetized technical advancements. . . .

How can something so wonderful and right as "openness" further promote Neoliberalism? After all, aren't we the rebels blasting at the exhaust vents of Elsevier's Death Star? But in selling openness to the heads of foundations, businesses, governments and universities, we often end up adopting the tropes of Neoliberalism. As a tactic, that's perfectly reasonable. As a long-term strategy, I think it's doomed.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Research Libraries’ New Role in Research Data Management, Current Trends and Visions in Denmark"

The LIBER Quarterly has released a future article: "Research Libraries' New Role in Research Data Management, Current Trends and Visions in Denmark."

Here's an excerpt:

The first part of this paper presents the findings of a research project carried out under the auspices of DEFF. . . .This paper describes the various paths chosen by individual universities and research institutions, and the background for their strategies of research data management. Among the main reasons for the uneven practices are the lack of a national policy in this field, the different scientific traditions and cultures and the differences in the use and organization of IT-services. The second part of this paper presents perspectives of this development that are of particular relevance to research libraries. As they already curate digital collections and are active in establishing web archives,the research libraries become involved in research and dissemination of knowledge in new ways. This paper gives examples of how The State and University Library's services facilitate research data management with special regard to digitization of research objects, storage, preservation and sharing of research data. This paper concludes that the experience and skills of research libraries make the libraries important partners in a research data management infrastructure.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

A Workflow Model for Curating Research Data in the University of Minnesota Libraries: Report from the 2013 Data Curation Pilot

Lisa R. Johnston has self-archived A Workflow Model for Curating Research Data in the University of Minnesota Libraries: Report from the 2013 Data Curation Pilot.

Here's an excerpt:

The 2013 Data Curation Project set out to test and expand the University Libraries' programmatic and technical capacities to support research data management needs on campus by establishing a fixed-term data curation pilot. This pilot utilized our current suite of services and expertise in the University with the objective of developing a model workflow for curating a variety of types of research data in the Libraries. Specifically, in eight months, this project resulted in 1) a data curation workflow utilizing existing university resources; 2) five pilot research datasets that were solicited, selected, and curated for discovery and reuse in the libraries' digital repository, the University Digital Conservancy, at the persistent URL, http://purl.umn.edu/160292; and 3) and a summary report describing the successes and shortcomings of this approach. This report summarizes the steps taken to curate the datasets in the pilot, faculty needs and reactions to the result, and in addition to the specific dataset treatments, an overall data curation workflow is presented that outlines the steps needed for any dataset. A discussion of this process provides some useful lessons learned. As a result of this project, the University Libraries now hold a more realistic sense of the overall capacities and expertise needed to develop a sustainable data curation service model. Additionally, the Libraries are better prepared to fine-tune and implement selected recommendations from previous assessments and committee reports.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

"Unix Commands and Batch Processing for the Reluctant Librarian or Archivist"

Anthony Cocciolo has published "Unix Commands and Batch Processing for the Reluctant Librarian or Archivist" in the Code4Lib Journal.

Here's an excerpt:

The Unix environment offers librarians and archivists high-quality tools for quickly transforming born-digital and digitized assets, such as resizing videos, creating access copies of digitized photos, and making fair-use reproductions of audio recordings. These tools, such as ffmpeg, lame, sox, and ImageMagick, can apply one or more manipulations to digital assets without the need to manually process individual items, which can be error prone, time consuming, and tedious. This article will provide information on getting started in using the Unix environment to take advantage of these tools for batch processing.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap

Safe to Be Open: Study on the Protection of Research Data and Recommendation for Access And Usage

OpenAIRE has released Safe to Be Open: Study on the Protection of Research Data and Recommendation for Access And Usage.

Here's an excerpt from the announcement:

This study addresses the most important legal issues when implementing an open access e-infrastructure for research data. It examines the legal requirements for different kinds of usage of research data in an open access infrastructure, such as OpenAIREplus, which links them to publications. The existing legal framework regarding potentially relevant intellectual property (IP) rights is analysed from the general European perspective as well as from that of selected EU Member States. Various examples and usage scenarios are used to explain the scope of protection of the potentially relevant IP rights. In addition different licence models are analysed in order to identify the licence that is best suited to the aim of open access, especially in the context of the infrastructure of OpenAIREplus. Based on the outcomes of these analyses, some recommendations to the European legislator as well as data- and e-infrastructure providers are given on improving the rights situation in relation to research data.

Digital Scholarship | Digital Scholarship Publications Overview | Sitemap