American’ Views on Open Government Data

The Pew Research Center has released American' Views on Open Government Data.

Here's an excerpt:

Few Americans think governments are very effective in sharing data they collect with the public:

  • Just 5% say the federal government does this very effectively, with another 39% saying the federal government does this somewhat effectively.
  • 5% say state governments share data very effectively, with another 44% saying somewhat effectively.
  • 7% say local governments share data very effectively, with another 45% responding somewhat effectively.

Digital Scholarship | Digital Scholarship Sitemap

Preparing the Workforce for Digital Curation

The National Academies Press has released Preparing the Workforce for Digital Curation .

Here's an excerpt from the announcement:

The massive increase in digital information in the last decade has created new requirements arising from a deficit in the institutional and technological structures and the human capital necessary to utilize and sustain the abundance of new digital information. This National Research Council consensus study report focuses on the need for education and training in digital curation to meet the societal demands for access to and meaningful use of digital information, now and in the future. For the purposes of this study, digital curation is defined as: "The active management and enhancement of digital information assets for current and future use." This definition provided the committee with a shared understanding of the scope of digital curation. As discussed below, digital curation entails more than secure storage and preservation of digital information because curation may add value to digital information and increase its utility.

Digital Scholarship | Digital Scholarship Sitemap

"Next Up for Agency Public Access Plans: NOAA"

SPARC has released "Next Up for Agency Public Access Plans: NOAA" by Heather Joseph.

Here's an excerpt:

The National Oceanic and Atmospheric Administration (NOAA) has released its plan to create policies ensuring public access to articles and data resulting from its funded research, as required by the February 2013 White House directive. . . .

The NOAA plan calls for all agency-funded intramural and extramural researchers to deposit final, accepted manuscripts into the agency's repository upon acceptance in a peer-reviewed journal. Unlike many of the other agencies that have released plans to date, NOAA will also require its investigators to submit technical reports, data reports, and technical memoranda into the repository as well—significantly increasing the scope of the materials covered by the agency's policy.

NOAA will use the OSTP-suggested 12-month embargo period as its baseline. Like other agencies, it will provide stakeholders with a mechanism for petitioning the agency to change the embargo period. The plan indicates that requests must include evidence that outweighs the public benefit of having the embargo remain at one year. . . .

Currently, funded researchers are required to make data "visible and accessible" within two years. The new plan calls for this time frame to be shortened to just one year. It also indicates that data underlying the conclusions of peer-reviewed articles will most likely be required to be made available at the time of the article's publication, in appropriate repositories (presumably to be designated by NOAA).

Digital Scholarship | Digital Scholarship Sitemap

"Digital Curation Education and Training: From Digitization to Graduate Curricula to MOOCs"

Helen R. Tibbo has published "Digital Curation Education and Training: From Digitization to Graduate Curricula to MOOCs" in the International Journal of Digital Curation.

Here's an excerpt:

This paper traces the development of digital and data curation curricula. Due to the brief length of this paper, the focus is on North American initiatives and primarily on continuing education programs. It explores the strengths and weaknesses of professional workshops and the creation of graduate-level courses, certificates, degrees and MOOCs, as well as the role of funding agencies in this process. It concludes with an analysis of what is missing and what is needed to create the workforce required to steward digital assets in the foreseeable future

Digital Scholarship | Digital Scholarship Sitemap

"Geographic Information and Technologies in Academic Research Libraries: An ARL Survey of Services and Support"

Ann L Holstein has published "Geographic Information and Technologies in Academic Research Libraries: An ARL Survey of Services and Support" in Information Technology and Libraries.

Here's an excerpt:

One hundred fifteen academic libraries, all current members of the Association of Research Libraries (ARL), were selected to participate in an online survey in an effort to better understand campus use of geographic data and geospatial technologies, and how libraries support these uses. The survey was used to capture information regarding geographic needs of their respective campuses, the array of services they offer, and the education and training of geographic information services department staff members. The survey results, along with review of recent literature, were used to identify changes in geographic information services and support since 1997, when a similar survey was conducted by ARL. This new study has enabled recommendations to be made for building a successful geographic information service center within the campus library that offers a robust and comprehensive service and support model for all geographic information usage on campus.

Digital Scholarship | Digital Scholarship Sitemap

"When Data Sharing Gets Close to 100%: What Human Paleogenetics Can Teach the Open Science Movement"

Paolo Anagnostou et al. have published "When Data Sharing Gets Close to 100%: What Human Paleogenetics Can Teach the Open Science Movement" in .

Here's an excerpt:

This study analyzes data sharing regarding mitochondrial, Y chromosomal and autosomal polymorphisms in a total of 162 papers on ancient human DNA published between 1988 and 2013. The estimated sharing rate was not far from totality (97.6% ± 2.1%) and substantially higher than observed in other fields of genetic research (evolutionary, medical and forensic genetics). Both a questionnaire-based survey and the examination of Journals' editorial policies suggest that this high sharing rate cannot be simply explained by the need to comply with stakeholders requests. Most data were made available through body text, but the use of primary databases increased in coincidence with the introduction of complete mitochondrial and next-generation sequencing methods. Our study highlights three important aspects. First, our results imply that researchers' awareness of the importance of openness and transparency for scientific progress may complement stakeholders' policies in achieving very high sharing rates. Second, widespread data sharing does not necessarily coincide with a prevalent use of practices which maximize data findability, accessibility, useability and preservation. A detailed look at the different ways in which data are released can be very useful to detect failures to adopt the best sharing modalities and understand how to correct them. Third and finally, the case of human paleogenetics tells us that a widespread awareness of the importance of Open Science may be important to build reliable scientific practices even in the presence of complex experimental challenges.

Digital Scholarship | Digital Scholarship Sitemap

Directions for Research Data Management in UK Universities

JISC has released Directions for Research Data Management in UK Universities.

Here's an excerpt:

This report addresses five key topics:

  • Policy development and implementation
  • Skills and capability
  • Infrastructure and interoperability
  • Incentives for researchers and support stakeholders
  • Business case and sustainability

For each topic we have included a summary of the main current issues, alongside a vision of where the sector should aim to be in five years' time. We then suggest actions for each topic, divided into 'first steps' and then longer term, more complex priorities. Readers should note that each of the five topics do raise interrelated actions, for example, a usage statistics service is flagged as a potential infrastructure solution and this issue arises again as an action area that can help to incentivise research data management and sharing.

Digital Scholarship | Digital Scholarship Sitemap

"Storage is a Strategic Issue: Digital Preservation in the Cloud"

Gillian Oliver and Steve Knight have published "Storage is a Strategic Issue: Digital Preservation in the Cloud" in .

Here's an excerpt:

Worldwide, many governments are mandating a 'cloud first' policy for information technology infrastructures. In 2013, the National Library of New Zealand's National Digital Heritage Archive (NDHA) outsourced storage of its digital collections. A case study of the decision to outsource and its consequences was conducted, involving interviews of the representatives of three key stakeholders: IT, the NDHA, and the vendor. Clear benefits were identified by interviewees, together with two main challenges. The challenges related to occupational culture tensions, and a shift in funding models. Interviewees also considered whether the cultural heritage sector had any unique requirements. A key learning was that information managers were at risk of being excluded from the detail of outsourcing, and so needed to be prepared to assert their need to know based on their stewardship mandate.

Digital Scholarship | Digital Scholarship Sitemap

"Research Data Explored II: the Anatomy and Reception of figshare"

Peter Kraker et al. have self-archived "Research Data Explored II: the Anatomy and Reception of figshare."

Here's an excerpt:

In this paper, we present an analysis of figshare, one of the largest multidisciplinary repositories for research materials to date. We analysed the structure of items archived in figshare, their usage, and their reception in two altmetrics sources (PlumX and ImpactStory). We found that figshare acts as a platform for newly published research materials, and as an archive for PLOS. Depending on the function, we found different bibliometric characteristics. Items archived from PLOS tend to be coming from the natural sciences and are often unviewed and non-downloaded. Self-archived items, however, come from a variety of disciplines and exhibit some patterns of higher usage. In the altmetrics analysis, we found that Twitter was the social media service where research data gained most attention; generally, research data published in 2014 were most popular across social media services. PlumX detects considerably more items in social media and also finds higher altmetric scores than ImpactStory.

Digital Scholarship | Digital Scholarship Sitemap

"Joint Declaration of Data Citation Principles"

The Data Citation Synthesis Working Group has released the "Joint Declaration of Data Citation Principles."

Here's an excerpt:

The Data Citation Principles cover purpose, function and attributes of citations. These principles recognize the dual necessity of creating citation practices that are both human understandable and machine-actionable.

See also "An Introduction to the Joint Principles for Data Citation."

Digital Scholarship | Digital Scholarship Sitemap

"A Reputation Economy: Results from an Empirical Survey on Academic Data Sharing"

Benedikt Fecher et al. have self-archived "A Reputation Economy: Results from an Empirical Survey on Academic Data Sharing."

Here's an excerpt:

We surveyed 2661 individual academic researchers across all disciplines on their dealings with data, their publication practices, and motives for sharing or withholding research data. The results for 1564 valid responses show that researchers across disciplines recognise the benefit of secondary research data for their own work and for scientific progress as a whole-still they only practice it in moderation. An explanation for this evidence could be an academic system that is not driven by monetary incentives, nor the desire for scientific progress, but by individual reputation-expressed in (high ranked journal) publications. We label this system a Reputation Economy. This special economy explains our findings that show that researchers have a nuanced idea how to provide adequate formal recognition for making data available to others-namely data citations. We conclude that data sharing will only be widely adopted among research professionals if sharing pays in form of reputation. Thus, policy measures that intend to foster research collaboration need to understand academia as a reputation economy. Successful measures must value intermediate products, such as research data, more highly than it is the case now.

Digital Scholarship | Digital Scholarship Sitemap

"The Educational Value of Truly Interactive Science Publishing"

Michael J. Ackerman has published "The Educational Value of Truly Interactive Science Publishing" in The Journal of Electronic Publishing.

Here's an excerpt:

Interactive Scientific Publishing (ISP) has been developed by the Optical Society of America with support from the National Library of Medicine at NIH. It allows authors to electronically publish papers which are linked to the referenced 2D and 3D original image datasets. These image datasets can then be viewed and analyzed interactively by the reader. ISP provides the software for authors to assemble and link their source data to their publication. But more important is that it provides readers with image viewing and analysis tools. The goal of ISP is to improve learning and understanding of the presented information. This paper describes ISP and its effect on learning and understanding. ISP was shown to have enough educational value that readers were willing to invest in the required set-up and learning phases. The social aspects of data sharing and the enlarged review process may be the hardest obstacles to overcome.

Digital Scholarship | Digital Scholarship Sitemap

Data on the Web Best Practices

W3C has released a draft of Data on the Web Best Practices.

Here's an excerpt:

This document provides best practices related to the publication and usage of data on the Web designed to help support a self-sustaining ecosystem. Data should be discoverable and understandable by humans and machines. Where data is used in some way, whether by the originator of the data or by an external party, such usage should also be discoverable and the efforts of the data publisher recognized. In short, following these best practices will facilitate interaction between publishers and consumers.

Digital Scholarship | Digital Scholarship Sitemap

"Researcher Perspectives on Publication and Peer Review of Data"

John Ernest Kratz and Carly Strasser have published "Researcher Perspectives on Publication and Peer Review of Data" in PLOS ONE.

Here's an excerpt:

Data "publication" seeks to appropriate the prestige of authorship in the peer-reviewed literature to reward researchers who create useful and well-documented datasets. The scholarly communication community has embraced data publication as an incentive to document and share data. But, numerous new and ongoing experiments in implementation have not yet resolved what a data publication should be, when data should be peer-reviewed, or how data peer review should work. While researchers have been surveyed extensively regarding data management and sharing, their perceptions and expectations of data publication are largely unknown. To bring this important yet neglected perspective into the conversation, we surveyed ~ 250 researchers across the sciences and social sciences—asking what expectations "data publication" raises and what features would be useful to evaluate the trustworthiness, evaluate the impact, and enhance the prestige of a data publication. We found that researcher expectations of data publication center on availability, generally through an open database or repository. Few respondents expected published data to be peer-reviewed, but peer-reviewed data enjoyed much greater trust and prestige. The importance of adequate metadata was acknowledged, in that almost all respondents expected data peer review to include evaluation of the data's documentation. Formal citation in the reference list was affirmed by most respondents as the proper way to credit dataset creators. Citation count was viewed as the most useful measure of impact, but download count was seen as nearly as valuable. These results offer practical guidance for data publishers seeking to meet researcher expectations and enhance the value of published data.

Digital Scholarship | Digital Scholarship Sitemap

"Digital Curation and Doctoral Research"

Daisy Abbott has published "Digital Curation and Doctoral Research" in the International Journal of Digital Curation.

Here's an excerpt:

This article considers digital curation in doctoral study and the role of the doctoral supervisor and institution in facilitating students' acquisition of digital curation skills, including some of the potentially problematic expectations of the supervisory relationship with regards to digital curation. Research took the form of an analysis of the current digital curation training landscape, focusing on doctoral study and supervision. This was followed by a survey (n=116) investigating attitudes towards importance, expertise, and responsibilities regarding digital curation. This research confirms that digital curation is considered to be very important within doctoral study but that doctoral supervisors and particularly students consider themselves to be largely unskilled at curation tasks. It provides a detailed picture of curation activity within doctoral study and identifies the areas of most concern. A detailed analysis demonstrates that most of the responsibility for curation is thought to lie with students and that institutions are perceived to have very low responsibility and that individuals tend to over-assign responsibility to themselves. Finally, the research identifies which types of support system for curation are most used and makes suggestions for ways in which students, supervisors, institutions, and others can effectively and efficiently address problematic areas and improve digital curation within doctoral study.

Digital Scholarship | Digital Scholarship Sitemap

"What Factors Influence Where Researchers Deposit their Data? A Survey of Researchers Submitting to Data Repositories"

Shea Swauger and Todd J. Vision have published "What Factors Influence Where Researchers Deposit their Data? A Survey of Researchers Submitting to Data Repositories" in the International Journal of Digital Curation.

Here's an excerpt:

In order to better understand the factors that most influence where researchers deposit their data when they have a choice, we collected survey data from researchers who deposited phylogenetic data in either the TreeBASE or Dryad data repositories. Respondents were asked to rank the relative importance of eight possible factors. We found that factors differed in importance for both TreeBASE and Dryad, and that the rankings differed subtly but significantly between TreeBASE and Dryad users. On average, TreeBASE users ranked the domain specialization of the repository highest, while Dryad users ranked as equal highest their trust in the persistence of the repository and the ease of its data submission process. Interestingly, respondents (particularly Dryad users) were strongly divided as to whether being directed to choose a particular repository by a journal policy or funding agency was among the most or least important factors. Some users reported depositing their data in multiple repositories and archiving their data voluntarily.

Digital Scholarship | Digital Scholarship Sitemap

ERCIM News Special Issue on Scientific Data Sharing and Re-use

ERCIM has released a special issue of ERCIM News on scientific data sharing and re-use.

Here's an excerpt from "Introduction to the Special Theme Scientific Data Sharing and Re-use":

This special issue features a keynote paper from an EU funding organization, an invited paper from a global organization that aims to accelerate and facilitate research data sharing and exchange, an invited paper from a prominent US scientist and an invited paper from a large Australian data organization. The core part of this issue presents several contributions of European researchers that address the different aspects of the data sharing and (re)use problem.

Digital Scholarship | Digital Scholarship Sitemap

"Starting a Research Data Management Program Based in a University Library"

Margaret Henderson and Teresa L. Knott have self-archived "Starting a Research Data Management Program Based in a University Library."

Here's an excerpt:

As the need for research data management grows, many libraries are considering adding data services to help with the research mission of their institution. The Virginia Commonwealth University (VCU) Libraries created a position and hired a director of research data management in September 2013. The position was new to the libraries and the university. With the backing of the library administration, a plan for building relationships with VCU faculty, researchers, students, service and resource providers, including grant administrators, was developed to educate and engage the community in data management plan writing and research data management training.

Digital Scholarship | Digital Scholarship Sitemap

"Analyzing Data Citation Practices According to the Data Citation Index"

Nicolas Robinson-Garcia et al. have self-archived "Analyzing Data Citation Practices According to the Data Citation Index."

Here's an excerpt:

The findings of this study show that data citation practices are far from common in most research fields. Some differences have been reported on the way researchers cite data: while in the areas of Science and Engineering and Technology data sets were the most cited, in Social Sciences and Arts and Humanities data studies play a greater role. 88.1 percent of the records have received no citations, but some repositories show very low uncitedness rates. While data citation practices are rare in most fields, they have expanded in disciplines such as Crystallography or Genomics. We conclude by emphasizing the role that the DCI could play in encouraging the consistent, standardized citation of research data—a role that would enhance its value as a means of following the research process from data collection to publication.

Digital Scholarship | Digital Scholarship Sitemap

"Digital Forensics on A Shoestring: A Case Study from the University of Victoria"

John Durno and Jerry Trofimchuk have published "Digital Forensics on A Shoestring: A Case Study from the University of Victoria" in Code4Lib Journal.

Here's an excerpt:

While much has been written on the increasing importance of digital forensics in archival workflows, most of the literature focuses on theoretical issues or establishing best practices in the abstract. Where case studies exist, most have been written from the perspective of larger organizations with well-resourced digital forensics facilities. However organizations of any size are increasingly likely to receive donations of born-digital material on outdated media, and a need exists for more modest solutions to the problem of acquiring and preserving their contents. This case study outlines the development of a small-scale digital forensics program at the University of Victoria using inexpensive components and open source software, funded by a $2000 research grant from the Canadian Association of Research Libraries (CARL).

Digital Scholarship | Digital Scholarship Sitemap

Policy Recommendations for Open Access to Research Data

The RECODE project has released Policy Recommendations for Open Access to Research Data.

Here's an excerpt:

These policy recommendations are targeted at key stakeholders in the scholarly communication ecosystem, namely research funders, research institutions, data managers, and publishers. They will assist each of the stakeholders in furthering the goals of open access to research data by providing both over-arching and stakeholder-specific recommendations. These function, as suggestions to address and attend to central issues that RECODE identified through the research work.

The current report thus comprises:

  • summary of project findings
  • overarching recommendations
  • targeted policy recommendations for funders, research institutions, data managers, and publishers
  • practical guides for developing policies for funders, research institutions, data managers, and publishers
  • resources to expedite the process of policy development and implementation among stakeholders

Digital Scholarship | Digital Scholarship Sitemap

"Building Data Services from the Ground Up: Strategies and Resources"

Heather L. Coates has published "Building Data Services from the Ground Up: Strategies and Resources" in the Journal of eScience Librarianship.

Here's an excerpt:

There is a scarcity of practical guidance for developing data services in an academic library. Data services, like many areas of research, require the expertise and resources of teams spanning many disciplines. While library professionals are embedded into the teaching activities of our institutions, fewer of us are embedded in research activities occurring across the full life cycle. The significant challenges of managing, preserving, and sharing data for reuse demand that we take a more active role. Providing support for funder data management plans is just one option in the data services landscape. Awareness of the institutional and library culture in which we operate places an emphasis on the importance of relationships. Understanding the various cultures in which our researchers operate is crucial for delivering data services that are relevant and utilized. The goal of this article is to guide data specialists through this landscape by providing key resources and strategies for developing locally relevant services and by pointing to active communities of librarians and researchers tackling the challenges associated with digital research data.

Digital Scholarship | Digital Scholarship Sitemap

"Research Data Management and Libraries: Relationships, Activities, Drivers and Influences"

Stephen Pinfield, Andrew M. Cox, and Jen Smith have published "Research Data Management and Libraries: Relationships, Activities, Drivers and Influences " in PLOS ONE.

Here's an excerpt:

This paper analyses the contribution of academic libraries to research data management (RDM) in the wider institutional context. In particular it: examines the roles and relationships involved in RDM, identifies the main components of an RDM programme, evaluates the major drivers for RDM activities, and analyses the key factors influencing the shape of RDM developments. The study is written from the perspective of library professionals, analysing data from 26 semi-structured interviews of library staff from different UK institutions. This is an early qualitative contribution to the topic complementing existing quantitative and case study approaches. Results show that although libraries are playing a significant role in RDM, there is uncertainty and variation in the relationship with other stakeholders such as IT services and research support offices. Current emphases in RDM programmes are on developments of policies and guidelines, with some early work on technology infrastructures and support services. Drivers for developments include storage, security, quality, compliance, preservation, and sharing with libraries associated most closely with the last three. The paper also highlights a 'jurisdictional' driver in which libraries are claiming a role in this space. A wide range of factors, including governance, resourcing and skills, are identified as influencing ongoing developments. From the analysis, a model is constructed designed to capture the main aspects of an institutional RDM programme. This model helps to clarify the different issues involved in RDM, identifying layers of activity, multiple stakeholders and drivers, and a large number of factors influencing the implementation of any initiative. Institutions may usefully benchmark their activities against the data and model in order to inform ongoing RDM activity.

Digital Scholarship | "A Quarter-Century as an Open Access Publisher"

iPres 2014: Proceedings of the 11th International Conference on Digital Preservation

The International Conference on Digital Preservation has released iPres 2014: Proceedings of the 11th International Conference on Digital Preservation.

Here's an excerpt:

Papers covered a wide array of preservation topics including migration and emulation, file format management, registries and linked data, funding models, education and training, personal archiving and software-based art, web archiving, metadata and persistent identifiers.

Digital Scholarship | "A Quarter-Century as an Open Access Publisher"

2014 Open Data Index

Open Knowledge has published the 2014 Open Data Index.

Here's an excerpt from the announcement:

The Index ranks countries based on the availability and accessibility of information in ten key areas, including government spending, election results, transport timetables, and pollution levels.

The UK topped the 2014 Index retaining its pole position with an overall score of 96%, closely followed by Denmark and then France at number 3 up from 12th last year. Finland comes in 4th while Australia and New Zealand share the 5th place. Impressive results were seen from India at #10 (up from #27) and Latin American countries like Colombia and Uruguay who came in joint 12th.

Digital Scholarship | "A Quarter-Century as an Open Access Publisher"