Research Data Curation Bibliography
Charles W. Bailey, Jr.
Houston: Digital Scholarship
Version 6: 6/6/2016

Introduction

The Research Data Curation Bibliography includes over 560 selected English-language articles, books, and technical reports that are useful in understanding the curation of digital research data in academic and other research institutions.

The "digital curation" concept is still evolving. In "Digital Curation and Trusted Repositories: Steps toward Success," Christopher A. Lee and Helen R. Tibbo define digital curation as follows:

Digital curation involves selection and appraisal by creators and archivists; evolving provision of intellectual access; redundant storage; data transformations; and, for some materials, a commitment to long-term preservation. Digital curation is stewardship that provides for the reproducibility and re-use of authentic digital data and other digital assets. Development of trustworthy and durable digital repositories; principles of sound metadata creation and capture; use of open standards for file formats and data encoding; and the promotion of information management literacy are all essential to the longevity of digital resources and the success of curation efforts.

The Research Data Curation Bibliography covers topics such as research data creation, acquisition, metadata, repositories, provenance, management, policies, support services, funding agency requirements, peer review, publication, citation, sharing, reuse, and preservation.

This bibliography does not cover digital media works (such as MP3 files), editorials, e-mail messages, interviews, letters to the editor, presentation slides or transcripts, unpublished e-prints, or weblog postings. Coverage of conference papers and technical reports is very selective.

Most sources have been published from January 2009 through May 2016; however, a limited number of earlier key sources are also included. The bibliography includes links to freely available versions of included works. If such versions are unavailable, links to the publishers' descriptions are provided.

Such links, even to publisher versions and versions in disciplinary archives and institutional repositories, are subject to change. URLs may alter without warning (or automatic forwarding) or they may disappear altogether. Inclusion of links to works on authors' personal websites is highly selective. Note that e-prints and published articles may not be identical.

Abstracts are included in this bibliography if a work is under a Creative Commons Attribution License (BY and national/international variations), a Creative Commons public domain dedication (CC0), or a Creative Commons Public Domain Mark and this is clearly indicated in the work (see the "Note on the Inclusion of Abstracts" below for more details). In cases where the license has changed since publication, the most current license is described.

An archive of all versions of the bibliography is available.

For broader coverage of the digital curation literature, see the author's Digital Curation Bibliography: Preservation and Stewardship of Scholarly Works,which presents over 650 English-language articles, books, and technical reports, and the Digital Curation Bibliography: Preservation and Stewardship of Scholarly Works, 2012 Supplement, which presents over 130 additional sources.

Dedication

In memory of Paul Evan Peters (1947-1996), founding Executive Director of the Coalition for Networked Information, whose visionary leadership at the dawn of the Internet era fostered the development of scholarly electronic publishing.

Bibliography

Aalbersberg, IJsbrand Jan, Sophia Atzeni, Hylke Koers, Beate Specker, and Elena Zudilova-Seinstra. "Bringing Digital Science Deep Inside the Scientific Article: The Elsevier Article of the Future Project." LIBER Quarterly 23, no. 4 (2014): 275-299. http://liber.library.uu.nl/index.php/lq/article/view/8446

In 2009, Elsevier introduced the "Article of the Future" project to define an optimal way for the dissemination of science in the digital age, and in this paper we discuss three of its key dimensions. First we discuss interlinking scientific articles and research data stored with domain-specific data repositories—such interlinking is essential to interpret both article and data efficiently and correctly. We then present easy-to-use 3D visualization tools embedded in online articles: a key example of how the digital article format adds value to scientific communication and helps readers to better understand research results. The last topic covered in this paper is automatic enrichment of journal articles through text-mining or other methods. Here we share insights from a recent survey on the question: how can we find a balance between creating valuable contextual links, without sacrificing the high-quality, peer-reviewed status of published articles?

This work is licensed under a Creative Commons Attribution 4.0 License.

Aalbersberg, IJsbrand, Judson Dunham, and Hylke Koers. "Connecting Scientific Articles with Research Data: New Directions in Online Scholarly Publishing." Data Science Journal 12 (2013): WDS235-WDS242. http://datascience.codata.org/articles/abstract/10.2481/dsj.WDS-043/

Researchers across disciplines are increasingly utilizing electronic tools to collect, analyze, and organize data. However, when it comes to publishing their work, there are no common, well-established standards on how to make that data available to other researchers. Consequently, data are often not stored in a consistent manner, making it hard or impossible to find data sets associated with an article—even though such data might be essential to reproduce results or to perform further analysis. Data repositories can play an important role in improving this situation, offering increased visibility, domain-specific coordination, and expert knowledge on data management. As a leading STM publisher, Elsevier is actively pursuing opportunities to establish links between the online scholarly article and data repositories. This helps to increase usage and visibility for both articles and data sets and also adds valuable context to the data. These data-linking efforts tie in with other initiatives at Elsevier to enhance the online article in order to connect with current researchers' workflows and to provide an optimal platform for the communication of science in the digital era.

This work is licensed under a Creative Commons Attribution 3.0 License.

Abrams, Stephen, Patricia Cruse, Carly Strasser, Perry Willet, Geoffrey Boushey, Julia Kochi, Megan Laurance, and Angela Rizk-Jackson. "DataShare: Empowering Researcher Data Curation." International Journal of Digital Curation 9, no. 1 (2014): 110-118. http://www.ijdc.net/index.php/ijdc/article/view/9.1.110/345

Researchers are increasingly being asked to ensure that all products of research activity—not just traditional publications—are preserved and made widely available for study and reuse as a precondition for publication or grant funding, or to conform to disciplinary best practices. In order to conform to these requirements, scholars need effective, easy-to-use tools and services for the long-term curation of their research data. The DataShare service, developed at the University of California, is being used by researchers to: (1) prepare for curation by reviewing best practice recommendations for the acquisition or creation of digital research data; (2) select datasets using intuitive file browsing and drag-and-drop interfaces; (3) describe their data for enhanced discoverability in terms of the DataCite metadata schema; (4) preserve their data by uploading to a public access collection in the UC3 Merritt curation repository; (5) cite their data in terms of persistent and globally-resolvable DOI identifiers; (6) expose their data through registration with well-known abstracting and indexing services and major internet search engines; (7) control the dissemination of their data through enforceable data use agreements; and (8) discover and retrieve datasets of interest through a faceted search and browse environment. Since the widespread adoption of effective data management practices is highly dependent on ease of use and integration into existing individual, institutional, and disciplinary workflows, the emphasis throughout the design and implementation of DataShare is to provide the highest level of curation service with the lowest possible technical barriers to entry by individual researchers. By enabling intuitive, self-service access to data curation functions, DataShare helps to contribute to more widespread adoption of good data curation practices that are critical to open scientific inquiry, discourse, and advancement.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Accomazzi, Alberto, Edwin Henneken, Christopher Erdmann, and Arnold Rots."Telescope Bibliographies: An Essential Component of Archival Data Management and Operations." Proceedings of SPIE 8448 (2012): 84480K-1-84480K-10. http://arxiv.org/abs/1206.6352

Adamick, Jessica, Rebecca C. Reznik-Zellen, and Matt Sheridan. "Data Management Training for Graduate Students at a Large Research University." Journal of eScience Librarianship 1, no. 3 (2013): e1022. http://dx.doi.org/10.7191/jeslib.2012.1022

Adams, Sam, and Peter Murray-Rust. "Chempound—A Web 2.0-Inspired Repository for Physical Science Data." Journal of Digital Information 13, no. 1 (2012). http://journals.tdl.org/jodi/index.php/jodi/article/view/5873

Addison, Aaron, Jennifer Moore, and Cynthia Hudson-Vitale. "Forging Partnerships: Foundations of Geospatial Data Stewardship." Journal of Map & Geography Libraries 11, no. 3 (2015): 359-375. http://dx.doi.org/10.1080/15420353.2015.1054544

Akers, Katherine G. "Going Beyond Data Management Planning: Comprehensive Research Data Services." College & Research Libraries News 75, no. 8 (2014): 435-436. http://crln.acrl.org/content/75/8/435.full

———. "Looking Out for the Little Guy: Small Data Curation." Bulletin of the American Society for Information Science and Technology 39, no. 3 (2013): 58-59. http://www.asis.org/Bulletin/Feb-13/FebMar13_RDAP_Akers.pdf

Akers, Katherine G., and Jennifer Doty. "Differences among Faculty Ranks in Views on Research Data Management." IASSIST Quarterly 36 (2012): 16-20. http://www.iassistdata.org/iq/differences-among-faculty-ranks-views-research-data-management

———. "Disciplinary Differences in Faculty Research Data Management Practices and Perspectives." International Journal of Digital Curation 8, no. 2 (2013): 5-26. http://www.ijdc.net/index.php/ijdc/article/view/8.2.5/332

Academic librarians are increasingly engaging in data curation by providing infrastructure (e.g., institutional repositories) and offering services (e.g., data management plan consultations) to support the management of research data on their campuses. Efforts to develop these resources may benefit from a greater understanding of disciplinary differences in research data management needs. After conducting a survey of data management practices and perspectives at our research university, we categorized faculty members into four research domains—arts and humanities, social sciences, medical sciences, and basic sciences—and analyzed variations in their patterns of survey responses. We found statistically significant differences among the four research domains for nearly every survey item, revealing important disciplinary distinctions in data management actions, attitudes, and interest in support services. Serious consideration of both the similarities and dissimilarities among disciplines will help guide academic librarians and other data curation professionals in developing a range of data-management services that can be tailored to the unique needs of different scholarly researchers.

This work is licensed under a Creative Commons Attribution License.

Akers, Katherine G., and Jennifer A. Green. "Towards a Symbiotic Relationship between Academic Libraries and Disciplinary Data Repositories: A Dryad and University of Michigan Case Study." International Journal of Digital Curation 9, no. 1 (2014): 119-131. http://www.ijdc.net/index.php/ijdc/article/view/9.1.119/346

In addition to encouraging the deposit of research data into institutional data repositories, academic librarians can further support research data sharing by facilitating the deposit of data into external disciplinary data repositories.

In this paper, we focus on the University of Michigan Library and Dryad, a repository for scientific and medical data, as a case study to explore possible forms of partnership between academic libraries and disciplinary data repositories. We found that although few University of Michigan researchers have submitted data to Dryad, many have recently published articles in Dryad-integrated journals, suggesting significant opportunities for Dryad use on our campus. We suggest that academic libraries could promote the sharing and preservation of science and medical data by becoming Dryad members, purchasing vouchers to cover researchers' data submission costs, and hosting local curators who could directly work with campus researchers to improve the accuracy and completeness of data packages and thereby increase their potential for re-use.

By enabling the use of both institutional and disciplinary data repositories, we argue that academic librarians can achieve greater success in capturing the vast amounts of data that presently fail to depart researchers' hands and making that data visible to relevant communities of interest.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Akers, Katherine G., Fe C. Sferdean, Natsuko H. Nicholls, and Jennifer A. Green. "Building Support for Research Data Management: Biographies of Eight Research Universities." International Journal of Digital Curation 9, no. 2 (2014): 171-191. http://www.ijdc.net/index.php/ijdc/article/view/9.2.171/376

Academic research libraries are quickly developing support for research data management (RDM), including both new services and infrastructure. Here, we tell the stories of how eight different universities have developed programs of RDM support, focusing on the prominent role of the library in educating and assisting researchers with managing their data throughout the research lifecycle. Based on these stories, we construct timelines for each university depicting key steps in building support for RDM, and we discuss similarities and dissimilarities among universities in motivation to provide RDM support, collaborations among campus units, assessment of needs and services, and changes in staffing.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Akmon, Dharma, Ann Zimmerman, Morgan Daniels, and Margaret Hedstrom. "The Application of Archival Concepts to a Data-Intensive Environment: Working with Scientists to Understand Data Management and Preservation Needs." Archival Science 11, no. 3/4 (2011): 329-348. http://link.springer.com/article/10.1007%2Fs10502-011-9151-4

Albani, Sergio, and David Giaretta. "Long-Term Preservation of Earth Observation Data and Knowledge in ESA through CASPAR." International Journal of Digital Curation 4, no. 3 (2009): 4-16. http://www.ijdc.net/index.php/ijdc/article/view/130/162

Aleixandre-Benaven, Rafael, Luz María Moreno-Solano, Antonia Ferrer Sapena, and Enrique Alfonso Sánchez Pérez. "Correlation between Impact Factor and Public Availability of Published Research Data in Information Science and Library Science Journals." Scientometrics 107, no. 1 (2016): 1-13. http://dx.doi.org/10.1007/s11192-016-1868-7

Allard, Suzie. "DataONE: Facilitating eScience through Collaboration." Journal of eScience Librarianship 1, no. 1 (2012): e1004. http://dx.doi.org/10.7191/jeslib.2012.1004

Altman, Micah, Margaret O. Adams, Jonathan Crabtree, Darrell Donakowski, Marc Maynard, Amy Pienta, and Copeland H. Young. "Digital Preservation through Archival Collaboration: The Data Preservation Alliance for the Social Sciences." American Archivist 72, no. 1 (2009): 170-184. http://americanarchivist.org/doi/abs/10.17723/aarc.72.1.eu7252lhnrp7h188

Altman, Micah, Christine Borgman, Mercè Crosas, and Maryann Matone. "An Introduction to the Joint Principles for Data Citation." Bulletin of the Association for Information Science and Technology 41, no. 3 (2015): 43-45. https://www.asis.org/Bulletin/Feb-15/FebMar15_RDAP_Altman_EtAl.html

Altman, Micah, Eleni Castro, Mercè Crosas, Philip Durbin, Alex Garnett, and Jen Whitney. "Open Journal Systems and Dataverse Integration—Helping Journals to Upgrade Data Publication for Reusable Research." Code4Lib Journal, no. 30 (2015). http://journal.code4lib.org/articles/10989

This article describes the novel open source tools for open data publication in open access journal workflows. This comprises a plugin for Open Journal Systems that supports a data submission, citation, review, and publication workflow; and an extension to the Dataverse system that provides a standard deposit API. We describe the function and design of these tools, provide examples of their use, and summarize their initial reception. We conclude by discussing future plans and potential impact.

This work is licensed under a Creative Commons Creative Commons Attribution 3.0 United States License.

Altman, Micah, and Mercè Crosas. "The Evolution of Data Citation: From Principles to Implementation" IASSIST Quarterly 37, no. 1-4 (2013): 62-70. http://www.iassistdata.org/iq/evolution-data-citation-principles-implementation

Altman, Micah, and Gary King. "A Proposed Standard for the Scholarly Citation of Quantitative Data." D-Lib Magazine 13, no. 3/4 (2007). http://www.dlib.org/dlib/march07/altman/03altman.html

Anastasiadis, Stergios V., Syam Gadde, and Jeffrey S. Chase. "Scale and Performance in Semantic Storage Management of Data Grids." International Journal on Digital Libraries 5, no. 2 (2005): 84-98. http://link.springer.com/article/10.1007/s00799-004-0086-8

Anderson, W. L. "Some Challenges and Issues in Managing, and Preserving Access to, Long Lived Collections of Digital Scientific and Technical Data." Data Science Journal 3 (2004): 191-201. http://datascience.codata.org/articles/abstract/10.2481/dsj.3.191/

One goal of the Committee on Data for Science and Technology is to solicit information about, promote discussion of, and support action on the many issues related to scientific and technical data preservation, archiving, and access. This brief paper describes four broad categories of issues that help to organize discussion, learning, and action regarding the work needed to support the long-term preservation of, and access to, scientific and technical data. In each category, some specific issues and areas of concern are described.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Andreoli-Versbach, Patrick, and Frank Mueller-Langer. "Open Access to Data: An Ideal Professed but Not Practised." Research Policy 43, no. 9 (2014): 1621-1633. http://dx.doi.org/10.1016/j.respol.2014.04.008

Androulakis, Steve, Ashley M. Buckle, Ian Atkinson, David Groenewegen, Nick Nicholas, Andrew Treloar, and Anthony Beitz. "ARCHER—e-Research Tools for Research Data Management." International Journal of Digital Curation 4, no. 1 (2009): 22-33. http://www.ijdc.net/index.php/ijdc/article/view/99/74

Angevaare, Inge. "Taking Care of Digital Collections and Data: 'Curation' and Organisational Choices for Research Libraries." LIBER Quarterly: The Journal of European Research Libraries 19, no. 1 (2009): 1-12. http://liber.library.uu.nl/index.php/lq/article/view/7948

This article explores the types of digital information research libraries typically deal with and what factors might influence libraries' decisions to take on the work of data curation themselves, to take on the responsibility for data but market out the actual work, or to leave the responsibility to other organisations. The article introduces the issues dealt with in the LIBER Workshop 'Curating Research' to be held in The Hague on 17 April 2009 (http://www.kb.nl/curatingresearch) and this corresponding issue of LIBER Quarterly.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Arora, Ritu, Maria Esteva, and Jessica Trelogan. "Leveraging High Performance Computing for Managing Large and Evolving Data Collections." International Journal of Digital Curation 9, no. 2 (2014): 17-27. http://www.ijdc.net/index.php/ijdc/article/view/9.2.17/366

The process of developing a digital collection in the context of a research project often involves a pipeline pattern during which data growth, data types, and data authenticity need to be assessed iteratively in relation to the different research steps and in the interest of archiving. Throughout a project's lifecycle curators organize newly generated data while cleaning and integrating legacy data when it exists, and deciding what data will be preserved for the long term. Although these actions should be part of a well-oiled data management workflow, there are practical challenges in doing so if the collection is very large and heterogeneous, or is accessed by several researchers contemporaneously. There is a need for data management solutions that can help curators with efficient and on-demand analyses of their collection so that they remain well-informed about its evolving characteristics. In this paper, we describe our efforts towards developing a workflow to leverage open science High Performance Computing (HPC) resources for routinely and efficiently conducting data management tasks on large collections. We demonstrate that HPC resources and techniques can significantly reduce the time for accomplishing critical data management tasks, and enable a dynamic archiving throughout the research process. We use a large archaeological data collection with a long and complex formation history as our test case. We share our experiences in adopting open science HPC resources for large-scale data management, which entails understanding usage of the open source HPC environment and training users. These experiences can be generalized to meet the needs of other data curators working with large collections.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Aschenbrenner, Andreas, Harry Enke, Thomas Fischer, and Jens Ludwig. "Diversity and Interoperability of Repositories in a Grid Curation Environment." Journal of Digital Information 12, no. 2 (2011). http://journals.tdl.org/jodi/index.php/jodi/article/view/1896

Asher, Andrew, and Lori M. Jahnke. "Curating the Ethnographic Moment." Archive Journal, no. 3 (2013). http://www.archivejournal.net/issue/3/archives-remixed/curating-the-ethnographic-moment/

Ashley, Kevin. "Data Quality and Curation." Data Science Journal 12 (2013): GRDI65-GRDI68. http://datascience.codata.org/articles/abstract/10.2481/dsj.GRDI-011/

Data quality is an issue that touches on every aspect of the research data landscape and is therefore appropriate to examine in the context of planning for future research data infrastructures. As producers, researchers want to believe that they produce high quality data; as consumers, they want to obtain data of the highest quality. Data centres typically have stringent controls to ensure that they only acquire and disseminate data of the highest quality. Data managers will usually say that they improve the quality of the data they are responsible for. Much of the infrastructure that will emit, transform, integrate, visualise, manage, analyse, and disseminate data during its life will have dependencies, explicit or implicit, on the quality of the data it is dealing with.

This work is licensed under a Creative Commons Attribution 4.0 International License.

——— "Research Data And Libraries: Who Does What." Insights: the UKSG Journal 25, no. 2 (2012): 155-157. http://insights.uksg.org/articles/10.1629/2048-7754.25.2.155/

A range of external pressures are causing research data management (RDM) to be an increasing concern at senior level in universities and other research institutions. But as well as external pressures, there are also good reasons for establishing effective research data management services within institutions which can bring benefits to researchers, their institutions and those who publish their research. In this article some of these motivating factors, both positive and negative, are described. Ways in which libraries can play a role—or even lead—in the development of RDM services that work within the institution and as part of a national and international research data infrastructure are also set out.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Assante, Massimiliano, Leonardo Candela, Donatella Castelli, and Alice Tani. "Are Scientific Data Repositories Coping with Research Data Publishing?" Data Science Journal 15, no. 6 (2016): 1-24. http://doi.org/10.5334/dsj-2016-006

Research data publishing is intended as the release of research data to make it possible for practitioners to (re)use them according to "open science" dynamics. There are three main actors called to deal with research data publishing practices: researchers, publishers, and data repositories. This study analyses the solutions offered by generalist scientific data repositories, i.e., repositories supporting the deposition of any type of research data. These repositories cannot make any assumption on the application domain. They are actually called to face with the almost open ended typologies of data used in science. The current practices promoted by such repositories are analysed with respect to eight key aspects of data publishing, i.e., dataset formatting, documentation, licensing, publication costs, validation, availability, discovery and access, and citation. From this analysis it emerges that these repositories implement well consolidated practices and pragmatic solutions for literature repositories. These practices and solutions can not totally meet the needs of management and use of datasets resources, especially in a context where rapid technological changes continuously open new exploitation prospects.

This work is licensed under a Attribution 4.0 International License.

Bache, Richard, Simon Miles, Bolaji Coker, and Adel Taweel. "Informative Provenance for Repurposed Data: A Case Study using Clinical Research Data." International Journal of Digital Curation 8, no. 2 (2013): 27-46. http://www.ijdc.net/index.php/ijdc/article/view/8.2.27/333

The task repurposing of heterogeneous, distributed data for originally unintended research objectives is a non-trivial problem because the mappings required may not be precise. A particular case is clinical data collected for patient care being used for medical research. The fact that research repositories will record data differently means that assumptions must be made as how to transform of this data. Records of provenance that document how this process has taken place will enable users of the data warehouse to utilise the data appropriately and ensure that future data added from another source is transformed using comparable assumptions. For a provenance-based approach to be reusable and supportable with software tools, the provenance records must use a well-defined model of the transformation process. In this paper, we propose such a model, including a classification of the individual 'sub-functions' that make up the overall transformation. This model enables meaningful provenance data to be generated automatically. A case study is used to illustrate this approach and an initial classification of transformations that alter the information is created.

This work is licensed under a Creative Commons Attribution License.

Baker, Karen S., Ruth E. Duerr, and Mark A. Parsons. "Scientific Knowledge Mobilization: Co-evolution of Data Products and Designated Communities." International Journal of Digital Curation 10, no. 2 (2015): 110-135. http://www.ijdc.net/index.php/ijdc/article/view/10.2.110

Digital data are accumulating rapidly, yet issues relating to data production remain unexamined. Data sharing efforts in particular are nascent, disunited and incomplete. We investigate the development of data products tailored for diverse communities with differing knowledge bases. We explore not the technical aspects of how, why, or where data are made available, but rather the socio-scientific aspects influencing what data products are created and made available for use. These products differ from compact data summaries often published in journals. We report on development by a national data center of two data collections describing the changing polar environment. One collection characterizes sea ice products derived from satellite remote sensing data and development unfolds over three decades. The second collection characterizes the Greenland Ice Sheet melt where development of an initial collection of data products over a period of several months was informed by insights gained from earlier experience. In documenting the generation of these two collections, a data product development cycle supported by a data product team is identified as key to mobilizing scientific knowledge. The collections reveal a co-evolution of data products and designated communities where community interest may be triggered by events such as environmental disturbance and new modes of communication. These examples of data product development in practice illustrate knowledge mobilization in the earth sciences; the collections create a bridge between data producers and a growing number of audiences interested in making evidence-based decisions.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Baker, Karen S., and Lynn Yarmey. "Data Stewardship: Environmental Data Curation and a Web-of-Repositories." International Journal of Digital Curation 4, no. 2 (2009): 12-27. http://www.ijdc.net/index.php/ijdc/article/view/115/118

Balkestein, Marjan, and Heiko Tjalsma. "The ADA Approach: Retro-archiving Data in an Academic Environment." Archival Science 7, no. 1 (2007): 89-105. http://www.springerlink.com/content/r781021038425155/

Ball, Alexander, Kevin Ashley, Patrick McCann, Laura Molloy, and Veerle Van den Eynden. "Show Me The Data: The Pilot UK Research Data Registry." International Journal of Digital Curation 9, no. 1 (2014): 132-141. http://www.ijdc.net/index.php/ijdc/article/view/9.1.132/347

The UK Research Data (Metadata) Registry (UKRDR) pilot project is implementing a prototype registry for the UK's research data assets, enabling the holdings of subject-based data centres and institutional data repositories alike to be searched from a single location. The purpose of the prototype is to prove the concept of the registry, and uncover challenges that will need to be addressed if and when the registry is developed into a sustainable service. The prototype is being tested using metadata records harvested from nine UK data centres and the data repositories of nine UK universities.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Ball, Alexander, Sean Chen, Jane Greenberg, Cristina Perez, Keith Jeffery, and Rebecca Koskela. "Building a Disciplinary Metadata Standards Directory." International Journal of Digital Curation 9, no. 1 (2014): 142-151. http://www.ijdc.net/index.php/ijdc/article/view/9.1.142/348

The Research Data Alliance (RDA) Metadata Standards Directory Working Group (MSDWG) is building a directory of descriptive, discipline-specific metadata standards. The purpose of the directory is to promote the discovery, access and use of such standards, thereby improving the state of research data interoperability and reducing duplicative standards development work.

This work builds upon the UK Digital Curation Centre's Disciplinary Metadata Catalogue, a resource created with much the same aim in mind. The first stage of the MSDWG's work was to update and extend the information contained in the catalogue. In the current, second stage, a new platform is being developed in order to extend the functionality of the directory beyond that of the catalogue, and to make it easier to maintain and sustain. Future work will include making the directory more amenable to use by automated tools.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Ball, Alexander, Mansur Darlington, Thomas Howard, Chris McMahon, and Steve Culley. "Visualizing Research Data Records for Their Better Management." Journal of Digital Information 13, no. 1 (2012). http://journals.tdl.org/jodi/index.php/jodi/article/view/5917

Ball, Joanna. "Research Data Management for Libraries: Getting Started." Insights: The UKSG journal 26, no. 3 (2013): 256-260. http://insights.uksg.org/articles/10.1629/2048-7754.70/

Many libraries are keen to take on new roles in providing support for effective research data management (RDM), but lack the necessary skills and resources to do so. This article explores the approach used by the University of Sussex to engage with academic departments about their RDM practices and requirements in order to develop relevant library support services. It describes a project undertaken with three Academic Schools to inform a list of recommendations for senior management, to include areas which should be taken forward by the Library, IT and Research Office in order to create a sustainable RDM service. The article is unflinchingly honest in sharing the differing reactions to the project and the lessons learnt along the way.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Barateiro, José, Gonçalo Antunes, Manuel Cabral, José Borbinha, and Rodrigo Rodrigues. "Digital Preservation of Scientific Data." Lecture Notes in Computer Science 5173 (2008): 388-391. http://www.springerlink.com/content/x841w58j7535567x/

———. "Using a Grid for Digital Preservation." Lecture Notes in Computer Science 5362 (2008): 225-235. http://www.springerlink.com/content/k71v8x6081738x18/

Bardi, Alessia, and Paolo Manghi. "Enhanced Publications: Data Models and Information Systems." LIBER Quarterly 23, no. 4 (2014): 240-273. http://liber.library.uu.nl/index.php/lq/article/view/8445

"Enhanced publications" are commonly intended as digital publications that consist of a mandatory narrative part (the description of the research conducted) plus related "parts", such as datasets, other publications, images, tables, workflows, devices. The state-of-the-art on information systems for enhanced publications has today reached the point where some kind of common understanding is required, in order to provide the methodology and language for scientists to compare, analyse, or simply discuss the multitude of solutions in the field. In this paper, we thoroughly examined the literature with a two-fold aim: firstly, introducing the terminology required to describe and compare structural and semantic features of existing enhanced publication data models; secondly, proposing a classification of enhanced publication information systems based on their main functional goals.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Bardyn, Tania P., Taryn Resnick, and Susan K. Camina. "Translational Researchers' Perceptions of Data Management Practices and Data Curation Needs: Findings from a Focus Group in an Academic Health Sciences Library." Journal of Web Librarianship 6, no. 4 (2012): 274-287. http://www.tandfonline.com/doi/full/10.1080/19322909.2012.730375

Baru, Chaitanya. "Sharing and Caring of eScience Data." International Journal on Digital Libraries 7, no. 1/2 (2007): 113-116. http://link.springer.com/article/10.1007/s00799-007-0029-2

Baykoucheva, Svetla. Managing Scientific Information and Research Data. Elsevier: Waltham, MA, 2015. http://store.elsevier.com/Managing-Scientific-Information-and-Research-Data/Svetla-Baykoucheva/isbn-9780081001950/

Beagrie, Neil, Robert Beagrie, and Ian Rowlands. "Research Data Preservation and Access: The Views of Researchers." Ariadne, no. 60 (2009). http://www.ariadne.ac.uk/issue60/beagrie-et-al

Beagrie, Neil, Julia Chruszcz, and Brian Lavoie. Keeping Research Data Safe: A Cost Model and Guidance for UK Universities. London: JISC, 2008. http://www.jisc.ac.uk/media/documents/publications/keepingresearchdatasafe0408.pdf

Beagrie, Neil, and John Houghton. The Value and Impact of Data Sharing and Curation: A Synthesis of Three Recent Studies of UK Research Data Centres. London: JISC, 2014. http://repository.jisc.ac.uk/5568/1/iDF308_-_Digital_Infrastructure_Directions_Report%2C_Jan14_v1-04.pdf

Beale, Gareth, and Hembo Pagi. Datapool Imaging Case Study: Final Report. Southampton: University of Southampton, 2013. http://eprints.soton.ac.uk/id/eprint/350738

Beckett, Mark G., Chris R. Allton, Christine T. H. Davies, Ilan Davis, Jonathan M. Flynn, Eilidh J. Grant, Russell S. Hamilton, Alan C. Irving, R. D. Kenway, Radoslaw H. Ostrowski, James T. Perry, Jason R. Swedlow, and Arthur Trew. "Building a Scientific Data Grid with DiGS." Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 367, no. 1897 (2009): 2471-2481. http://rsta.royalsocietypublishing.org/content/367/1897/2471.abstract

Belter, Christopher W. "Measuring the Value of Research Data: A Citation Analysis of Oceanographic Data Sets." PLoS ONE 9, no. 3 (2014): e92590. http://dx.doi.org/10.1371/journal.pone.0092590

Evaluation of scientific research is becoming increasingly reliant on publication-based bibliometric indicators, which may result in the devaluation of other scientific activities—such as data curation—that do not necessarily result in the production of scientific publications. This issue may undermine the movement to openly share and cite data sets in scientific publications because researchers are unlikely to devote the effort necessary to curate their research data if they are unlikely to receive credit for doing so. This analysis attempts to demonstrate the bibliometric impact of properly curated and openly accessible data sets by attempting to generate citation counts for three data sets archived at the National Oceanographic Data Center. My findings suggest that all three data sets are highly cited, with estimated citation counts in most cases higher than 99% of all the journal articles published in Oceanography during the same years. I also find that methods of citing and referring to these data sets in scientific publications are highly inconsistent, despite the fact that a formal citation format is suggested for each data set. These findings have important implications for developing a data citation format, encouraging researchers to properly curate their research data, and evaluating the bibliometric impact of individuals and institutions.

This work is licensed under a Creative Commons Public Domain Dedication.

Bender, Stefam, and Jorg Heining. "The Research-Data-Centre in Research-Data-Centre Approach: A First Step towards Decentralised International Data Sharing." IASSIST Quarterly 35, no. 3 (2011): 10-16. http://www.iassistdata.org/iq/research-data-centre-research-data-centre-approach-first-step-towards-decentralised-international

Berman, Francine. "Got Data? A Guide to Data Preservation in the Information Age." Communications of the ACM 51, no. 12 (2008): 50-56. http://cacm.acm.org/magazines/2008/12/3360-got-data-a-guide-to-data-preservation-in-the-information-age/abstract

Bethune, Alec, Butch Lazorchak, and Zsolt Nagy. "GeoMAPP: A Geospatial Multistate Archive and Preservation Partnership." Journal of Map & Geography Libraries 6, no. 1 (2009): 45-56. http://www.tandfonline.com/doi/abs/10.1080/15420350903432630

Bird, Colin L., Cerys Willoughby, Simon J. Coles, and Jeremy G. Frey. "Data Curation Issues in the Chemical Sciences." Information Standards Quarterly 25, no. 3 (2013): 4-12. http://www.niso.org/publications/isq/2013/v25no3/bird

Bishoff, Carolyn, and Lisa Johnston. "Approaches to Data Sharing: An Analysis of NSF Data Management Plans from a Large Research University." Journal of Librarianship and Scholarly Communication 3, no. 2 (2015): eP1231. http://doi.org/10.7710/2162-3309.1231

INTRODUCTION Sharing digital research data is increasingly common, propelled by funding requirements, journal publishers, local campus policies, or community-driven expectations of more collaborative and interdisciplinary research environments. However, it is not well understood how researchers are addressing these expectations and whether they are transitioning from individualized practices to more thoughtful and potentially public approaches to data sharing that will enable reuse of their data. METHODS The University of Minnesota Libraries conducted a local opt-in study of data management plans (DMPs) included in funded National Science Foundation (NSF) grant proposals from January 2011 through June 2014. In order to understand the current data management and sharing practices of campus researchers, we solicited, coded, and analyzed 182 DMPs, accounting for 41% of the total number of plans available. RESULTS DMPs from seven colleges and academic units were included. The College of Science of Engineering accounted for 70% of the plans in our review. While 96% of DMPs mentioned data sharing, we found a variety of approaches for how PIs shared their data, where data was shared, the intended audiences for sharing, and practices for ensuring long-term reuse. CONCLUSION DMPs are useful tools to investigate researchers' current plans and philosophies for how research outputs might be shared. Plans and strategies for data sharing are inconsistent across this sample, and researchers need to better understand what kind of sharing constitutes public access. More intervention is needed to ensure that researchers implement the sharing provisions in their plans to the fullest extent possible. These findings will help academic libraries develop practical, targeted data services for researchers that aim to increase the impact of institutional research.

This work is licensed under a Creative Commons Attribution 4.0 License.

Bishop, Bradley Wade, Tony H. Grubesic, and Sonya Prasertong. "Digital Curation and the GeoWeb: An Emerging Role for Geographic Information Librarians." Journal of Map & Geography Libraries: Advances in Geospatial Information, Collections & Archives 9, no. 3 (2013): 296-312. http://www.tandfonline.com/doi/full/10.1080/15420353.2013.817367

Borgman, Christine L. "The Conundrum of Sharing Research Data." Journal of the American Society for Information Science and Technology 63, no. 6 (2012): 1059-1078. http://ssrn.com/abstract=1869155

Borgman, Christine L., Jillian C. Wallis, and Noel Enyedy. "Little Science Confronts the Data Deluge: Habitat Ecology, Embedded Sensor Networks, and Digital Libraries." International Journal on Digital Libraries 7, no. 1 (2007): 17-30. http://escholarship.org/uc/item/6fs4559s

Borgman, Christine L., Jillian C. Wallis, and Matthew S. Mayernik. "Who's Got the Data? Interdependencies in Science and Technology Collaborations." Computer Supported Cooperative Work 21, no. 6 (2012): 485-523. https://works.bepress.com/borgman/260/

Bracke, Marianne Stowell. "Emerging Data Curation Roles for Librarians: A Case Study of Agricultural Data." Journal of Agricultural & Food Information 12, no. 1 (2011): 65-74. http://www.tandfonline.com/doi/abs/10.1080/10496505.2011.539158

Bradić-Martinović, Aleksandra, and Aleksandar Zdravković. "Researchers' Interest in Data Service in Bosnia and Herzegovina, Croatia, and Serbia." IASSIST Quarterly 38, no. 2 (2014): 22-28. http://www.iassistdata.org/sites/default/files/iqvol38_2_martinovic.pdf

Brandt, D. Scott, and Eugenia Kim. "Data Curation Profiles as a Means to Explore Managing, Sharing, Disseminating or Preserving Digital Outcomes." International Journal of Performance Arts and Digital Media 10, no. 1 (2014): 21-34. http://www.tandfonline.com/doi/full/10.1080/14794713.2014.912498

Bresnahan, Megan M., and Andrew M. Johnson. "Assessing Scholarly Communication and Research Data Training Needs." Reference Services Review 41, no. 3 (2013): 413-433. http://www.emeraldinsight.com/doi/abs/10.1108/RSR-01-2013-0003

Brewerton, Gary. "Research Data Management: A Case Study." Ariadne, no. 74 (2015). http://www.ariadne.ac.uk/issue74/brewerton

Briney, Kristin. Data Management for Researchers: Organize, Maintain and Share Your Data for Research Success Pelagic Publishing, 2015. http://www.pelagicpublishing.com/data-management-for-researchers.html

Briney, Kristin, Abigail Goben, and Lisa Zilinski. "Do You Have an Institutional Data Policy? A Review of the Current Landscape of Library Data Services and Institutional Data Policies." Journal of Librarianship and Scholarly Communication 3, no. 2 (2015): eP1232. http://doi.org/10.7710/2162-3309.1232

INTRODUCTION Many research institutions have developed research data services in their libraries, often in anticipation of or in response to funder policy. However, policies at the institution level are either not well known or nonexistent. METHODS This study reviewed library data services efforts and institutional data policies of 206 American universities, drawn from the July 2014 Carnegie list of universities with "Very High" or "High" research activity designation. Twenty-four different characteristics relating to university type, library data services, policy type, and policy contents were examined. RESULTS The study has uncovered findings surrounding library data services, institutional data policies, and content within the policies. DISCUSSION Overall, there is a general trend toward the development and implementation of data services within the university libraries. Interestingly, just under half of the universities examined had a policy of some sort that either specified or mentioned research data. Many of these were standalone data policies, while others were intellectual property policies that included research data. When data policies were discoverable, not behind a log in, they focused on the definition of research data, data ownership, data retention, and terms surrounding the separation of a researcher from the institution. CONCLUSION By becoming well versed on research data policies, librarians can provide support for researchers by navigating the policies at their institutions, facilitating the activities needed to comply with the requirements of research funders and publishers. This puts academic libraries in a unique position to provide insight and guidance in the development and revisions of institutional data policies.

This work is licensed under a Creative Commons Attribution 4.0 License.

Broeder, Daan, and Laurence Lannom. "Data Type Registries: A Research Data Alliance Working Group." D-Lib Magazine 20, no. 1/2 (2014). http://www.dlib.org/dlib/january14/broeder/01broeder.html

Brownlee, Rowan. "Research Data and Repository Metadata: Policy and Technical Issues at the University of Sydney Library." Cataloging & Classification Quarterly 47, no. 3/4 (2009): 370-379. http://hdl.handle.net/2123/4996

Burton, A., D. Groenewegen, C. Love, A. Treloar, and R. Wilkinson. "Making Research Data Available in Australia." Intelligent Systems 27, no. 3 (2012): 40-43. http://doi.ieeecomputersociety.org/10.1109/MIS.2012.57

Burton, Adrian, and Andrew Treloar. "Designing for Discovery and Re-use: The 'ANDS Data Sharing Verbs' Approach to Service Decomposition." International Journal of Digital Curation 4, no. 3 (2009): 44-56. http://www.ijdc.net/index.php/ijdc/article/view/133/172

Buys, Cunera M., and Pamela L. Shaw. "Data Management Practices Across an Institution: Survey and Report." Journal of Librarianship and Scholarly Communication 3, no. 2 (2015): eP1225. http://doi.org/10.7710/2162-3309.1225

INTRODUCTION Data management is becoming increasingly important to researchers in all fields. The E-Science Working Group designed a survey to investigate how researchers at Northwestern University currently manage data and to help determine their future needs regarding data management. METHODS A 21-question survey was distributed to approximately 12,940 faculty, graduate students, postdoctoral candidates, and selected research-affiliated staff at Northwestern's Evanston and Chicago Campuses. Survey questions solicited information regarding types and size of data, current and future needs for data storage, data retention and data sharing, what researchers are doing (or not doing) regarding data management planning, and types of training or assistance needed. There were 831 responses and 788 respondents completed the survey, for a response rate of approximately 6.4%. RESULTS Survey results indicate investigators need both short and long term storage and preservation solutions. However, 31% of respondents did not know how much storage they will require. This means that establishing a correctly sized research storage service will be difficult. Additionally, research data is stored on local hard drives, departmental servers or equipment hard drives. These types of storage solutions limit data sharing and long term preservation. Data sharing tends to occur within a research group or with collaborators prior to publication, expanding to more public availability after publication. Survey responses also indicate a need to provide increased consulting and support services, most notably for data management planning, awareness of regulatory requirements, and use of research software.

This work is licensed under a Creative Commons Attribution 4.0 License.

Byatt, Dorothy, Federico De Luca, Harry Gibbs, Meriel Patrick, Sally Rumsey, and Wendy White. Supporting Researchers with Their Research Data Management: Professional Service Training Requirements—A DataPool Project Report. Southampton, UK: University of Southampton, 2013. http://eprints.soton.ac.uk/id/eprint/352107

Through the JISC funded Institutional Research Management Blueprint Project (IDMB) the University of Southampton developed its 10 year blueprint (Brown et al, 2011) for building the required infrastructure. It did this by investigating what researchers were currently doing with their data and what they thought they required. As well as the blueprint, the IDMB project also developed a draft research data management policy to underpin this work. In DataPool: Engaging with our Research Data Management Policy White & Brown (2013) detail how this draft policy was refined and approved. The policy on its own is insufficient but is an important step in enabling the development of the supporting infrastructure, both technological and personnel. The training strand of the DataPool project included an assessment of professional development requirements for staff supporting researchers in managing their data throughout the research life cycle. This report will focus on the investigation undertaken to assess the level of expertise in the relevant support staff groups, identify the training needs of those staff and consider what networks need to be developed to enable collaborative support of researchers in the area of research data management. It will report on the results of the survey carried out at the University of Southampton.

This work is licensed under a Creative Commons Attribution 2.5 Generic License.

Byatt, Dorothy, Mark Scott, Gareth Beale, Simon J. Cox, and Wendy White. Developing Researcher Skills in Research Data Management: Training for the Future—A DataPool Project Report. Southampton, UK: University of Southampton, 2013. http://eprints.soton.ac.uk/351026/

This report will look at the multi-level approach to developing researcher skills in research data management in the University of Southampton, developed as part of the training strand of the JISC DataPool project, and embedded into the University engagement with research data management. It will look at how:

  • the multi-level approach to research data management training provides opportunities for cross- and multi-disciplinary sharing events as well as bespoke subject specific sessions;
  • co-delivery with active researchers and/or other professional support services benefits the presentation and relevance of the material to the researchers;
  • focussing the event and matching content to the expected audience is key;
  • using the Institutional Data Management Blueprint dual approach of bottom-up (researchers needs)/top-down (institutional policies and infrastructure) worked

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Byatt, Dorothy, and Wendy White. Research Data Management Planning, Guidance and Support: A DataPool Project Report. Southampton: University of Southampton, 2013. http://eprints.soton.ac.uk/id/eprint/351027

This report will review the development of research data management support in the University of Southampton following the approval of its research data management policy in February 2012. Wendy White (2013) in her report DataPool: Engaging with our Research Data Management Policy discusses the rationale and approach to the development of the policy. This report will look at the development of the research data management web pages, including the supporting policy guidance, and then will focus on the ResearchData@soton email, phone and desk side service launched to provide research data support to the University.

This work is licensed under a Creative Commons Attribution 2.5 Generic License.

Callaghan, Sarah. "Preserving the Integrity of the Scientific Record: Data Citation and Linking." Learned Publishing 27, no. 5 (2014): 15-24. http://www.ingentaconnect.com/content/alpsp/lp/2014/00000027/00000005/art00004

Callaghan, Sarah, Steve Donegan, Sam Pepler, Mark Thorley, Nathan Cunningham, Peter Kirsch, Linda Ault, Patrick Bell, Rod Bowie, Adam Leadbetter, Roy Lowry, Gwen Moncoiffé, Kate Harrison, Ben Smith-Haddon, Anita Weatherby, and Dan Wright. "Making Data a First Class Scientific Output: Data Citation and Publication by NERC's Environmental Data Centres." International Journal of Digital Curation 7, no. 1 (2012): 107-113. http://www.ijdc.net/index.php/ijdc/article/view/208/277

The NERC Science Information Strategy Data Citation and Publication project aims to develop and formalise a method for formally citing and publishing the datasets stored in its environmental data centres. It is believed that this will act as an incentive for scientists, who often invest a great deal of effort in creating datasets, to submit their data to a suitable data repository where it can properly be archived and curated. Data citation and publication will also provide a mechanism for data producers to receive credit for their work, thereby encouraging them to share their data more freely.

This work is licensed under a Creative Commons Attribution License.

Callaghan, Sarah, Jonathan Tedds, John Kunze, Varsha Khodiyar, Rebecca Lawrence, Matthew S. Mayernik, Fiona Murphy, Timothy Roberts, and Angus Whyte."Guidelines on Recommending Data Repositories as Partners in Publishing Research Data." International Journal of Digital Curation 9, no. 1 (2014): 152-163. http://www.ijdc.net/index.php/ijdc/article/view/9.1.152/349

This document summarises guidelines produced by the UK Jisc-funded PREPARDE data publication project on the key issues of repository accreditation. It aims to lay out the principles and the requirements for data repositories intent on providing a dataset as part of the research record and as part of a research publication. The data publication requirements that repository accreditation may support are rapidly changing, hence this paper is intended as a provocation for further discussion and development in the future.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Candela, Leonardo, Donatella Castelli, Paolo Manghi, and Alice Tani. "Data Journals: A Survey." Journal of the Association for Information Science and Technology 66, no. 9 (2015): 1747-1762. http://dx.doi.org/10.1002/asi.23358

Carlson, Jake. "Demystifying the Data Interview: Developing a Foundation for Reference Librarians to Talk with Researchers about Their Data." Reference Services Review 40, no. 1 (2012): 7-23. http://docs.lib.purdue.edu/lib_research/153/

——— "Opportunities and Barriers for Librarians in Exploring Data: Observations from the Data Curation Profile Workshops." Journal of eScience Librarianship 2, no. 2 (2013): 17-33. http://dx.doi.org/10.7191/jeslib.2013.1042

Carlson, Jake, Megan Sapp Nelson, Lisa R. Johnston, and Amy Koshoffer. "Developing Data Literacy Programs: Working with Faculty, Graduate Students and Undergraduates " Bulletin of the Association for Information Science and Technology 41, no. 6 (2015): 14-17. https://www.asist.org/publications/bulletin/aug-2015/developing-data-literacy-programs/

Carlson, Jake, and Marianne Stowell-Bracke. "Data Management and Sharing from the Perspective of Graduate Students: An Examination of the Culture and Practice at the Water Quality Field Station." portal: Libraries and the Academy 13, no. 4 (2013): 343-361. https://muse.jhu.edu/login?auth=0&type=summary&url=/journals/portal_libraries_and_the_academy/v013/13.4.carlson.html

Carroll, Michael W. "Sharing Research Data and Intellectual Property Law: A Primer." PLOS Biology 13, no. 8 (2015): e1002235. http://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.1002235

Sharing research data by depositing it in connection with a published article or otherwise making data publicly available sometimes raises intellectual property questions in the minds of depositing researchers, their employers, their funders, and other researchers who seek to reuse research data. In this context or in the drafting of data management plans, common questions are (1) what are the legal rights in data; (2) who has these rights; and (3) how does one with these rights use them to share data in a way that permits or encourages productive downstream uses? Leaving to the side privacy and national security laws that regulate sharing certain types of data, this Perspective explains how to work through the general intellectual property and contractual issues for all research data.

This work is licensed under a Attribution 4.0 International License.

Castro, Eleni, and Alex Garnett. "Building a Bridge Between Journal Articles and Research Data: The PKP-Dataverse Integration Project." International Journal of Digital Curation 9, no. 1 (2014): 176-184. http://www.ijdc.net/index.php/ijdc/article/view/9.1.176/351

A growing number of funding agencies and international scholarly organizations are requesting that research data be made more openly available to help validate and advance scientific research. Thus, this is an opportune moment for research data repositories to partner with journal editors and publishers in order to simplify and improve data curation and publishing practices. One practical example of this type of cooperation is currently being facilitated by a two year (2012-2014) one million dollar Sloan Foundation grant, integrating two well-established open source systems: the Public Knowledge Project's (PKP) Open Journal Systems (OJS), developed by Stanford University and Simon Fraser University; and Harvard University's Dataverse Network web application, developed by the Institute for Quantitative Social Science (IQSS). To help make this interoperability possible, an OJS Dataverse plugin and Data Deposit API are being developed, which together will allow authors to submit their articles and datasets through an existing journal management interface, while the underlying data are seamlessly deposited into a research data repository, such as the Harvard Dataverse. This practice paper will provide an overview of the project, and a brief exploration of some of the specific challenges to and advantages of this integration.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Chad, Ken, and Suzanne Enright. "The Research Cycle and Research Data Management (RDM): Innovating Approaches at the University of Westminster." Insights: The UKSG Journal 27, no. 2 (2014): 147-153. http://insights.uksg.org/article/view/2048-7754.152

This article presents a case study based on experience of delivering a more joined-up approach to supporting institutional research activity and processes, research data management (RDM) and open access (OA). The result of this small study, undertaken at the University of Westminster in 2013, indicates that a more holistic approach should be adopted, embedding RDM more fully into the wider research management landscape and taking researchers' priorities into consideration. Rapid development of an innovative pilot system followed closely on from a positive engagement with researchers, and today a purpose-built, integrated and fully working set of tools are functioning within the virtual research environment (VRE). This provides a coherent 'thread' to support researchers, doctoral students and professional support staff throughout the research cycle. The article describes the work entailed in more detail, together with the impact achieved so far and what future work is planned.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Chao, Tiffany C., Melissa H. Cragin, and Carole L. Palmer. "Data Practices and Curation Vocabulary (DPCVocab): An Empirically Derived Framework of Scientific Data Practices and Curatorial Processes." Journal of the Association for Information Science and Technology 66, no. 3 (2015): 616-633. http://onlinelibrary.wiley.com/doi/10.1002/asi.23184/abstract

Chapple, Michael J. "Speaking the Same Language: Building a Data Governance Program for Institutional Impact." EDUCAUSE Review 48, no. 6 (2013): 14-27. https://net.educause.edu/ir/library/pdf/ERM1362.pdf

Charbonneau, Deborah H. "Strategies for Data Management Engagement." Medical Reference Services Quarterly 32, no. 3 (2013): 365-374. http://www.tandfonline.com/doi/full/10.1080/02763869.2013.807089#abstract

Charbonneau, Deborah H., and Joan E. Beaudoin. "State of Data Guidance in Journal Policies: A Case Study in Oncology." International Journal of Digital Curation 10, no. 2 (2015): 136-156. http://www.ijdc.net/index.php/ijdc/article/view/10.2.136

This article reports the results of a study examining the state of data guidance provided to authors by 50 oncology journals. The purpose of the study was the identification of data practices addressed in the journals' policies. While a number of studies have examined data sharing practices among researchers, little is known about how journals address data sharing. Thus, what was discovered through this study has practical implications for journal publishers, editors, and researchers. The findings indicate that journal publishers should provide more meaningful and comprehensive data guidance to prospective authors. More specifically, journal policies requiring data sharing, should direct researchers to relevant data repositories, and offer better metadata consultation to strengthen existing journal policies. By providing adequate guidance for authors, and helping investigators to meet data sharing mandates, scholarly journal publishers can play a vital role in advancing access to research data.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Chervenaka, Ann, Ian Foster, Carl Kesselman, Charles Salisbury, and Steven Tuecke. "The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Datasets." Journal of Network and Computer Applications 23, no. 3 (2000): 187-200. http://www.sciencedirect.com/science/article/pii/S1084804500901103

Childs, Sue, Julie McLeod, Elizabeth Lomas, and Glenda Cook. "Opening Research Data: Issues and Opportunities." Records Management Journal 24, no. 2 (2014): 14-162. http://dx.doi.org/10.1108/RMJ-01-2014-0005

Choudhury, G. Sayeed. "Case Study in Data Curation at Johns Hopkins University." Library Trends 57, no. 2 (2008): 211-220. http://hdl.handle.net/2142/10669

——— "Data Curation: An Ecological Perspective." College & Research Libraries News 71, no. 4 (2010): 194-196. http://crln.acrl.org/content/71/4/194.full.pdf+html

Choudhury, Sayeed, Tim DiLauro, Alex Szalay, Ethan Vishniac, Robert J. Hanisch, Julie Steffen, Robert Milkey, Teresa Ehling, and Ray Plante. "Digital Data Preservation for Scholarly Publications in Astronomy." International Journal of Digital Curation 2, no. 2 (2007): 20-30. http://www.ijdc.net/index.php/ijdc/article/view/41/26

Claibourn, Michele P. "Bigger on the Inside: building Research Data Services at the University of Virginia." Insights: The UKSG journal 28, no. 2 (2015): 100-106. http://insights.uksg.org/articles/10.1629/uksg.239/

Every story has a beginning, where the narrator chooses to start, though this is rarely the genesis. This story begins with the launch of the University of Virginia Library's new Research Data Services unit in October 2013. Born from the conjoining of a data management team and a data analysis team, Research Data Services expanded to encompass data discovery and acquisitions, research software support, and new expertise in the use of restricted data. Our purpose is to respond to the challenges created by the growing ubiquity and scale of data by helping researchers acquire, analyze, manage, and archive these resources. We have made serious strides toward becoming 'the face of data services at U.Va.' This article tells a bit of our story so far, relays some early challenges and how we've responded to them, outlines several initial successes, and summarizes a few lessons going forward.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Clements, Anna. "Research Information Meets Research Data Management. . . in the Library?" Insights: The UKSG journal 26, no. 3 (2013): 298-304. http://insights.uksg.org/articles/10.1629/2048-7754.99/

Research data management (RDM) is a major priority for many institutions as they struggle to cope with the plethora of pronouncements including funder policies, a G8 statement, REF2020 consultations, all stressing the importance of open data in driving everything from global innovation through to more accountable governance; not to mention the more direct possibility that non-compliance could result in grant income drying up. So, at the coalface, how do we become part of this global movement?

In this article the author explains the approach being taken at the University of St Andrews, building on the research information management infrastructure (data, systems and people) that has evolved since 2006. Continuing to navigate through the rapidly evolving research policy and cultural landscape, they aim to establish services to support their research community as it moves to this 'open by default' requirement of funders and governments.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Coates, Heather L. "Building Data Services From the Ground Up: Strategies and Resources." Journal of eScience Librarianship 3, no. 1 (2014): e1063. http://dx.doi.org/10.7191/jeslib.2014.1063

Collie, W. Aaron, and Michael Witt. "A Practice and Value Proposal for Doctoral Dissertation Data Curation." International Journal of Digital Curation 6, no. 2 (2011): 165-175. http://www.ijdc.net/index.php/ijdc/article/view/189/254

Collins, Ellen. "Use and Impact of UK Research Data Centres." International Journal of Digital Curation 6, no. 1 (2011): 20-31. http://www.ijdc.net/index.php/ijdc/article/view/160/228

Conway, Esther, David Giaretta, Simon Lambert, and Brian Matthews. "Curating Scientific Research Data for the Long Term: A Preservation Analysis Method in Context." International Journal of Digital Curation 6, no. 2 (2011): 38-52. http://www.ijdc.net/index.php/ijdc/article/view/182/264

Conway, Esther, Brian Matthews, David Giaretta, Simon Lambert, Michael Wilson, and Nick Draper. "Managing Risks in the Preservation of Research Data with Preservation Networks." International Journal of Digital Curation 7, no. 1 (2012): 3-15. http://www.ijdc.net/index.php/ijdc/article/view/200/269

Network modelling provides a framework for the systematic analysis of needs and options for preservation. A number of general strategies can be identified, characterised and applied to many situations; these strategies may be combined to produce robust preservation solutions tailored to the needs of the community and responsive to their environment. This paper provides an overview of this approach. We describe the components of a Preservation Network Model and go on to show how it may be used to plan preservation actions according to the requirements of the particular situation using illustrative examples from scientific archives.

This work is licensed under a Creative Commons Attribution License.

Conway, Esther, Sam Pepler, Wendy Garland, David Hoope, Fulvio Marelli, Luca Liberti, Emanuela Piervitali, Katrin Molch, Helen Glaves, and Lucio Badiali. "Ensuring the Long Term Impact of Earth Science Data through Data Curation and Preservation." Information Standards Quarterly 25, no. 3 (2013): 28-36. http://www.niso.org/publications/isq/2013/v25no3/conway

Corrall, Sheila, Mary Anne Kennan, and Waseem Afzal. "Bibliometrics and Research Data Management Services: Emerging Trends in Library Support for Research." Library Trends 61, no. 3 (2013): 636-674. http://d-scholarship.pitt.edu/18948/

Corti, Louise, Veerle Van den Eynden, Libby Bishop, and Matthew Woollard. Managing and Sharing Research Data: A Guide to Good Practice. Los Angeles: SAGE, 2014. https://us.sagepub.com/en-us/nam/managing-and-sharing-research-data/book240297

Costello, Mark J., and John Wieczorek. "Best Practice for Biodiversity Data Management and Publication." Biological Conservation 173, no. 1 (2014): 68-73. http://dx.doi.org/10.1016/j.biocon.2013.10.018

Council on Library and Information Resources, ed. Research Data Management: Principles, Practices, and Prospects. Washington, DC: Council on Library and Information Resources, 2013. http://www.clir.org/pubs/reports/pub160

Covey, Denise Troll. "ORCID @ CMU: Successes and Failures." Journal of eScience Librarianship 4, no. 2 (2015): e1083. http://escholarship.umassmed.edu/jeslib/vol4/iss2/6/

Cox, Andrew M., and Stephen Pinfield. "Research Data Management and Libraries: Current Activities and Future Priorities." Journal of Librarianship and Information Science 46 no. 4 (2014): 299-316. http://lis.sagepub.com/content/46/4/299.short

Cox, Andrew M., Stephen Pinfield, and Jennifer Smith. "Moving a Brick Building: UK Libraries Coping with Research Data Management as a 'Wicked' Problem " Journal of Librarianship and Information Science 48 no. 1 (2016): 3-17. http://lis.sagepub.com/content/48/1/3.abstract

The purpose of this paper is to explore the value to librarians of seeing research data management as a 'wicked' problem. Wicked problems are unique, complex problems which are defined differently by different stakeholders making them particularly intractable. Data from 26 semi-structured in-depth telephone interviews with librarians was analysed to see how far their perceptions of research data management aligned with the 16 features of a wicked problem identified from the literature. To a large extent research data management is perceived to be wicked, though over time good practices may emerge to help to 'tame' the problem. How interviewees thought research data management should be approached reflected this realisation. The generic value of the concept of wicked problems is considered and some first thoughts about how the curriculum for new entrants to the profession can prepare them for such problems are presented.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Cox, Andrew M., Eddy Verbaan, and Barbara Sen. "A Spider, an Octopus, or an Animal Just Coming into Existence? Designing a Curriculum for Librarians to Support Research Data Management." Journal of eScience Librarianship 3, no. 1 (2014): e1055. http://dx.doi.org/10.7191/jeslib.2014.1055

———. "Upskilling Liaison Librarians for Research Data Management." Ariadne, no. 70 (2012). http://www.ariadne.ac.uk/issue70/cox-et-al

In this context, JISC have funded the White Rose consortium of academic libraries at Leeds, Sheffield and York, working closely with the Sheffield Information School, in the RDMRose Project (link is external), to develop learning materials that will help librarians grasp the opportunity that RDM offers. The learning materials will be used in the Information School's Masters courses, and are also to be made available to other information sector training providers on a share-alike licence. A version will also be made available (from January 2013) as an Open Educational Resource for use by information professionals who want to update their competencies as part of their continuing professional development (CPD). The learning materials are being developed specifically for liaison librarians, to upskill existing professionals and to expand the knowledge base for new entrants to librarianship. It is hoped to accommodate the perspectives of any information professional, but the scope is not intended to encompass a syllabus for a data management specialist role (following the distinction made by Corrall [1]).

This article summarises current thinking developed within the project about the scope and level of such learning materials. This thinking is based on a number of sources: the literature and existing curricula and also the project vision and data collected during the project in focus groups with staff at the participating libraries.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Cragin, Melissa H., Carole L. Palmer, Jacob R. Carlson, and Michael Witt. "Data Sharing, Small Science and Institutional Repositories." Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 368, no. 1926 (2010): 4023-4038. http://rsta.royalsocietypublishing.org/content/368/1926/4023

Creamer, Andrew. "Current Issues and Approaches to Curating Student Research Data." Bulletin of the Association for Information Science and Technology 41, no. 6 (2015): 22-25. https://www.asist.org/files/bulletin/aug-15/creamer.pdf

Creamer, Andrew, Myrna E. Morales, Javier Crespo, Donna Kafel, and Elaine R. Martin. "An Assessment of Needed Competencies to Promote the Data Curation and Management Librarianship of Health Sciences and Science and Technology Librarians in New England." Journal of eScience Librarianship 1, no. 1 (2012): e1006. http://dx.doi.org/10.7191/jeslib.2012.1006

Creamer, Andrew T., Myrna E. Morales, Donna Kafel, Javier Crespo, and Elaine R. Martin. "A Sample of Research Data Curation and Management Courses." Journal of eScience Librarianship 1, no. 2 (2012). http://dx.doi.org/10.7191/jeslib.2012.1016

Crosas, Mercè. "A Data Sharing Story." Journal of eScience Librarianship 1, no. 3 (2012): e1020. http://dx.doi.org/10.7191/jeslib.2012.1020

———. "The Dataverse Network: An Open-Source Application for Sharing, Discovering and Preserving Data." D-Lib Magazine 17, no. 1/2 (2011). http://www.dlib.org/dlib/january11/crosas/01crosas.html

Crosas, Mercè, Gary King, James Honaker, and Latanya Sweeney. "Automating Open Science for Big Data." The ANNALS of the American Academy of Political and Social Science 659, no. 1 (2014): 260-273. http://gking.harvard.edu/publications/automating-open-science-big-data

Crowston, Kevin. ""Personas" to Support Development of Cyberinfrastructure for Scientific Data Sharing." Journal of eScience Librarianship 4, no. 2 (2015): e1082. http://escholarship.umassmed.edu/jeslib/vol4/iss2/2/

Cuevas-Vicenttín, Víctor, Parisa Kianmajd, Bertram Ludäscher, Paolo Missier, Fernando Chirigati, Yaxing Wei, David Koop, and Saumen Dey. "The PBase Scientific Workflow Provenance Repository." International Journal of Digital Curation 9, no. 2 (2014): 28-38. http://www.ijdc.net/index.php/ijdc/article/view/9.2.28/367

Scientific workflows and their supporting systems are becoming increasingly popular for compute-intensive and data-intensive scientific experiments. The advantages scientific workflows offer include rapid and easy workflow design, software and data reuse, scalable execution, sharing and collaboration, and other advantages that altogether facilitate "reproducible science". In this context, provenance—information about the origin, context, derivation, ownership, or history of some artifact—plays a key role, since scientists are interested in examining and auditing the results of scientific experiments.

However, in order to perform such analyses on scientific results as part of extended research collaborations, an adequate environment and tools are required. Concretely, the need arises for a repository that will facilitate the sharing of scientific workflows and their associated execution traces in an interoperable manner, also enabling querying and visualization. Furthermore, such functionality should be supported while taking performance and scalability into account.

With this purpose in mind, we introduce PBase: a scientific workflow provenance repository implementing the ProvONE proposed standard, which extends the emerging W3C PROV standard for provenance data with workflow specific concepts. PBase is built on the Neo4j graph database, thus offering capabilities such as declarative and efficient querying. Our experiences demonstrate the power gained by supporting various types of queries for provenance data. In addition, PBase is equipped with a user friendly interface tailored for the visualization of scientific workflow provenance data, making the specification of queries and the interpretation of their results easier and more effective.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Curdt, Constanze, and Dirk Hoffmeister. "Research Data Management Services for a Multidisciplinary, Collaborative Research Project: Design And Implementation of the TR32DB project Database." Program 49, no. 4 (2015): 494-512. http://dx.doi.org/10.1108/PROG-02-2015-0016

Curdt, Constanze, Dirk Hoffmeister, Guido Waldhoff, Christian Jekel, and Georg Bareth. "Scientific Research Data Management for Soil-Vegetation-Atmosphere Data—The TR32DB." International Journal of Digital Curation 7, no. 2 (2012): 68-80. http://www.ijdc.net/index.php/ijdc/article/view/220/295

The implementation of a scientific research data management system is an important task within long-term, interdisciplinary research projects. Besides sustainable storage of data, including accurate descriptions with metadata, easy and secure exchange and provision of data is necessary, as well as backup and visualisation. The design of such a system poses challenges and problems that need to be solved.

This paper describes the practical experiences gained by the implementation of a scientific research data management system, established in a large, interdisciplinary research project with focus on Soil-Vegetation-Atmosphere Data.

This work is icensed under a Creative Commons Attribution License.

Dallmeier-Tiessen, Suenje, Mariella Guercio, Robert Darby, Kathrin Gitmans, Simon Lambert, Brian Matthews, Jari Suhonen Salvatore Mele, and Michael Wilson. "Enabling Sharing and Reuse of Scientific Data." New Review of Information Networking 19, no. 1 (2014). http://www.tandfonline.com/doi/full/10.1080/13614576.2014.883936

Dallmeier-Tiessen, Sunje, Mariella Guercio, Robert Darby, Kathrin Gitmans, Simon Lambert, Jari Suhonen, and Michael Wilson. Compilation of Results on Drivers and Barriers and New Opportunities. Geneva: Opportunities for Data Exchange, 2012. https://core.ac.uk/download/files/324/30437755.pdf

Opportunities for Data Exchange (ODE) is a FP7 Project carried out by members of the Alliance for Permanent Access (APA), which is gathering evidence to support strategic investment in the emerging e-Infrastructure for data sharing, re-use and preservation. The ODE Conceptual Model has been developed within the Project to characterise the process of data sharing and the factors which give rise to variations in data sharing for different parties involved. Within the overall Conceptual Model there can be identified models of process, of context, and of drivers, barriers and enablers. The Conceptual Model has been evolved on the basis of existing knowledge and expertise, and draws on research conducted both outside of the ODE Project and in earlier stages of the Project itself (Sections 1-2).

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Dallmeier-Tiessen, Suenje, Mariella Guercio, Heikki Helin, Patricia Herterich, Kirnn Kaur, Artemis Lavasa, Juha Lehtonen, and Riina Salmivalli. Exemplar Good Governance Structures and Data Policies. Dorset, UK: Alliance for Permanent Access, 2014. http://www.alliancepermanentaccess.org/wp-content/uploads/sites/7/downloads/2014/06/APARSEN-REP-D35_1-01-1_0_incURN.pdf

Darlington, Jeffrey. "A National Archive of Datasets." Ariadne, no. 39 (2004). http://www.ariadne.ac.uk/issue39/ndad

Davis, Hilary M., and William M. Cross. "Using a Data Management Plan Review Service as a Training Ground for Librarians." Journal of Librarianship and Scholarly Communication 3, no. 2 (2015): eP1243. http://doi.org/10.7710/2162-3309.1243

INTRODUCTION Research Data Management (RDM) offers opportunities and challenges at the interface of library support and researcher needs. Libraries are in a position of balancing the capacity to provide support at the point of need while also implementing training for subject liaison librarians grounded in the practical issues and realities facing researchers and their institutions. DESCRIPTION OF PROGRAM/SERVICE The North Carolina State University (NCSU) Libraries has deployed a Data Management Plan (DMP) Review service managed by a committee of librarians with diverse experience in data management and domain expertise. By rotating librarians through membership on the committee and by inviting subject liaisons librarians to participate in the DMP Review process, our training ground model aims to develop needed competencies and support researchers through relevant services and partnerships. AUDIT OF PROGRAM/SERVICE This article presents an audit of the DMP Review service as a training ground to develop and enhance competencies as identified by the Joint Task Force on Librarians' Competencies in Support of E-Research and Scholarly Communication. NEXT STEPS AND CONCLUSIONS The DMP Review service creates opportunities for librarians to learn valuable skills while simultaneously providing a time-sensitive service to researchers. The process of auditing competencies developed by participating in the DMP Review service highlights gaps needed to more fully support RDM and reinforces the capacity of the DMP Review service as a training ground to sustain and iterate learning opportunities for librarians engaged in research support and partnerships.

This work is licensed under a Creative Commons Attribution 4.0 License.

De La Beaujardière, Jeff. "NOAA Environmental Data Management." Journal of Map & Geography Libraries 12, no. 1 (2016): 5-27. http://dx.doi.org/10.1080/15420353.2015.1087446

Dearborn, Carly C., Amy J. Barto, and Neal A. Harmeyer. "The Purdue University Research Repository: HUBzero Customization for Dataset Publication and Digital Preservation." OCLC Systems & Services: International Digita Llibrary Perspectives 30, no. 1 (2014): 15-27. http://docs.lib.purdue.edu/lib_fsdocs/62/

Dehnhard, I., E. Weichselgartner, and G. Krampen. "Researcher's Willingness to Submit Data for Data Sharing: A Case Study on a Data Archive for Psychology." Data Science Journal 12 (2013): 172-180. http://dx.doi.org/10.2481/dsj.12-037

Data sharing has gained importance in scientific communities because scientific associations and funding organizations require long term preservation and dissemination of data. To support psychology researchers in data archiving and data sharing, the Leibniz Institute for Psychology Information developed an archiving facility for psychological research data in Germany: PsychData. In this paper we report different types of data requests that were sent to researchers with the aim of building up a sustainable data archive. Resulting response rates were rather low, however, comparable to those published by other authors. Possible reasons for the reluctance of researchers to submit data are discussed.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Delasalle, Jenny. "Research Data Management at the University of Warwick: Recent Steps towards a Joined-up Approach at a UK University"." LIBREAS. Library Ideas, no. 23 (2013): 97-105. http://edoc.hu-berlin.de/libreas/23/delasalle-jenny-1/PDF/delasalle.pdf

This paper charts the steps taken and possible ways forward for the University of Warwick in its approach to research data management, providing a typical example of a UK research university's approach in two strands: requirements and support. The UK government approach and funding landscape in relation to research data management provided drivers for the University of Warwick to set requirements and provide support, and examples of good practice at other institutions, support from a central national body (the UK Digital Curation Centre) and learning from other universities' experiences all proved valuable to the University of Warwick. Through interviews with researchers at Warwick, various issues and challenges are revealed: perhaps the biggest immediate challenges for Warwick going forward are overcoming scepticism amongst researchers, overcoming costs, and understanding the implications of involving third party companies in research data management. Building technical infrastructure could sit alongside and beyond those immediate steps and beyond the challenges that face one University are those that affect academia as a whole. Researchers and university administrators need to work together to address the broader challenges, such as the accessibility of data for future use and the reward for researchers who practice data management in exemplary ways, and indeed it may be that a wider, national or international but disciplinary technical infrastructure affects what an individual university needs to achieve. As we take these steps, universities and institutions are all learning from each other.

This work is licensed under a Creative Commons Attribution 3.0 License.

Delserone, Leslie M. "At the Watershed: Preparing for Research Data Management and Stewardship at the University of Minnesota Libraries." Library Trends 57, no. 2 (2008): 202-210. http://hdl.handle.net/2142/10670

Dietrich, Dianne. "Metadata Management in a Data Staging Repository." Journal of Library Metadata 10, no. 2/3 (2010): 79-98. http://www.tandfonline.com/doi/full/10.1080/19386389.2010.506376

Dietrich, Dianne, Trisha Adamus, Alison Miner, and Gail Steinhart. "De-Mystifying the Data Management Requirements of Research Funders." Issues in Science & Technology Librarianship, no. 70 (2012). http://www.istl.org/12-summer/refereed1.html

Dillo, Ingrid, and Peter Doorn. "The Front Office-Back Office Model: Supporting Research Data Management in the Netherlands." International Journal of Digital Curation 9, no. 2 (2014): 39-46. http://www.ijdc.net/index.php/ijdc/article/view/9.2.39/368

High quality and timely data management and secure storage of data, both during and after completion of research, are an essential prerequisite for sharing that data. It is therefore crucial that universities and research institutions themselves formulate a clear policy on data management within their organization. For the implementation of this data management policy, high quality support for researchers and an adequate technical infrastructure are indispensable.

This practice paper will present an overview of the merging federated data infrastructure in the Netherlands with its front office—back office model, as a use case of an efficient and effective national support infrastructure for researchers.

We will elaborate on the stakeholders involved, on the services they offer each other, and on the benefits of this model not only for the front and back offices themselves, but also for the researchers. We will also pay attention to a number of challenges that we are facing, like the implementation of a technical infrastructure for automatic data ingest and integrating access to research data.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Donnelly, Martin. "The DCC's Institutional Engagements: Raising Research Data Management Capacity in UK Higher Education." Bulletin of the American Society for Information Science and Technology 39, no. 6 (2013): 37-40. http://www.asis.org/Bulletin/Aug-13/AugSep13_Donnelly.pdf

Donnelly, Martin, Sarah Jones, and John W. Pattenden-Fail. "DMP Online: A Demonstration of the Digital Curation Centre's Web-based Tool for Creating, Maintaining and Exporting Data Management Plans." Lecture Notes in Computer Science 6273 (2010): 530-533. http://www.springerlink.com/content/jnr4775775186h62

Donnelly, Martin, and Robin North. "The Milieu and the MESSAGE: Talking to Researchers about Data Curation Issues in a Large and Diverse E-science Project." International Journal of Digital Curation 6, no. 1 (2011): 32-44. http://www.ijdc.net/index.php/ijdc/article/view/161/229

Doty, Jennifer, Joel Herndon, Jared Lyle, and Libbie Stephenson. "Learning to Curate." Bulletin of the American Society for Information Science and Technology 40, no. 6 (2014): 31-34. http://www.asis.org/Bulletin/Aug-14/AugSep14_DotyEtAl.pdf

Doty, Jennifer, Melanie T. Kowalski, Bethany C. Nash, and Simon F. O'Riordan. "Making Student Research Data Discoverable: A Pilot Program Using Dataverse." Journal of Librarianship and Scholarly Communication 3, no. 2 (2015): eP1234. http://doi.org/10.7710/2162-3309.1234

INTRODUCTION The support and curation of research data underlying theses and dissertations are an opportunity for institutions to enhance their ETD collections. This article describes a pilot data archiving service that leverages Emory University's existing Electronic Theses and Dissertations (ETDs) program. DESCRIPTION OF PROGRAM This pilot service tested the appropriateness of Dataverse, a data repository, as a data archiving and access solution for Emory University using research data identified in Emory University's ETD repository, developed the legal documents necessary for a full implementation of Dataverse on campus, and expanded outreach efforts to meet the research data needs of graduate students. This article also situates the pilot service within the context of Emory Libraries and explains how it relates to other library efforts currently underway. NEXT STEPS The pilot project team plans to seek permission from alumni whose data were included in the pilot to make them available publicly in Dataverse, and the team will revise the ETD license agreement to allow this type of use. The team will also automate the ingest of supplemental ETD research data into the data repository where possible and create a workshop series for students who are creating research data as part of their theses or dissertations.

This work is licensed under a Creative Commons Attribution 4.0 License.

Douglass, Kimberly, Suzie Allard, Carol Tenopir, Lei Wu, and Mike Frame. "Managing Scientific Data as Public Assets: Data Sharing Practices and Policies among Full-Time Government Employees." Journal of the Association for Information Science and Technology 65, no. 2 (2014): 251-262. http://dx.doi.org/10.1002/asi.22988

Downs, Robert R., and Robert S. Chen. "Designing Submission and Workflow Services for Preserving Interdisciplinary Scientific Data." Earth Science Informatics 3, no. 1/2 (2010): 101-110. http://link.springer.com/article/10.1007%2Fs12145-010-0051-6

——— "Organizational Needs for Managing and Preserving Geospatial Data and Related Electronic Records." Data Science Journal 4 (2005). http://datascience.codata.org/articles/abstract/10.2481/dsj.4.255/

Government agencies and other organizations are required to manage and preserve records that they create and use to facilitate future access and reuse. The increasing use of geospatial data and related electronic records presents new challenges for these organizations, which have relied on traditional practices for managing and preserving records in printed form. This article reports on an investigation of current and future needs for managing and preserving geospatial electronic records on the part of localand state-level organizations in the New York City metropolitan region. It introduces the study and describes organizational needs observed, including needs for organizational coordination and interorganizational cooperation throughout the entire data lifecycle.

This work is licensed under a Creative Commons Attribution 3.0 License.

Downs, Robert R., Ruth Duerr, and Denise J. Hills. "Data Stewardship in the Earth Sciences." D-Lib Magazine 21, no. 7/8 (2015). http://www.dlib.org/dlib/july15/downs/07downs.html

Durantea, Kim, and Darren Hardya. "Discovery, Management, and Preservation of Geospatial Data Using Hydra." Journal of Map & Geography Libraries: Advances in Geospatial Information, Collections & Archives 11, no. 2 (2015). http://www.tandfonline.com/doi/abs/10.1080/15420353.2015.1041630?journalCode=wmgl20

Duranti, L. "The Long-Term Preservation of Accurate and Authentic Digital Data: The INTERPARES Project." Data Science Journal 4 (2006). http://datascience.codata.org/articles/abstract/10.2481/dsj.4.106/

This article presents the InterPARES Project, a multidisciplinary international research initiative aimed at developing the theoretical and methodological knowledge necessary for the long-term preservation of digital entities produced in the course of business or research activity so that their authenticity can be presumed or verified. The methodology, research activities, preliminary findings and projected products are discussed in the context of the issues that the project attempts to address.

This work is licensed under a Creative Commons Attribution 3.0 License.

Dürr, Eugène, Kees van der Meer, Wim Luxemburg, and Ronald Dekker. "Dataset Preservation for the Long Term: Results of the DareLux Project." International Journal of Digital Curation 3, no. 1 (2008): 29-43. http://www.ijdc.net/index.php/ijdc/article/view/61/40

Dürr, Eugène, Kees van der Meer, Wim Luxemburg, Maria Heijne, and Ronald Dekker. "Long-Time Preservation of Data Sets, Results of the DareLux Project." Information Services and Use 28, no. 3/4 (2008): 281-294. http://content.iospress.com/articles/information-services-and-use/isu571

Dyke, Kevin R., Ryan Mattke, Len Kne, and Shawn Rounds. "Placing Data in the Land of 10,000 Lakes: Navigating the History and Future of Geospatial Data Production, Stewardship, and Archiving in Minnesota." Journal of Map & Geography Libraries 12, no. 1 (2016): 52-72. http://hdl.handle.net/11299/178320

Eaker, Christopher. "Planning Data Management Education Initiatives: Process, Feedback, and Future Directions." Journal of eScience Librarianship 3, no. 1 (2014): e1054. http://dx.doi.org/10.7191/jeslib.2014.1054

Erway, Ricky. Starting the Conversation: University-wide Research Data Management Policy Dublin, Ohio OCLC Research, 2013. http://oclc.org/research/publications/library/2013/2013-08r.html

This call for action addresses the high-level benefits of adopting a university-wide policy regarding research data management. It identifies the various university stakeholders and suggests that the library initiate a conversation among them in order to get buy-in for a proactive, rather than reactive, high-level policy for responsible data planning and management that is supported and sustainable.

The intended audience for this call for action is library directors, not because they alone can make this happen, but to encourage them to initiate the conversation. They are invested, because the library may be the recipient of data in need of curation and of requests for guidance, but more importantly, library staff have significant skills and experience to contribute to the discussion. This is an opportunity for the library director to play an entrepreneurial role in furthering the mission of the larger enterprise.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Erway, Ricky, Laurence Horton, Amy Nurnberger, Reid Otsuji, and Amy Rushing. Building Blocks: Laying the Foundation for a Research Data Management Program. Dublin, OH: OCLC Research, 2016. http://www.oclc.org/content/dam/research/publications/2016/oclcresearch-data-management-building-blocks-2016.pdf

This document is intended for those who are just beginning to offer data services to researchers at their universities . Part 1 assumes that very little , if anything, is in place, and that resources are limited. It seeks to guide the individual who has data management program responsibilities in directions that will lay a very basic foundation. Part 2 helps identify steps for building on that foundation as needs become evident and as resources allow.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Erway, Ricky, and Amanda Rinehart. If You Build It, Will They Fund? Making Research Data Management Sustainable. Dublin, Ohio: OCLC Research, 2016. http://www.oclc.org/content/dam/research/publications/2016/oclcresearch-making-research-data-management-sustainable-2016.pdf

In order to explore the various possibilities, we provide an overview of several funding strategies and their standing in the US. The arguments for and against each strategy are presented and circumstances in other countries are described in the appendix.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Erwin, Tracey, and Julie Sweetkind-Singer. "The National Geospatial Digital Archive: A Collaborative Project to Archive Geospatial Data." Journal of Map & Geography Libraries 6, no. 1 (2010): 6-25. http://www.tandfonline.com/doi/abs/10.1080/15420350903432440

Erwin, Tracey, Julie Sweetkind-Singer, and Mary Lynette Larsgaard. "The National Geospatial Digital Archives—Collection Development: Lessons Learned." Library Trends 57, no. 3 (2009): 490-515. http://hdl.handle.net/2142/13592

Eschenfelder, Kristin R., and Andrew Johnson. "Managing the Data Commons: Controlled Sharing of Scholarly Data." Journal of the Association for Information Science and Technology 65, no. 9 (2014): 1757-1774, DOI: http://dx.doi.org/10.1002/asi.23086

Faniel, Ixchel M., and Ann Zimmerman. "Beyond the Data Deluge: A Research Agenda for Large-Scale Data Sharing and Reuse." International Journal of Digital Curation 6, no. 1 (2011): 58-69. http://www.ijdc.net/index.php/ijdc/article/view/163/231

Fary, Michael, and Kim Owen. Developing an Institutional Research Data Management Plan Service. Louisville, CO: EDUCAUSE, 2013. http://www.educause.edu/library/resources/developing-institutional-research-data-management-plan-service

Faundeen, John L. "The Challenge of Archiving and Preserving Remotely Sensed Data." Data Science Journal 2 (2003): 159-163. http://datascience.codata.org/articles/abstract/10.2481/dsj.2.159/

Few would question the need to archive the scientific and technical (S&T) data generated by researchers. At a minimum, the data are needed for change analysis. Likewise, most people would value efforts to ensure the preservation of the archived S&T data. Future generations will use analysis techniques not even considered today. Until recently, archiving and preserving these data were usually accomplished within existing infrastructures and budgets. As the volume of archived data increases, however, organizations charged with archiving S&T data will be increasingly challenged (U.S. General Accounting Office, 2002). The U.S. Geological Survey has had experience in this area and has developed strategies to deal with the mountain of land remote sensing data currently being managed and the tidal wave of expected new data. The Agency has dealt with archiving issues, such as selection criteria, purging, advisory panels, and data access, and has met with preservation challenges involving photographic and digital media. That experience has allowed the USGS to develop management approaches, which this paper outlines.

This work is licensed under a Creative Commons Attribution 3.0 License.

Fear, Kathleen. "Building Outreach on Assessment: Researcher Compliance with Journal Policies for Data Sharing." Bulletin of the Association for Information Science and Technology 41, no. 6 (2015): 18-21. https://www.asist.org/publications/bulletin/aug-2015/building-outreach-on-assessment/

———. "'You Made It, You Take Care of It': Data Management as Personal Information Management." International Journal of Digital Curation 6, no. 2 (2011): 53-77. http://www.ijdc.net/index.php/ijdc/article/view/183/250

Fear, Kathleen, and Devan Ray Donaldson. "Provenance and Credibility in Scientific Data Repositories." Archival Science 12, no. 3 (2012): 319-339. http://link.springer.com/article/10.1007/s10502-012-9172-7

Fearon, David, Jr., Betsy Gunia, Barbara E. Pralle, Sherry Lake, and Andrew L. Sallans. SPEC Kit 334: Research Data Management Services. Washington, DC: ARL, 2013. http://publications.arl.org/Research-Data-Management-Services-SPEC-Kit-334

Fecher, Benedikt, Sascha Friesike, and Marcel Hebing. "What Drives Academic Data Sharing?" PLoS ONE 10, no. 2 (2015): e0118053. http://doi.org/10.1371/journal.pone.0118053

Despite widespread support from policy makers, funding agencies, and scientific journals, academic researchers rarely make their research data available to others. At the same time, data sharing in research is attributed a vast potential for scientific progress. It allows the reproducibility of study results and the reuse of old data for new research questions. Based on a systematic review of 98 scholarly papers and an empirical survey among 603 secondary data users, we develop a conceptual framework that explains the process of data sharing from the primary researcher's point of view. We show that this process can be divided into six descriptive categories: Data donor, research organization, research community, norms, data infrastructure, and data recipients. Drawing from our findings, we discuss theoretical implications regarding knowledge creation and dissemination as well as research policy measures to foster academic collaboration. We conclude that research data cannot be regarded as knowledge commons, but research policies that better incentivise data sharing are needed to improve the quality of research results and foster scientific progress.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Federer, Lisa. "The Librarian as Research Informationist: A Case Study." Journal of the Medical Library Association 101, no. 4 (2013): 298-302. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3794685/

Feijen, Martin. What Researchers Want. Utrecht: SURFfoundation, 2011. http://www.surf.nl/binaries/content/assets/surf/en/knowledgebase/2011/What_researchers_want.pdf

In October 2010, the Dutch universities explored po ssible projects in the area of research data. One of the outcomes of this discussion was the decision to first investigate what researchers need with respect to storing and accessing research data . The present literature study is the result of that investigation. Fifteen sources were studied, consisting of reports from 2008-2010 covering the Netherlands, the UK, the USA, Australia and Europe.

This work is licensed under a Creative Commons Attribution 3.0 Netherlands Licence.

Ferguson, Jen. "Description and Annotation of Biomedical Data Sets." Journal of eScience Librarianship 1, no. 1 (2012: e1000). http://dx.doi.org/10.7191/jeslib.2012.1000

Ferreira, Filipe, Miguel E. Coimbra, Raquel Bairrão, Ricardo Viera, Ana T. Freitas, Luís M. S. Russo, and José Borbinha. "Data Management in Metagenomics: A Risk Management Approach." International Journal of Digital Curation 9, no. 1 (2014): 41-56. http://www.ijdc.net/index.php/ijdc/article/view/9.1.41/340

In eScience, where vast data collections are processed in scientific workflows, new risks and challenges are emerging. Those challenges are changing the eScience paradigm, mainly regarding digital preservation and scientific workflows. To address specific concerns with data management in these scenarios, the concept of the Data Management Plan was established, serving as a tool for enabling digital preservation in eScience research projects. We claim risk management can be jointly used with a Data Management Plan, so new risks and challenges can be easily tackled. Therefore, we propose an analysis process for eScience projects using a Data Management Plan and ISO 31000 in order to create a Risk Management Plan that can complement the Data Management Plan. The motivation, requirements and validation of this proposal are explored in the MetaGen-FRAME project, focused in Metagenomics.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Finney, K. "Managing Antarctic Data—A Practical Use Case." Data Science Journal 13 (2014): PDA8-PDA14. http://datascience.codata.org/articles/abstract/10.2481/dsj.IFPDA-02/

Scientific data management is performed to ensure that data are curated in a manner that supports their qualified reuse. Curation usually involves actions that must be performed by those who capture or generate data and by a facility with the capability to sustainably archive and publish data beyond an individual project's lifecycle. The Australian Antarctic Data Centre is such a facility. How this centre is approaching the administration of Antarctic science data is described in the following paper and serves to demonstrate key facets necessary for undertaking polar data management in an increasingly connected global data environment.

This work is licensed under a Creative Commons Attribution 3.0 License.

Fitt, Alistai, Rowena Rouse, and Sarah Taylor. "Research Data Management: An Approach from a Modern University with a Growing Research Portfolio." Journal of Digital Media Management 3, no. 4 (2015): 320-328. http://www.ingentaconnect.com/content/hsp/jdmm/2015/00000003/00000004/art00007

Florance, Patrick, Marc McGee, Christopher Barnett, Stephen McDonald. "The Open Geoportal Federation." Journal of Map & Geography Libraries 11, no. 3 (2015): 376=394. http://dx.doi.org/10.1080/15420353.2015.1054543

Fong, Bonnie L., and Minglu Wang. "Required Data Management Training for Graduate Students in an Earth and Environmental Sciences Department." Journal of eScience Librarianship 4, no. 1 (2015): e1067. http://escholarship.umassmed.edu/jeslib/vol4/iss1/3/

Fox, Robert. "The Art and Science of Data Curation." OCLC Systems & Services: International Digita Llibrary Perspectives 29, no. 4 (2013): 195-199. http://www.emeraldinsight.com/doi/abs/10.1108/OCLC-07-2013-0021

Frank, Rebecca D., Elizabeth Yake, and Ixchel M. Faniel. "Destruction/Reconstruction: Preservation of Archaeological and Zoological Research Data." Archival Science 15, no. 2 (2015): 141-167. http://link.springer.com/article/10.1007%2Fs10502-014-9238-9

Frey, Jeremy. "Curation of Laboratory Experimental Data as Part of the Overall Data Lifecycle." International Journal of Digital Curation 3, no. 1 (2008): 44-62. http://www.ijdc.net/index.php/ijdc/article/view/62/41

Frey, Jeremy, Simon J. Coles, Colin Bird, and Cerys Willoughby. "Collection, Curation, Citation at Source: Publication@Source 10 Years On." International Journal of Digital Curation 10, no. 2 (2015): 1-11. http://www.ijdc.net/index.php/ijdc/article/view/10.2.1

The Southampton chemical information group had its genesis in 2001, when we began an e-Science pilot project to investigate structure-property mapping, combinatorial chemistry, and the Grid. CombeChem instigated a range of activities that have since been underway for more than ten years, in many ways matching the expansion of interest in using the Web as a vehicle for collection, curation, dissemination, reuse, and exploitation of scientific data and information. Chemistry has frequently provided the exemplar case studies, notably for the series of projects—funded by Jisc and EPSRC—that investigated the issues associated with the long-term preservation of data to support the scholarly knowledge cycle, such as the eBank UK project.

Rapid developments in Internet access and mobile technology have significantly influenced the way researchers view connectivity, data standards, and the increasing importance and power of semantics and the Semantic Web. These technical advances interact strongly with the social dimension and have led to a reconsideration of the responsibilities of researchers for the quality of their research and for satisfying the requirements of modern stakeholders. Such obligations have given rise to discussions about Open Access and Open Data, creating a range of alternatives that are now technically feasible but need to be socially acceptable. Business plans are changing too, but in a strange contradiction, desire can run ahead of what is possible, sensible, and affordable, while lagging behind in imagination of what would be technically possible and potentially game-changing!

Taking the chemical sciences as our example and focusing on the curation of research data, we explore from our perspective, ten years back and ten years forward, how far we have been able to re-imagine the data/information value pathway from bench to publication. We assess not only the major advances and changes that have been achieved, but also where we have been less successful than we might have hoped. We explore the directions for the future, based on what is clearly already possible and on what we can envisage becoming feasible in the near future.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Friddell, J., E. LeDrew, and W. Vincent. "The Polar Data Catalogue: Best Practices for Sharing and Archiving Canada's Polar Data." Data Science Journal 13 (2014): PDA1-PDA7. http://datascience.codata.org/articles/abstract/10.2481/dsj.IFPDA-01/.

The Polar Data Catalogue (PDC) is a growing Canadian archive and public access portal for Arctic and Antarctic research and monitoring data. In partnership with a variety of Canadian and international multi-sector research programs, the PDC encompasses the natural, social, and health sciences. From its inception, the PDC has adopted international standards and best practices to provide a robust infrastructure for reliable security, storage, discoverability, and access to Canada's polar data and metadata. Current efforts focus on developing new partnerships and incentives for data archiving and sharing and on expanding connections to other data centres through metadata interoperability protocols.

This work is licensed under a Creative Commons Attribution 3.0 License.

Frisz, Chris, Geoffrey Brown, and Samuel Waggoner. "Assessing Migration Risk for Scientific Data Formats." International Journal of Digital Curation 7, no. 1 (2012): 27-38. http://www.ijdc.net/index.php/ijdc/article/view/202/271

The majority of information about science, culture, society, economy and the environment is born digital, yet the underlying technology is subject to rapid obsolescence. One solution to this obsolescence, format migration, is widely practiced and supported by many software packages, yet migration has well known risks. For example, newer formats—even where similar in function—do not generally support all of the features of their predecessors, and, where similar features exist, there may be significant differences of interpretation.

There appears to be a conflict between the wide use of migration and its known risks. In this paper we explore a simple hypothesis—that, where migration paths exist, the majority of data files can be safely migrated leaving only a few that must be handled more carefully—in the context of several scientific data formats that are or were widely used. Our approach is to gather information about potential migration mismatches and, using custom tools, evaluate a large collection of data files for the incidence of these risks. Our results support our initial hypothesis, though with some caveats. Further, we found that writing a tool to identify "risky" format features is considerably easier than writing a migration tool.

This work is licensed under a Creative Commons Attribution License.

Gabridge, Tracy. "The Last Mile: Liaison Roles in Curating Science and Engineering Research Data." Research Library Issues: A Bimonthly Report from ARL, CNI, AND SPARC, no. 265 (2009): 15-21. http://publications.arl.org/rli265/16

Ganley, Emma. "PLOS Data Policy: Catalyst for a Better Research Process." College & Research Libraries News 75, no. 6 (2014): 305-308. http://crln.acrl.org/content/75/6/305.full

Garrett, Leigh, Marie-Therese Gramstadt, and Carlos Silva. " Here, KAPTUR This! Identifying and Selecting the Infrastructure Required to Support the Curation and Preservation of Visual Arts Research Data." International Journal of Digital Curation 8, no. 2 (2013): 68-88. http://www.ijdc.net/index.php/ijdc/article/view/8.2.68/318

Research data is increasingly perceived as a valuable resource and, with appropriate curation and preservation, it has much to offer learning, teaching, research, knowledge transfer and consultancy activities in the visual arts. However, very little is known about the curation and preservation of this data: none of the specialist arts institutions have research data management policies or infrastructure and anecdotal evidence suggests that practice is ad hoc, left to individual researchers and teams with little support or guidance. In addition, the curation and preservation of such diverse and complex digital resources as found in the visual arts is, in itself, challenging. Led by the Visual Arts Data Service, a research centre of the University for the Creative Arts, in collaboration with the Glasgow School of Art; Goldsmiths College, University of London; and University of the Arts London, and funded by JISC, the KAPTUR project (2011-2013) seeks to address the lack of awareness and explore the potential of research data management systems in the arts by discovering the nature of research data in the visual arts, investigating the current state of research data management, developing a model of best practice applicable to both specialist arts institutions and arts departments in multidisciplinary institutions, and by applying, testing and piloting the model with the four institutional partners. Utilising the findings of the KAPTUR user requirement and technical review, this paper will outline the method and selection of an appropriate research data management system for the visual arts and the issues the team encountered along the way.

This work is licensed under a Creative Commons Attribution License.

Garrett, Leigh, Marie-Therese Gramstadt, Carlos Silva, and Anne Spalding. "KAPTUR the Highlights: Exploring Research Data Management in the Visual Arts." Ariadne, no. 71 (2013). http://www.ariadne.ac.uk/issue71/garrett-et-al

Gaudette, Glenn R., and Donna Kafel. "A Case Study: Data Management in Biomedical Engineering." Journal of eScience Librarianship 1, no. 3 (2012): e1027. http://dx.doi.org/10.7191/jeslib.2012.1027

Gelernter, Judith, and Michael Lesk. "Use of Ontologies for Data Integration and Curation." International Journal of Digital Curation 6, no. 1 (2011). http://www.ijdc.net/index.php/ijdc/article/view/164/232

Getler, Magdalena, Diana Sisu, Sarah Jones, and Kerry Miller. "DMPonline Version 4.0: User-Led Innovation." International Journal of Digital Curation 9, no. 1 (2014): 193-219. http://www.ijdc.net/index.php/ijdc/article/view/9.1.193/353

DMPonline is a web-based tool to help researchers and research support staff produce data management and sharing plans. Between October and December 2012, we examined DMPonline in unprecedented detail. The results of this evaluation led to some major changes. We have shortened the DCC Checklist for a Data Management Plan and revised how this is used in the tool. We have also amended the data model for DMPonline, improved workflows and redesigned the user interface.

This paper reports on the evaluation, outlining the methods used, the results gathered and how they have been acted upon. We conducted usability testing on v.3 of DMPonline and the v.4 beta prior to release. The results from these two rounds of usability testing are compared to validate the changes made. We also put forward future plans for a more iterative development approach and greater community input.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Giarlo, Michael J. "Academic Libraries as Data Quality Hubs." Journal of Librarianship and Scholarly Communication 1, no. 3 (2013): eP1059. http://dx.doi.org/10.7710/2162-3309.1059

Academic libraries have a critical role to play as data quality hubs on campus. There is an increased need to ensure data quality within 'e-science'. Given academic libraries' curation and preservation expertise, libraries are well suited to support the data quality process. Data quality measurements are discussed, including the fundamental elements of trust, authenticity, understandability, usability and integrity, and are applied to the Digital Curation Lifecycle model to demonstrate how these measures can be used to understand and evaluate data quality within the curatorial process. Opportunities for improvement and challenges are identified as areas that are fruitful for future research and exploration.

This work is licensed under a Creative Commons Creative Commons Attribution 3.0 License.

Goben, Abigail, and Rebecca Raszewski. "Research Data Management Self-Education for Librarians: A Webliography." Issues in Science and Technology Librarianship, no. 82 (2015). http://istl.org/15-fall/internet2.html

As data as a scholarly object continues to grow in importance in the research community, librarians are undertaking increasing responsibilities regarding data management and curation. New library initiatives include assisting researchers in finding data sets for reuse; locating and hosting repositories for required archiving; consultations on workflow, data management plans, and best practices; responding to changing funder policies (Whitmire, et al. 2015) and development of department or institutional policies. Librarians looking to provide services or expand into these areas will need both foundational resources and information about engaging the network of librarians exploring data. This webliography is intended for librarians seeking to enhance their own knowledge and assist peers in improving their data management awareness.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Goben, Abigail, and Dorothea Salo. "Federal Research Data Requirements Set to Change." College & Research Libraries News 74, no. 8 (2013): 421-425. http://crln.acrl.org/content/74/8/421.full

Goldman, Julie, Donna Kafel, and Elaine R. Martin. "Assessment of Data Management Services at New England Region Resource Libraries." Journal of eScience Librarianship 4, no. 1 (2015): e1068. http://escholarship.umassmed.edu/jeslib/vol4/iss1/4/

Goodison, Crystal, Alexis Guillaume Thomas, and Sam Palmer. "The Florida Geographic Data Library: Lessons Learned and Workflows for Geospatial Data Management." Journal of Map & Geography Libraries 12, no. 1 (2016): 73-99. http://dx.doi.org/10.1080/15420353.2015.1038861

Goodman, Alyssa, Alberto Pepe, Alexander W. Blocker, Christine L. Borgman, Kyle Cranmer, Merce Crosas, Rosanne Di Stefano, Yolanda Gil, Paul Groth, Margaret Hedstrom, David W. Hogg, Vinay Kashyap, Ashish Mahabal, Aneta Siemiginowska, and Aleksandra Slavkovic. "Ten Simple Rules for the Care and Feeding of Scientific Data." PLoS Computational Biology 10. no. 4 (2014): e1003542. http://dx.doi.org/10.1371/journal.pcbi.1003542

This article offers a short guide to the steps scientists can take to ensure that their data and associated analyses continue to be of value and to be recognized. In just the past few years, hundreds of scholarly papers and reports have been written on questions of data sharing, data provenance, research reproducibility, licensing, attribution, privacy, and more—but our goal here is not to review that literature. Instead, we present a short guide intended for researchers who want to know why it is important to "care for and feed" data, with some practical advice on how to do that. The final section at the close of this work (Links to Useful Resources) offers links to the types of services referred to throughout the text.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Gordon, Andrew S., David S. Millman, Lisa Steiger, Karen E. Adolph, and Rick O. Gilmore. "Researcher-Library Collaborations: Data Repositories as a Service for Researchers." Journal of Librarianship and Scholarly Communication 3, no. 2 (2015): eP1238. http://doi.org/10.7710/2162-3309.1238

INTRODUCTION New interest has arisen in organizing, preserving, and sharing the raw materials-the data and metadata-that undergird the published products of research. Library and information scientists have valuable expertise to bring to bear in the effort to create larger, more diverse, and more widely used data repositories. However, for libraries to be maximally successful in providing the research data management and preservation services required of a successful data repository, librarians must work closely with researchers and learn about their data management workflows. DESCRIPTION OF SERVICES Databrary is a data repository that is closely linked to the needs of a specific scholarly community-researchers who use video as a main source of data to study child development and learning. The project's success to date is a result of its focus on community outreach and providing services for scholarly communication, engaging institutional partners, offering services for data curation with the guidance of closely involved information professionals, and the creation of a strong technical infrastructure. NEXT STEPS Databrary plans to improve its curation tools that allow researchers to deposit their own data, enhance the user-facing feature set, increase integration with library systems, and implement strategies for long-term sustainability.

This work is licensed under a Creative Commons Attribution 4.0 License.

Green, Katie, Kieron Niven, and Georgina Field. "Migrating 2 and 3D Datasets: Preserving AutoCAD at the Archaeology Data Service." ISPRS International Journal of Geo-Information 5, no. 4 (2016): 44. http://dx.doi.org/10.3390/ijgi5040044

The Archaeology Data Service (ADS) is a digital archive that has been promoting good practice in the use of digital archaeological data and supporting research, learning and teaching with high quality and dependable digital resources for twenty years. The ADS does this by preserving digital data in the long-term and by promoting and disseminating, open and free datasets, gathered from all sectors of archaeology. An integral component of the ADS remit has been the life-cycle principle of preservation, curation and dissemination of data in order to enable re-use. The ADS practices a combination of normalization, version migration, format migration and refreshment for the active management and ongoing preservation of all archived data types. This paper highlights the importance of the ongoing management of research data for long-term preservation. In particular this paper focuses on the challenges of migrating spatial data, specifically Computer Aided Design (CAD) files. Successful data migration of these files ensures that data is accessible and usable, and provides many opportunities through data re-use to combine and re-interrogate datasets, allowing new archaeological interpretations to be developed.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Greenberg, Jane. "Metadata for Scientific Data: Historical Considerations, Current Practice, and Prospects." Journal of Library Metadata 10, no. 2/3 (2010): 75-78. http://www.tandfonline.com/doi/abs/10.1080/19386389.2010.520262

Greenberg, Jane, Hollie C. White, Sarah Carrier, and Ryan Scherle. "A Metadata Best Practice for a Scientific Data Repository." Journal of Library Metadata 9, no. 3/4 (2009): 194-212. http://www.tandfonline.com/doi/full/10.1080/19386380903405090

Griffiths, Aaron. "The Publication of Research Data: Researcher Attitudes and Behaviour." International Journal of Digital Curation 4, no. 1 (2009): 46-56. http://www.ijdc.net/index.php/ijdc/article/view/101/76

Groenewegen, David, and Andrew Treloar. "Adding Value by Taking a National and Institutional Approach to Research Data: The ANDS Experience." International Journal of Digital Curation 8, no. 2 (2013): 89-98. http://www.ijdc.net/index.php/ijdc/article/view/8.2.89/319

The Australian National Data Service (ANDS) has been working to add value to Australia's research data environment since 2009. This paper looks at the changes that have occurred over this time, ANDS' role in those changes and the current state of the Australian research sector at this time, using case studies of selected institutions.

This work is licensed under a Creative Commons Attribution License.

Grootveld, Marjan, and Jeff van Egmond. "Peer-Reviewed Open Research Data: Results of a Pilot." International Journal of Digital Curation 7, no. 2 (2012): 81-91. http://www.ijdc.net/index.php/ijdc/article/view/221/290

Peer review of publications is at the core of science and primarily seen as instrument for ensuring research quality. However, it is less common to independently value the quality of the underlying data as well. In the light of the 'data deluge' it makes sense to extend peer review to the data itself and this way evaluate the degree to which the data are fit for re-use. This paper describes a pilot study at EASY—the electronic archive for (open) research data at our institution. In EASY, researchers can archive their data and add metadata themselves. Devoted to open access and data sharing, at the archive we are interested in further enriching these metadata with peer reviews.

As a pilot, we established a workflow where researchers who have downloaded data sets from the archive were asked to review the downloaded data set. This paper describes the details of the pilot including the findings, both quantitative and qualitative. Finally, we discuss issues that need to be solved when such a pilot is turned into a structural peer review functionality for the archiving system.

This work is licensed under a Creative Commons Attribution License.

Gutmann, M., K. Schürer, D. Donakowski, and Hilary Beedham. "The Selection, Appraisal, and Retention of Social Science Data." Data Science Journal 3 (2004): 209-221. http://datascience.codata.org/articles/abstract/10.2481/dsj.3.209/

The number of data collections produced in the social sciences prohibits the archiving of every scientific study. It is therefore necessary to make decisions regarding what can be preserved and why it should be preserved. This paper reviews the processes used by two data archives, one from the United States and one from the United Kingdom, to illustrate how data are selected for archiving, how they are appraised, and what steps are required to retain the usefulness of the data for future use. It also presents new initiatives that seek to encourage an increase in the long-term preservation of digital resources.

This work is licensed under a Creative Commons Attribution 3.0 License.

Gutmann, Myron P., Mark Abrahamson, Margaret O. Adams, Micah Altman, Caroline Arms, Kenneth Bollen, Michael Carlson, Jonathan Crabtree, Darrell Donakowski, Gary King, Jared Lyle, Marc Maynard, Amy Pienta, Richard Rockwell, Lois Timms-Ferrara, and Copeland H. Young. "From Preserving the Past to Preserving the Future: The Data-PASS Project and the Challenges of Preserving Digital Social Science Data." Library Trends 57, no. 3 (2009): 315-337. http://hdl.handle.net/2142/13593

Guy, Marieke, Martin Donnelly, and Laura Molloy. "Pinning It Down: Towards a Practical Definition of 'Research Data' for Creative Arts Institutions." International Journal of Digital Curation 8, no. 2 (2013): 99-110. http://www.ijdc.net/index.php/ijdc/article/view/8.2.99/320

There is a widespread understanding among scientific researchers about what is meant by 'research data'; however this does not readily translate into a creative context. As part of its engagement with the University of the Arts London (UAL) and via its support for the JISC Managing Research Data Programme, the Digital Curation Centre (DCC) and partners have worked towards an acceptable and practical definition of research data for creative arts institutions. This paper describes the activities carried out to help pin down such a definition, including a literature review, short and extended interviews with researchers, interactions with an academic arts research practitioner, and distillation of the results from a one-day workshop which took place in London in September 2012.

This work is licensed under a Creative Commons Attribution License.

Halbert, Martin. " The Problematic Future of Research Data Management: Challenges, Opportunities and Emerging Patterns Identified by the DataRes Project." International Journal of Digital Curation 8, no. 2 (2013): 111-122. http://www.ijdc.net/index.php/ijdc/article/view/8.2.111/321

This paper describes findings and projections from a project that has examined emerging policies and practices in the United States regarding the long-term institutional management of research data. The DataRes project at the University of North Texas (UNT) studied institutional transitions taking place during 2011-2012 in response to new mandates from U.S. governmental funding agencies requiring research data management plans to be submitted with grant proposals. Additional synergistic findings from another UNT project, termed iCAMP, will also be reported briefly.

This paper will build on these data analysis activities to discuss conclusions and prospects for likely developments within coming years based on the trends surfaced in this work. Several of these conclusions and prospects are surprising, representing both opportunities and troubling challenges, for not only the library profession but the academic research community as a whole.

This work is licensed under a Creative Commons Attribution License.

Hanson, Karen L., Theodora A. Bakker, Mario A. Svirsky, Arlene C. Neuman, and Neil Rambo. "Informationist Role: Clinical Data Management in Auditory Research." Journal of eScience Librarianship 2, no. 1 (2013): e1030. http://dx.doi.org/10.7191/jeslib.2013.1030

Harris-Pierce, Rebecca L., and Yan Quan Liu. "Is Data Curation Education at Library and Information Science Schools in North America Adequate?" New Library World 113, no. 11/12, (2012): 598-613. http://dx.doi.org/10.1108/03074801211282957

Hedges, Mark, Tobias Blanke, Stella Fabiane, Gareth Knight, and Eric Liao. "Sheer Curation of Experiments: Data, Process, Provenance." Journal of Digital Information 13, no. 1 (2012). http://journals.tdl.org/jodi/index.php/jodi/article/view/5883

Hedges, Mark, Tobias Blanke, and Adil Hasan. "Rule-Based Curation and Preservation of Data: A Data Grid Approach using iRODS." Future Generation Computer Systems 25, no. 4 (2009): 446-452. http://www.sciencedirect.com/science/article/pii/S0167739X08001660

Hedges, Mark, Mike Haft, and Gareth Knight. "FISHNet: Encouraging Data Sharing and Reuse in the Freshwater Science Community " Journal of Digital Information 13, no. 1 (2012). http://journals.tdl.org/jodi/index.php/jodi/article/view/5884

Heidorn, P. Bryan. "The Emerging Role of Libraries in Data Curation and E-Science." Journal of Library Administration 51, no. 7/8 (2011): 662-672. http://www.tandfonline.com/doi/abs/10.1080/01930826.2011.601269

——— "Shedding Light on the Dark Data in the Long Tail of Science." Library Trends 57, no. 2 (2008): 280-299. http://hdl.handle.net/2142/10672

Helbig, Kerstin. "Research Data Management Training for Geographers: First Impressions." ISPRS International Journal of Geo-Information 5, no. 4 (2016): 40. http://dx.doi.org/10.3390/ijgi5040040

Sharing and secondary analysis of data have become increasingly important for research. Especially in geography, the collection of digital data has grown due to technological changes. Responsible handling and proper documentation of research data have therefore become essential for funders, publishers and higher education institutions. To achieve this goal, universities offer support and training in research data management. This article presents the experiences of a pilot workshop in research data management, especially for geographers. A discipline-specific approach to research data management training is recommended. The focus of this approach increases researchers' interest and allows for more specific guidance. The instructors identified problems and challenges of research data management for geographers. In regards to training, the communication of benefits and reaching the target groups seem to be the biggest challenges. Consequently, better incentive structures as well as communication channels have to be established.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Helbig, Kerstin, Brigitte Hausstein, and Ralf Toepfer. "Supporting Data Citation: Experiences and Best Practices of a DOI Allocation Agency for Social Sciences." Journal of Librarianship and Scholarly Communication 3, no. 2 (2015): eP1220. http://doi.org/10.7710/2162-3309.1220

INTRODUCTION As more and more research data becomes better and more easily available, data citation gains in importance. The management of research data has been high on the agenda in academia for more than five years. Nevertheless, not all data policies include data citation, and problems like versioning and granularity remain. SERVICE DESCRIPTION da|ra operates as an allocation agency for DataCite and offers the registration service for social and economic research data in Germany. The service is jointly run by GESIS and ZBW, thereby merging experiences on the fields of Social Sciences and Economics. The authors answer questions pertaining to the most frequent aspects of research data registration like versioning and granularity as well as recommend the use of persistent identifiers linked with enriched metadata at the landing page. NEXT STEPS The promotion of data sharing and the development of a citation culture among the scientific community are future challenges. Interoperability becomes increasingly important for publishers and infrastructure providers. The already existent heterogeneity of services demands solutions for better user guidance. Building information competence is an asset of libraries, which can and should be expanded to research data.

This work is licensed under a Creative Commons Attribution 4.0 Licensee.

Henderson, Margaret E., and Teresa L. Knott. "Starting a Research Data Management Program Based in a University Library." Medical Reference Services Quarterly 34, no. 1 (2015): 387-403. http://www.tandfonline.com/doi/full/10.1080/02763869.2015.986783#abstract

Henderson, Margaret, Yasmeen Shorish, and Steve Van Tuyl. "Research Data Management on a Shoestring Budget." Bulletin of the American Society for Information Science and Technology 40, no. 6 (2014): 14-17. https://www.asis.org/Bulletin/Aug-14/AugSep14_HendersonEtAl.pdf

Hendren, Christine Ogilvie, Christina M. Powers, Mark D, Hoover, and Stacey L. Harper. "The Nanomaterial Data Curation Initiative: A Collaborative Approach to Assessing, Evaluating, and Advancing the State of the Field." Beilstein Journal of Nanotechnology 6 (2015): 1752-1762. http://dx.doi.org/10.3762/bjnano.6.179

The Nanomaterial Data Curation Initiative (NDCI), a project of the National Cancer Informatics Program Nanotechnology Working Group (NCIP NanoWG), explores the critical aspect of data curation within the development of informatics approaches to understanding nanomaterial behavior. Data repositories and tools for integrating and interrogating complex nanomaterial datasets are gaining widespread interest, with multiple projects now appearing in the US and the EU. Even in these early stages of development, a single common aspect shared across all nanoinformatics resources is that data must be curated into them. Through exploration of sub-topics related to all activities necessary to enable, execute, and improve the curation process, the NDCI will provide a substantive analysis of nanomaterial data curation itself, as well as a platform for multiple other important discussions to advance the field of nanoinformatics. This article outlines the NDCI project and lays the foundation for a series of papers on nanomaterial data curation. The NDCI purpose is to: 1) present and evaluate the current state of nanomaterial data curation across the field on multiple specific data curation topics, 2) propose ways to leverage and advance progress for both individual efforts and the nanomaterial data community as a whole, and 3) provide opportunities for similar publication series on the details of the interactive needs and workflows of data customers, data creators, and data analysts. Initial responses from stakeholder liaisons throughout the nanoinformatics community reveal a shared view that it will be critical to focus on integration of datasets with specific orientation toward the purposes for which the individual resources were created, as well as the purpose for integrating multiple resources. Early acknowledgement and undertaking of complex topics such as uncertainty, reproducibility, and interoperability is proposed as an important path to addressing key challenges within the nanomaterial community, such as reducing collateral negative impacts and decreasing the time from development to market for this new class of technologies.

This work is licensed under a Creative Commons Attribution 2.0 Generic Licensee.

Hense, Andreas, and Florian Quadt. "Acquiring High Quality Research Data." D-Lib Magazine 17, no. 1/2 (2011). http://www.dlib.org/dlib/january11/hense/01hense.html

Herold, Philip. "Data Sharing Among Ecology, Evolution, and Natural Resources Scientists: An Analysis of Selected Publications." Journal of Librarianship and Scholarly Communication 3, no. 2 (2015): eP1244. http://doi.org/10.7710/2162-3309.1244

INTRODUCTION Understanding the differing data management practices among academic disciplines is an important way to inform existing and emerging library research support and services. This paper reports findings from a study of data sharing practices among ecology, evolution, and natural resources scientists at the University of Minnesota. It examines data sharing rates, methods, and disciplinary differences and discusses the characteristics of researchers, data, methods, and aspects of data sharing across this group of disciplines. METHODS Data sharing practices are investigated by reviewing the two most recently published research articles (n=155) for each faculty member (n=78) in three departments at a single large research university. All mentions of data sharing in each publication were pursued in order to locate, analyze, and characterize shared data. RESULTS Seventy-two of 155 (46%) articles indicated that related research data was publicly shared by some method. The most prevalent method for data sharing was via journal websites, with 91% of data sharing articles using this method. Ecology, evolution, and behavior scientists shared data at the highest rate (70% of their articles), contrasting with fisheries, wildlife, and conservation biologists (18%), and forest resources (16%). DISCUSSION Differences between data sharing practices may be attributable to a range of influences: funder, journal, and institutional policies; disciplinary norms; and perceived or real rewards or incentives, as well as contrasting concerns, cost, or other barriers to sharing data. CONCLUSION Study results suggest differential approaches to data services outreach based on discipline and research type and support the need for education and influence on both scientist and journal practices.

This work is licensed under a Creative Commons Attribution 4.0 License.

Herterich, Patricia, and Sünje Dallmeier-Tiessen. "Data Citation Services in the High-Energy Physics Community." D-Lib Magazine 22, no. 1/2 (2016). http://www.dlib.org/dlib/january16/herterich/01herterich.html

Higman, Rosie, and Stephen Pinfield. "Research Data Management and Openness: The Role of Data Sharing in Developing Institutional Policies and Practices." Program 49, no. 4 (2015): 364-381. http://eprints.whiterose.ac.uk/89888/

Hiom, Debra, Dom Fripp, Stephen Gray, Kellie Snow, and Damian Steer. "Research Data Management at the University of Bristol: Charting a Course from Project to Service." Program 49, no. 4 (2015): 475-493. http://dx.doi.org/10.1108/PROG-02-2015-0019

Hswe, Patricia, and Ann Holt. "Joining in the Enterprise of Response in the Wake of the NSF Data Management Planning Requirement." Research Library Issues, no. 274 (2011): 11-17. http://publications.arl.org/rli274/12

Huang, Hong, Corinne Jörgensen, Besiki Stvilia. "Genomics Data Curation Roles, Skills and Perception of Data Quality." Library & Information Science Research 37, no. 1 (2015): 10-20. http://dx.doi.org/10.1016/j.lisr.2014.08.003

Hunter, Jane. "Scientific Publication Packages—A Selective Approach to the Communication and Archival of Scientific Output." International Journal of Digital Curation 1, no. 1 (2006): 33-52. http://www.ijdc.net/index.php/ijdc/article/view/8

Ishida, Mayu. "The New England Collaborative Data Management Curriculum Pilot at the University of Manitoba: A Canadian Experience." Journal of eScience Librarianship 4, no. 2 (2015): e1061. http://escholarship.umassmed.edu/jeslib/vol3/iss1/10/

Jacobs, Clifford A., and Steven J. Worley. "Data Curation in Climate and Weather: Transforming Our Ability to Improve Predictions through Global Knowledge Sharing." International Journal of Digital Curation 4, no. 2 (2009): 68-79. http://www.ijdc.net/index.php/ijdc/article/view/119/122

Jahnke, Lori, Andrew Asher, and Spencer D. C. Keralis. The Problem of Data. Washington, DC: Council on Library and Information Resources, 2012. http://www.clir.org/pubs/reports/pub154

Johnson, Andrew M., and Shelley Knuth. "Data Management Plan Requirements for Campus Grant Competitions: Opportunities for Research Data Services Assessment and Outreach." Journal of eScience Librarianship 5, no. 1 (2016): e1089. http://escholarship.umassmed.edu/jeslib/vol5/iss1/1/

Objective: To examine the effects of research data services (RDS) on the quality of data management plans (DMPs) required for a campus-level faculty grant competition, as well as to explore opportunities that the local DMP requirement presented for RDS outreach.

Methods: Nine reviewers each scored a randomly assigned portion of DMPs from 82 competition proposals. Each DMP was scored by three reviewers, and the three scores were averaged together to obtain the final score. Interrater reliability was measured using intraclass correlation. Unpaired t-tests were used to compare mean DMP scores for faculty who utilized RDS services with those who did not. Unpaired t-tests were also used to compare mean DMP scores for proposals that were funded with proposals that were not funded. One-way ANOVA was used to compare mean DMP scores among proposals from six broad disciplinary categories.

Results: Analyses showed that RDS consultations had a statistically significant effect on DMP scores. Differences between DMP scores for funded versus unfunded proposals and among disciplinary categories were not significant. The DMP requirement also provided a number of both expected and unexpected outreach opportunities for RDS services.

Conclusions: Requiring DMPs for campus grant competitions can provide important assessment and outreach opportunities for research data services. While these results might not be generalizable to DMP review processes at federal funding agencies, they do suggest the importance, at any level, of developing a shared understanding of what constitutes a high quality DMP among grant applicants, grant reviewers, and RDS providers.

This work is licensed under a Creative Commons Attribution 4.0 License.

Johnson, Andrew W., and Megan M. Bresnahan. "DataDay!: Designing and Assessing a Research Data Workshop for Subject Librarians." Journal of Librarianship and Scholarly Communication 3, no. 2 (2015): eP1229. http://doi.org/10.7710/2162-3309.1229

BACKGROUND Many libraries have launched or adapted services to address the research data needs of campus faculty and students. At the University of Colorado Boulder (CU-Boulder), local demand for research data training emerged from a broader assessment of training needs for subject librarians. The findings from this assessment led to the development of a day-long workshop called DataDay! that aimed to expand and translate the skills of subject librarians into the context of research data support. DESCRIPTION OF PROGRAM The DataDay! workshop incorporated hands-on exercises with expert presentations, informal discussions, and print handouts. The workshop allowed participants to gain experience with activities like working with real data sets and developing materials for outreach about research data services. Several instruments were used to assess the workshop learning outcomes, which included changes in knowledge and comfort levels related to engaging in research data support. Assessment activities also measured how well participants applied concepts taught in the workshop to novel situations. NEXT STEPS Future research data training efforts for CU-Boulder librarians will be informed by the DataDay! workshop assessment results, and this workshop may provide a model for other institutions to use to train subject librarians to adapt to new roles in support of research data. There is also a need for the lessons learned from local training efforts like DataDay! to inform the development of resources to support the broader subject librarian community as their institutions launch and grow research data services.

This work is licensed under a Creative Commons Attribution 4.0 License.

Johnston, Lisa, Meghan Lafferty, and Beth Petsan. "Training Researchers on Data Management: A Scalable, Cross-Disciplinary Approach." Journal of eScience Librarianship 1, no. 2 (2012): e1012. http://dx.doi.org/10.7191/jeslib.2012.1012

Johnston, Wayne. "Digital Preservation Initiatives in Ontario: Trusted Digital Repositories and Research Data Repositories." Partnership: The Canadian Journal of Library and Information Practice and Research 7, no. 2 (2012). http://journal.lib.uoguelph.ca/index.php/perj/article/viewArticle/2014

Joint, Nicholas. "Data Preservation, the New Science and the Practitioner Librarian." Library Review 56, no. 6 (2007): 451-455. http://strathprints.strath.ac.uk/id/eprint/7182

Jones, Sarah. "Developments in Research Funder Data Policy." International Journal of Digital Curation 7, no. 1 (2012): 114-125. http://www.ijdc.net/index.php/ijdc/article/view/209/278

This paper reviews developments in funders' data management and sharing policies, and explores the extent to which they have affected practice. The Digital Curation Centre has been monitoring UK research funders' data policies since 2008. There have been significant developments in subsequent years, most notably the joint Research Councils UK's Common Principles on Data Policy and the Engineering and Physical Sciences Research Council's Policy Framework on Research Data. This paper charts these changes and highlights shifting emphasises in the policies. Institutional data policies and infrastructure are increasingly being developed as a result of these changes. While action is clearly being taken, questions remain about whether the changes are affecting practice on the ground.

This work is licensed under a Creative Commons Attribution License.

Jones, Sarah, Alexander Ball, and Çuna Ekmekcioglu. "The Data Audit Framework: A First Step in the Data Management Challenge." International Journal of Digital Curation 3, no. 2 (2008): 112-120. http://www.ijdc.net/index.php/ijdc/article/view/91

Jones, Sarah, Graham Pryor, and Angus Whyte. How to Develop Research Data Management Services—A Guide for HEIs. Edinburgh: Digital Curation Centre, 2013. http://www.dcc.ac.uk/resources/how-guides/how-develop-rdm-services

The purpose of this guide is to help institutions understand the key aims and issues associated with planning and implementing research data management (RDM) services. It explains the components and processes of RDM services and describes the roles and responsibilities of those who will deliver and use them.

This work is licensed under a Creative Commons Creative Commons Attribution 2.5 Scotland License.

Joque, Justin. "From Data to the Creation of Meaning Part 1: Unit of Analysis as Epistemological Problem." IASSIST Quarterly 38, no. 2 (2014): 7-11. http://www.iassistdata.org/iq/data-creation-meaning-part-1-unit-analysis-epistemological-problem

Kaboré, Esther Dzalé Yeumo. "Opening and Linking Agricultural Research Data." D-Lib Magazine 20, no. 1/2 (2014). http://www.dlib.org/dlib/january14/kabore/01kabore.html

Kafel, Donna, Andrew T. Creamer, and Elaine R. Martin. "Building the New England Collaborative Data Management Curriculum." Journal of eScience Librarianship 3, no. 1 (2014): e1066. http://dx.doi.org/10.7191/jeslib.2014.1066

Kansa, Eric C., Sarah Whitcher Kansa, and Benjamin Arbuckle. "Publishing and Pushing: Mixing Models for Communicating Research Data in Archaeology." International Journal of Digital Curation 9, no. 1 (2014): 57-70. http://www.ijdc.net/index.php/ijdc/article/view/9.1.57/341

We present a case study of data integration and reuse involving 12 researchers who published datasets in Open Context, an online data publishing platform, as part of collaborative archaeological research on early domesticated animals in Anatolia. Our discussion reports on how different editorial and collaborative review processes improved data documentation and quality, and created ontology annotations needed for comparative analyses by domain specialists. To prepare data for shared analysis, this project adapted editor-supervised review and revision processes familiar to conventional publishing, as well as more novel models of revision adapted from open source software development of public version control. Preparing the datasets for publication and analysis required significant investment of effort and expertise, including archaeological domain knowledge and familiarity with key ontologies. To organize this work effectively, we emphasized these different models of collaboration at various stages of this data publication and analysis project. Collaboration first centered on data editors working with data contributors, then widened to include other researchers who provided additional peer-review feedback, and finally the widest research community, whose collaboration is facilitated by GitHub's version control system. We demonstrate that the "publish" and "push" models of data dissemination need not be mutually exclusive; on the contrary, they can play complementary roles in sharing high quality data in support of research. This work highlights the value of combining multiple models in different stages of data dissemination.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Kaplan, Diane E. "The Stanley Milgram Papers: A Case Study on Appraisal of and Access to Confidential Data Files." American Archivist 59, no. 3 (2009): 288-297. http://americanarchivist.org/doi/abs/10.17723/aarc.59.3.k3245057x1902078

Karasti, Helena, and Karen S. Baker. "Digital Data Practices and the Long Term Ecological Research Program Growing Global." International Journal of Digital Curation 3, no. 2 (2008): 42-58. http://www.ijdc.net/index.php/ijdc/article/view/86/57

Karasti, Helena, Karen S. Baker, and Eija Halkola. "Enriching the Notion of Data Curation in E-science: Data Managing and Information Infrastructuring in the Long Term Ecological Research (LTER) Network." Computer Supported Cooperative Work 15, no. 4 (2006): 321-358. http://www.springerlink.com/content/f778uh7077914q20/

Keil, Deborah E. "Research Data Needs from Academic Libraries: The Perspective of a Faculty Researcher." Journal of Library Administration 54, no. 3 (2014): 233-240. http://www.tandfonline.com/doi/abs/10.1080/01930826.2014.915168#.U5tHiihgjm4

Kellam, Lynda, and Kristi Thompson, eds. Databrarianship: The Academic Data Librarian in Theory and Practice Chicago: ALA, 2016. http://www.alastore.ala.org/detail.aspx?ID=11774

Kennan, Mary Anne, Sheila Corrall, and Waseem Afzal. "'Making Space' in Practice And Education: Research Support Services in Academic Libraries." Library Management 35, no. 8/9 (2014): 666-683. http://www.emeraldinsight.com/doi/abs/10.1108/LM-03-2014-0037

Kenyon, Jeremy, Bruce Godfrey, and Gail Z. Eckwright. "Geospatial Data Curation at the University of Idaho." Journal of Web Librarianship 6, no. 4 (2012): 251-262. http://www.tandfonline.com/doi/full/10.1080/19322909.2012.729983

Kerby, Erin E. "Research Data Practices in Veterinary Medicine: A Case Study." Journal of eScience Librarianship 4, no. 1 (2015): e1073. http://escholarship.umassmed.edu/jeslib/vol4/iss1/6/

Kervin, Karina E., William K. Michener, and Robert B. Cook. "Common Errors in Ecological Data Sharing." Journal of eScience Librarianship 2, no. 2 (2013): e1024. http://dx.doi.org/10.7191/jeslib.2013.1024

Khan, Huda, Brian Caruso, Jon Corson-Rikert, Dianne Dietrich, Brian Lowe, and Gail Steinhart. "DataStaR: Using the Semantic Web approach for Data Curation." International Journal of Digital Curation 62, no. 2 (2011). http://www.ijdc.net/index.php/ijdc/article/view/192/257

Khayat, Mohammad, and Steven J. Kempler. "Life Cycle Management Considerations of Remotely Sensed Geospatial Data and Documentation for Long Term Preservation." Journal of Map & Geography Libraries: Advances in Geospatial Information, Collections & Archives 11, no. 3 (2015): 271-288. http://www.tandfonline.com/doi/abs/10.1080/15420353.2015.1072122?journalCode=wmgl20

Kim, Jeonghyun. "Data Sharing and Its Implications for Academic Libraries." New Library World 114, no. 11/12 (2013): 494-506. http://www.emeraldinsight.com/journals.htm?articleid=17100244

Kim, Suntae, and Wongoo Lee. "Global Data Repository Status and Analysis: Based on Korea, China and Japan." Library Hi Tech 32, no. 4 (2014): 706-722. http://www.emeraldinsight.com/doi/abs/10.1108/LHT-06-2014-0064

Kim, Youngseek, Benjamin K. Addom, and Jeffrey M. Stanton. "Education for eScience Professionals: Integrating Data Curation and Cyberinfrastructure." International Journal of Digital Curation 6, no. 1 (2011): 125-138. http://www.ijdc.net/index.php/ijdc/article/view/168/236

Klump, Jens, Roland Bertelmann, Jan Brase, Michael Diepenbroek, Hannes Grobe, Heinke Höck, Michael Lautenschlager, Uwe Schindler, Irina Sens, and Joachim Wächter. "Data Publication in the Open Access Initiative." Data Science Journal. 5 (2006): 79-83. http://doi.org/10.2481/dsj.5.79

Knight, Gareth. "Building a Research Data Management Service for the London School of Hygiene & Tropical Medicine." Program 49, no. 4 (2015): 424-439. http://dx.doi.org/10.1108/PROG-01-2015-0011

———. "A Digital Curate's Egg: A Risk Management Approach to Enhancing Data Management Practices." Journal of Web Librarianship 6, no. 4 (2012): 228-250. http://www.tandfonline.com/doi/abs/10.1080/19322909.2012.729992

Knight, Gareth, and Maureen Pennock. "Data without Meaning: Establishing the Significant Properties of Digital Research." International Journal of Digital Curation 4, no. 1 (2009): 159-174. http://www.ijdc.net/index.php/ijdc/article/view/110/87

Knuth, Shelley L., Andrew M. Johnson, and Thomas Hauser. "Research Data Services at the University of Colorado Boulder." Bulletin of the Association for Information Science and Technology 41, no. 6 (2015): 35-38. https://www.asist.org/publications/bulletin/aug-2015/research-data-services/

Kolb, Tracy L., E. Agnes Blukacz-Richards, Andrew M. Muir, Randall M. Claramunt, Marten A. Koops, William W. Taylor, Trent M. Sutton, Michael T. Arts, and Ed Bissel. "How to Manage Data to Enhance Their Potential for Synthesis, Preservation, Sharing, and Reuse—A Great Lakes Case Study." Fisheries 38, no. 2 (2013): 52-64. http://www.tandfonline.com/doi/full/10.1080/03632415.2013.757975#.Ub36a_nfBsk

Koltay, Tibor. "Data Literacy: In Search of a Name and Identity." Journal of Documentation 71, no. 2 (2015): 401-415. http://dx.doi.org/10.1108/JD-02-2014-0026

Kong, Nicole Ningning. "Exploring Best Management Practices for Geospatial Data in Academic Libraries." Journal of Map & Geography Libraries 11, no. 2 (2015): 207-225. http://dx.doi.org/10.1080/15420353.2015.1043170

Kotarski, Rachael, Susan Reilly, Sabine Schrimpf, Eefke Smit, and Karen Walshe. Best Practices for Citability of Data and Evolving Roles in Scholarly Communication. Geneva: Opportunities for Data Exchange, 2012. https://core.ac.uk/download/files/324/30437756.pdf

This report sets out the current thinking on data citation best practice and presents the results of a survey of librarians asking how new support roles could and should be developed. The findings presented here build on the extensive desk research carried out for the report "Integration of Data and Publication" (Reilly, Schallier, Schrimpf, Smit, & Wilkinson, Sept 2011), which identified that data citation was an area of opportunity for both researchers and libraries. That report also recounted the findings of a workshop held at the LIBER 2011 Conference in Barcelona. The workshop, based on preliminary findings on the integration of data and publications, revealed that, although libraries saw the emerging research data landscape as an opportunity, there was a real need to define future directions and the scope of the role of libraries in data exchange. The issue of data citation was also identified as a fundamental issue to be addressed when exploring the way forward. This previous work is supported here with further information gathered through extensive desk research, structured interviews and an online survey of LIBER members to explore best practice in data citation and evolving support roles for libraries.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Kouper, Inna. "CLIR/DLF Digital Curation Postdoctoral Fellowship—The Hybrid Role of Data Curator." Bulletin of the American Society for Information Science and Technology 39, no. 2 (2013): 46-47. http://www.asis.org/Bulletin/Dec-12/DecJan13_RDAP_Kouper.pdf

Kowalczyk, Stacy, and Kalpana Shankar. "Data Sharing in the Sciences." Annual Review of Information Science and Technology 45, no. 1 (2011): 247-294. http://onlinelibrary.wiley.com/doi/10.1002/aris.2011.1440450113/abstract

Kozlowski, Wendy. "Funding Agency Responses to Federal Requirements for Public Access to Research Results." Bulletin of the American Society for Information Science and Technology 40, no. 6 (2014): 26-30. http://www.asis.org/Bulletin/Aug-14/AugSep14_Kozlowski.pdf

Kraft, Angelina, Matthias Razum, Jan Potthoff, Andrea Porzel, Thomas Engel, Frank Lange, Karina van den Broek, and Filipe Furtado. "The RADAR Project—A Service for Research Data Archival and Publication." ISPRS International Journal of Geo-Information 5, no. 3 (2016): 28. http://dx.doi.org/10.3390/ijgi5030028

The aim of the RADAR (Research Data Repository) project is to set up and establish an infrastructure that facilitates research data management: the infrastructure will allow researchers to store, manage, annotate, cite, curate, search and find scientific data in a digital platform available at any time that can be used by multiple (specialized) disciplines. While appropriate and innovative preservation strategies and systems are in place for the big data communities (e.g., environmental sciences, space, and climate), the stewardship for many other disciplines, often called the "long tail research domains", is uncertain. Funded by the German Research Foundation (DFG), the RADAR collaboration project develops a service oriented infrastructure for the preservation, publication and traceability of (independent) research data. The key aspect of RADAR is the implementation of a two-stage business model for data preservation and publication: clients may preserve research results for up to 15 years and assign well-graded access rights, or to publish data with a DOI assignment for an unlimited period of time. Potential clients include libraries, research institutions, publishers and open platforms that desire an adaptable digital infrastructure to archive and publish data according to their institutional requirements and workflows.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Kratz, John, and Carly Strasser. "Data Publication Consensus and Controversies." F1000Research, no. 3 (2014): 94. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4097345/pdf/f1000research-3-5878.pdf

The movement to bring datasets into the scholarly record as first class research products (validated, preserved, cited, and credited) has been inching forward for some time, but now the pace is quickening. As data publication venues proliferate, significant debate continues over formats, processes, and terminology. Here, we present an overview of data publication initiatives underway and the current conversation, highlighting points of consensus and issues still in contention. Data publication implementations differ in a variety of factors, including the kind of documentation, the location of the documentation relative to the data, and how the data is validated. Publishers may present data as supplemental material to a journal article, with a descriptive "data paper," or independently. Complicating the situation, different initiatives and communities use the same terms to refer to distinct but overlapping concepts. For instance, the term published means that the data is publicly available and citable to virtually everyone, but it may or may not imply that the data has been peer-reviewed. In turn, what is meant by data peer review is far from defined; standards and processes encompass the full range employed in reviewing the literature, plus some novel variations. Basic data citation is a point of consensus, but the general agreement on the core elements of a dataset citation frays if the data is dynamic or part of a larger set. Even as data publication is being defined, some are looking past publication to other metaphors, notably "data as software," for solutions to the more stubborn problems.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Krier, Laura, and Carly A. Strasser. Data Management for Libraries: A LITA Guide. Chicago: ALA, 2014. http://www.alastore.ala.org/detail.aspx?ID=10737

Kruse, Filip, and Jesper Boserup Thestrup. "Research Libraries' New Role in Research Data Management, Current Trends and Visions in Denmark." LIBER Quarterly 23, no. 4 (2014): 310-333. http://liber.library.uu.nl/index.php/lq/article/view/9173

The amount of research data is growing constantly, due to new technology with new potentials for collecting and analysing both digital data and research objects. This growth creates a demand for a coherent IT-infrastructure. Such an infrastructure must be able to provide facilities for storage, preservation and a more open access to data in order to fulfil the demands from the researchers themselves, the research councils and research foundations.

This paper presents the findings of a research project carried out under the auspices of DEFF (Danmarks Elektroniske Fag- og Forskningsbibliotek—Denmark's Electronic research Library)[i] to analyse how the Danish universities store, preserve and provide access to research data. It shows that they do not have a common IT-infrastructure for research data management. This paper describes the various paths chosen by individual universities and research institutions, and the background for their strategies of research data management. Among the main reasons for the uneven practices are the lack of a national policy in this field, the different scientific traditions and cultures and the differences in the use and organization of IT-services.

This development contains several perspectives that are of particular relevance to research libraries. As they already curate digital collections and are active in establishing web archives, the research libraries become involved in research and dissemination of knowledge in new ways. This paper gives examples of how The State and University Library's services facilitate research data management with special regard to digitization of research objects, storage, preservation and sharing of research data. This paper concludes that the experience and skills of research libraries make the libraries important partners in a research data management infrastructure.

This work is licensed under a Creative Commons Attribution 4.0 License.

Kugler, Tracy A., David C. Van Riper, Steven M. Manson, David A. Haynes II, Joshua Donato, and Katie Stinebaugh. "Terra Populus: Workflows for Integrating and Harmonizing Geospatial Population and Environmental Data." Journal of Map & Geography Libraries 11, no. 2 (2015): 180-206. http://dx.doi.org/10.1080/15420353.2015.1036484

Kutay, Stephen. "Advancing Digital Repository Services for Faculty Primary Research Assets: An Exploratory Study." The Journal of Academic Librarianship 40, no. 6 (2014): 642-649. http://www.sciencedirect.com/science/article/pii/S0099133314001827

Lage, Kathryn, Barbara Losoff, and Jack Maness. "Receptivity to Library Involvement in Scientific Data Curation: A Case Study at the University of Colorado Boulder." portal: Libraries & the Academy 11, no. 4 (2011): 915-937. http://muse.jhu.edu/login?auth=0&type=summary&url=/journals/portal_libraries_and_the_academy/v011/11.4.lage.html

Lagoze, Carl. "eBird: Curating Citizen Science Data for Use by Diverse Communities." International Journal of Digital Curation 9, no. 1 (2014): 71-82. http://www.ijdc.net/index.php/ijdc/article/view/9.1.71/342

In this paper we describe eBird, a highly successful citizen science project. With over 150,000 participants worldwide and an accumulation of over 140,000,000 bird observations globally in the last decade, eBird has evolved into a major tool for scientific investigations in diverse fields such as ornithology, computer science, statistics, ecology and climate change. eBird's impact in scientific research is grounded in careful data curation practices that pay attention to all stages of the data lifecycle, and attend to the needs of stakeholders engaged in that data lifecycle. We describe the important aspects of eBird, paying particular attention to the mechanisms to improve data quality; describe the data products that are available to the global community; investigate some aspects of the downloading community; and demonstrate significant results that derive from the use of openly-available eBird data.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Lamb, Ian, and Catherine Larson. "Shining a Light on Scientific Data: Building a Data Catalog to Foster Data Sharing and Reuse." Code4Lib Journal, no. 32 (2016). http://journal.code4lib.org/articles/11421

The scientific community's growing eagerness to make research data available to the public provides libraries—with our expertise in metadata and discovery—an interesting new opportunity. This paper details the in-house creation of a "data catalog" which describes datasets ranging from population-level studies like the US Census to small, specialized datasets created by researchers at our own institution. Based on Symfony2 and Solr, the data catalog provides a powerful search interface to help researchers locate the data that can help them, and an administrative interface so librarians can add, edit, and manage metadata elements at will. This paper will outline the successes, failures, and total redos that culminated in the current manifestation of our data catalog.

This work is licensed under a Creative Commons Attribution 3.0 United States License.

Lambert, Paul, Vernon Gayle, Larry Tan, Ken Turner, Richard Sinnott, and Ken Prandy. "Data Curation Standards and Social Science Occupational Information Resources." International Journal of Digital Curation 2, no. 1 (2007): 73-91. http://www.ijdc.net/index.php/ijdc/article/view/26/15

Latham, Bethany, and Jodi Welch Poe. "The Library as Partner in University Data Curation: A Case Study in Collaboration." Journal of Web Librarianship 6, no. 4 (2012): 288-304. http://www.tandfonline.com/doi/full/10.1080/19322909.2012.729429

Laughton, P., and T. du Plessis. "Data Curation in the World Data System: Proposed Framework." Data Science Journal 12 (2013): 56-70. http://datascience.codata.org/articles/abstract/10.2481/dsj.13-029/

The value of data in society is increasing rapidly. Organisations that work with data should have standard practices in place to ensure successful curation of data. The World Data System (WDS) consists of a number of data centres responsible for curating research data sets for the scientific community. The WDS has no formal data curation framework or model in place to act as a guideline for member data centres. The objective of this research was to develop a framework for the curation of data in the WDS. A multiple-case case study was conducted. Interviews were used to gather qualitative data and analysis of the data, which led to the development of this framework. The proposed framework is largely based on the Open Archival Information System (OAIS) functional model and caters for the curation of both analogue and digital data.

This work is licensed under a Creative Commons Attribution 3.0 License.

Laure, Erwin, and Dejan Vitlacil. "Data Storage and Management for Global Research Data Infrastructures—Status and Perspectives." Data Science Journal 12 (2013): GRDI37-GRDI42. http://datascience.codata.org/articles/abstract/10.2481/dsj.GRDI-007/

In the vision of Global Research Data Infrastructures (GRDIs), data storage and management plays a crucial role. A successful GRDI will require a common globally interoperable distributed data system, formed out of data centres, that incorporates emerging technologies and new scientific data activities. The main challenge is to define common certification and auditing frameworks that will allow storage providers and data communities to build a viable partnership based on trust. To achieve this, it is necessary to find a long-term commitment model that will give financial, legal, and organisational guarantees of digital information preservation. In this article we discuss the state of the art in data storage and management for GRDIs and point out future research directions that need to be tackled to implement GRDIs.

This work is licensed under a Creative Commons Attribution 3.0 License.

Lawrence, Bryan, Catherine Jones, Brian Matthews, Sam Pepler, and Sarah Callaghan. "Citation and Peer Review of Data: Moving towards Formal Data Publication." International Journal of Digital Curation 6, no. 2 (2011): 4-37. http://www.ijdc.net/index.php/ijdc/article/view/181/265

Layne, R., A. Capel, N. Coo, and M. Wheatley. "Long Term Preservation of Scientific Data: Lessons from Jet and Other Domains." Fusion Engineering and Design 87, no. 12 (2012): 2209-2212. http://www.sciencedirect.com/science/article/pii/S0920379612003225?np=y

Leadbetter, A., L. Raymond, C. Chandler, L. Pikula, P. Pissierssens, and E. Urban. Ocean Data Publication Cookbook. Oostende, Belgium: UNESCO, 2013. http://www.iode.org/index.php?option=com_oe&task=viewDocumentRecord&docID=10574

Lee, Dong Joon, and Besiki Stvilia. "Developing a Data Identifier Taxonomy." Cataloging & Classification Quarterly 52, no. 3 (2014): 303-336. http://www.tandfonline.com/doi/abs/10.1080/01639374.2014.880166

Littauer, Richard, Karthik Ram, Bertram Ludäscher, William Michener, and Rebecca Koskela. "Trends in Use of Scientific Workflows: Insights from a Public Repository and Recommendations for Best Practice." International Journal of Digital Curation 7, no. 2 (2012): 92-100. http://www.ijdc.net/index.php/ijdc/article/view/222/291

Scientific workflows are typically used to automate the processing, analysis and management of scientific data. Most scientific workflow programs provide a user-friendly graphical user interface that enables scientists to more easily create and visualize complex workflows that may be comprised of dozens of processing and analytical steps. Furthermore, many workflows provide mechanisms for tracing provenance and methodologies that foster reproducible science. Despite their potential for enabling science, few studies have examined how the process of creating, executing, and sharing workflows can be improved. In order to promote open discourse and access to scientific methods as well as data, we analyzed a wide variety of workflow systems and publicly available workflows on the public repository myExperiment. It is hoped that understanding the usage of workflows and developing a set of recommended best practices will lead to increased contribution of workflows to the public domain.

This work is licensed under a Creative Commons Attribution License.

Locher, Anita E. "Starting Points for Lowering the Barrier to Spatial Data Preservation." Journal of Map & Geography Libraries 12, no. 1 (2016): 28-51. http://dx.doi.org/10.1080/15420353.2015.1080781

Losee, Robert M. "Informational Facts and the Metainformation Inherent in IFacts: The Soul of Data Sciences." Journal of Library Metadata 13, no. 1 (2013): 59-74. http://www.tandfonline.com/doi/abs/10.1080/19386389.2013.778732#.Ub37EPnfBsk

Lötter, Lucia, and Christa van Zyl. "A Reflection on a Data Curation Journey." Journal of Empirical Research on Human Research Ethics 10. no. 3 (2015): 338-343. http://dx.doi.org/10.1177/1556264615592846

This commentary is a reflection on experience of data preservation and sharing (i.e., data curation) practices developed in a South African research organization. The lessons learned from this journey have echoes in the findings and recommendations emerging from the present study in Low and Middle-Income Countries (LMIC) and may usefully contribute to more general reflection on the management of change in data practice.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Lubell, Josh, Sudarsan Rachuri, Mahesh Mani, and Eswaran Subrahmanian. "Sustaining Engineering Informatics: Toward Methods and Metrics for Digital Curation." International Journal of Digital Curation 3, no. 2 (2008). http://www.ijdc.net/index.php/ijdc/article/view/87/58

Lynch, Clifford. "The Need for Research Data Inventories and the Vision for SHARE." Information Standards Quarterly 26, no. 2 (2014): 29-31. http://www.niso.org/publications/isq/2014/v26no2/lynch/

Lyon, Liz. "The Informatics Transform: Re-engineering Libraries for the Data Decade." International Journal of Digital Curation 7, no. 1 (2012): 126-138. http://www.ijdc.net/index.php/ijdc/article/view/210/279

In this paper, Liz Lyon explores how libraries can re-shape to better reflect the requirements and challenges of today's data-centric research landscape. The Informatics Transform presents five assertions as potential pathways to change, which will help libraries to re-position, re-profile, and re-structure to better address research data management challenges. The paper deconstructs the institutional research lifecycle and describes a portfolio of ten data support services which libraries can deliver to support the research lifecycle phases. Institutional roles and responsibilities for research data management are also unpacked, building on the framework from the earlier Dealing with Data Report. Finally, the paper examines critical capacity and capability challenges and proposes some innovative steps to addressing the significant skills gaps.

This work is licensed under a Creative Commons Attribution License.

Macdonald, Stuart, and Luis Martinez-Uribe. "Collaboration to Data Curation: Harnessing Institutional Expertise." New Review of Academic Librarianship 16, no. supplement 1 (2010): 4-16. http://www.tandfonline.com/doi/full/10.1080/13614533.2010.505823

MacMillan, Don. "Data Sharing and Discovery: What Librarians Need to Know." The Journal of Academic Librarianship 40, no. 5 (2014): 541-549. http://www.sciencedirect.com/science/article/pii/S0099133314000950

———. "Developing Data Literacy Competencies to Enhance Faculty Collaborations." LIBER Quarterly 24, no. 3 (2015): 140-160. http://doi.org/10.18352/lq.9868

In order to align information literacy instruction with changing faculty and student needs, librarians must expand their skills and competencies beyond traditional information sources. In the sciences, this increasingly means integrating the data resources used by researchers into instruction for undergraduate students. Open access data repositories allow students to work with more primary data than ever before, but only if they know how and where to look. This paper will describe the development of two information literacy workshops designed to scaffold student learning in the biological sciences across two second-year courses, detailing the long-term collaboration between a librarian and an instructor that now serves over 500 students per semester. In each workshop, students are guided through the discovery and analysis of life sciences data from multiple sites, encouraged to integrate text and data sources, and supported in completing research assignments.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Maday, Charlotte, and Magalie Moysan. "Records Management for Scientific Data." Archives and Manuscripts 42, no. 2 (2014): 190-192. http://www.tandfonline.com/doi/abs/10.1080/01576895.2014.911686

Mallery, Mary. "Dmptool: Guidance and Resources for Your Data Management Plan." Technical Services Quarterly 31, no. 2 (2014): 197-199. http://dx.doi.org/10.1080/07317131.2014.875394

Manghi, Paolo, Lukasz Bolikowski, Natalia Manold, Jochen Schirrwagen, and Tim Smith. "OpenAIREplus: the European Scholarly Communication Data Infrastructure." D-Lib Magazine 18, no. 9/10 (2012). http://dx.doi.org/10.1045/september2012-manghi

Mannheimer, Ayoung Yoon, Jane Greenberg, Elena Feinstein, and Ryan Scherle. "A Balancing Act: The Ideal and the Realistic in Dveloping Dryad's Preservation Policy." First Monday 19, no, 8 (2014). http://dx.doi.org/10.5210/fm.v19i8.5415

Marcial, Laura Haak, and Bradley M. Hemminger. "Scientific Data Repositories on the Web: An Initial Survey." Journal of the American Society for Information Science and Technology 61, no. 10 (2010): 2029-2048. http://dx.doi.org/10.1002/asi.21339

Marshall, Brianna, Katherine O'Bryan, Na Qin, and Rebecca Vernon. "Organizing, Contextualizing, and Storing Legacy Research Data: A Case Study of Data Management for Librarians." Issues in Science and Technology Librarianship, no. 74 (2013). http://www.istl.org/13-fall/article1.html

Martinez-Uribe, Luis, and Stuart Macdonald. "User Engagement in Research Data Curation." Lecture Notes in Computer Science 5714 (2009): 309-314. http://www.springerlink.com/content/7mnq13x34717p483

Mathiak, Brigitte, and Katarina Boland. "Challenges in Matching Dataset Citation Strings to Datasets in Social Science." D-Lib Magazine 21, no. 1/2 (2015). http://doi.org/10.1371/journal.pone.0118053

Mattern, Eleanor, Wei Jeng, Daqing He, Liz Lyon, and Aaron Brenner. "Using Participatory Design and Visual Narrative Inquiry to Investigate Researchers—Data Challenges and Recommendations for Library Research Data Services." Program 49, no. 4 (2015): 408-423. http://d-scholarship.pitt.edu/26143/

Matthews, Brian, Shoaib Sufi, Damian Flannery, Laurent Lerusse, Tom Griffin, Michael Gleaves, and Kerstin Kleese. "Using a Core Scientific Metadata Model in Large-Scale Facilities." International Journal of Digital Curation 5, no. 1 (2010): 106-118. http://www.ijdc.net/index.php/ijdc/article/view/149/211

Mattmann, C., Crichton, D. J., A. F. Hart, S. C. Kelly, and J. S. Hughes. "Experiments with Storage and Preservation of NASA's Planetary Data via the Cloud." IT Professional 12, no. 5 (2010): 28-35. http://www.computer.org/csdl/mags/it/2010/05/mit2010050028-abs.html

Mauthner, Natasha Susan, and Odette Parry. "Open Access Digital Data Sharing: Principles, Policies and Practices." Social Epistemology: A Journal of Knowledge, Culture and Policy 27, no. 1 (2013): 47-67. http://www.tandfonline.com/doi/abs/10.1080/02691728.2012.760663

Mayernik, Matthew S. "Data Citation Initiatives and Issues." Bulletin of the American Society for Information Science and Technology 38, no. 5 (2012): 23-28. http://www.asis.org/Bulletin/Jun-12/JunJul12_MayernikDataCitation.pdf

———. "Research Data and Metadata Curation as Institutional Issues." Journal of the Association for Information Science and Technology 67, no. 4 (2015): 973-993. http://onlinelibrary.wiley.com/doi/10.1002/asi.23425/abstract

Mayernik, Matthew S., Sarah Callaghan, Roland Leighm, Jonathan Tedds, and Steven Worley. "Peer Review of Datasets: When, Why, and How." Bulletin of the American Meteorological Society 96 (2015): 191-201. http://dx.doi.org/10.1175/BAMS-D-13-00083.1

Mayernik, Matthew S., G. Sayeed Choudhury, Tim DiLauro, Elliot Metsger, Barbara Pralle, Mike Rippin, and Ruth Duerr. "The Data Conservancy Instance: Infrastructure and Organizational Services for Research Data Curation." D-Lib Magazine 18, no. 9/10 (2012). http://www.dlib.org/dlib/september12/mayernik/09mayernik.html

Mayernik, Matthew S., Tim DiLauro, Ruth Duerr, Elliot Metsger, Anne E. Thessen, and G. Sayeed Choudhury. "Data Conservancy Provenance, Context, and Lineage Services: Key Components for Data Preservation and Curation." Data Science Journal 12 (2013): 158-171. http://datascience.codata.org/articles/abstract/10.2481/dsj.12-039/

Among the key services that institutional data management infrastructures must provide are provenance and lineage tracking and the ability to associate data with contextual information needed for understanding and use. These functionalities are critical for addressing a number of key issues faced by data collectors and users, including trust in data, results traceability, data transparency, and data citation support. In this paper, we describe the support for these services within the Data Conservancy Service (DCS) software. The DCS provenance, context, and lineage services cross the four layers in the DCS data curation stack model: storage, archiving, preservation, and curation.

This work is licensed under a Creative Commons Attribution 3.0 License.

McEwen, Leah, and Ye Li. "Academic Librarians at Play in the Field of Cheminformatics: Building the Case for Chemistry Research Data Management." Journal of Computer-Aided Molecular Design 28, no. 10 (2014): 975-988. http://dx.doi.org/10.1007/s10822-014-9777-4

McGarva, Guy, Steve Morris, and Greg Janée. Preserving Geospatial Data. York, UK: Digital Preservation Coalition, 2009. http://www.dpconline.org/component/docman/doc_download/363-preserving-geospatial-data-by-guy-mcgarva-steve-morris-and-gred-greg-janee

McLure, Merinda, Allison V. Level, Catherine L. Cranston, Beth Oehlert, and Mike Culbertson. "Data Curation: A Study of Researcher Practices and Needs." portal: Libraries and the Academy 14, no. 2 (2014). http://muse.jhu.edu/login?auth=0&type=summary&url=/journals/portal_libraries_and_the_academy/v014/14.2.mclure.html

McNally, Ruth, Adrian Mackenzie, Allison Hui, and Jennifer Tomomitsu. "Understanding the 'Intensive' in 'Data Intensive Research': Data Flows in Next Generation Sequencing and Environmental Networked Sensors." International Journal of Digital Curation 7, no. 1 (2012): 81-94. http://www.ijdc.net/index.php/ijdc/article/view/206/275

Genomic and environmental sciences represent two poles of scientific data. In the first, highly parallel sequencing facilities generate large quantities of sequence data. In the latter, loosely networked remote and field sensors produce intermittent streams of different data types. Yet both genomic and environmental sciences are said to be moving to data intensive research. This paper explores and contrasts data flow in these two domains in order to better understand how data intensive research is being done. Our case studies are next generation sequencing for genomics and environmental networked sensors.

Our objective was to enrich understanding of the 'intensive' processes and properties of data intensive research through a 'sociology' of data using methods that capture the relational properties of data flows. Our key methodological innovation was the staging of events for practitioners with different kinds of expertise in data intensive research to participate in the collective annotation of visual forms. Through such events we built a substantial digital data archive of our own that we then analysed in terms of three traits of data flow: durability, replicability and metrology.

Our findings are that analysing data flow with respect to these three traits provides better insight into how doing data intensive research involves people, infrastructures, practices, things, knowledge and institutions. Collectively, these elements shape the topography of data and condition how it flows. We argue that although much attention is given to phenomena such as the scale, volume and speed of data in data intensive research, these are measures of what we call 'extensive' properties rather than intensive ones. Our thesis is that extensive changes, that is to say those that result in non-linear changes in metrics, can be seen to result from intensive changes that bring multiple, disparate flows into confluence.

If extensive shifts in the modalities of data flow do indeed come from the alignment of disparate things, as we suggest, then we advocate the staging of workshops and other events with the purpose of developing the 'missing' metrics of data flow.

This work is licensed under a Creative Commons Attribution License.

Medeiros, Norm. "A Public Trust: Libraries and Data Curation " OCLC Systems & Services: International Digita Llibrary Perspectives 29, no. 4 (2013): 192-194. http://www.emeraldinsight.com/journals.htm?issn=1065-075x&volume=29&issue=4&articleid=17098936&show=html

Meghini, Carlo. "Data Preservation." Data Science Journal 12 (2013): GRDI51-GRDI57. http://datascience.codata.org/articles/abstract/10.2481/dsj.GRDI-009/

Digital information is a vital resource in our knowledge economy, valuable for research and education, science and the humanities, creative and cultural activities, and public policy (The Blue Ribbon Task Force on Sustainable Digital Preservation and Access, 2010). New high-throughput instruments, telescopes, satellites, accelerators, supercomputers, sensor networks, and running simulations are generating massive amounts of data (Thanos, 2011). These data are used by decision makers for improving the quality of life of citizens. Moreover, researchers are employing sophisticated technologies to analyse these data to address questions that were unapproachable just a few years ago (Helbing & Balietti, 2011). Digital technologies have fostered a new world of research characterized by immense datasets, unprecedented levels of openness among researchers, and new connections among researchers, policy makers, and the public (The National Academy of Sciences, 2009).

This work is licensed under a Creative Commons Attribution 3.0 License.

Michener, William K. "Ecological Data Sharing." Ecological Informatics 29, part 1 (2015): 33-44. http://dx.doi.org/10.1016/j.ecoinf.2015.06.010

Data sharing is the practice of making data available for use by others. Ecologists are increasingly generating and sharing an immense volume of data. Such data may serve to augment existing data collections and can be used for synthesis efforts such as meta-analysis, for parameterizing models, and for verifying research results (i.e., study reproducibility). Large volumes of ecological data may be readily available through institutions or data repositories that are the most comprehensive available and can serve as the core of ecological analysis. Ecological data are also employed outside the research context and are used for decision-making, natural resource management, education, and other purposes. Data sharing has a long history in many domains such as oceanography and the biodiversity sciences (e.g., taxonomic data and museum specimens), but has emerged relatively recently in the ecological sciences.

A review of several of the large international and national ecological research programs that have emerged since the mid-1900s highlights the initial failures and more recent successes as well as the underlying causes-from a near absence of effective policies to the emergence of community and data sharing policies coupled with the development and adoption of data and metadata standards and enabling tools. Sociocultural change and the move towards more open science have evolved more rapidly over the past two decades in response to new requirements set forth by governmental organizations, publishers and professional societies. As the scientific culture has changed so has the cyberinfrastructure landscape. The introduction of community-based data repositories, data and metadata standards, software tools, persistent identifiers, and federated search and discovery have all helped promulgate data sharing. Nevertheless, there are many challenges and opportunities especially as we move towards more open science. Cyberinfrastructure challenges include a paucity of easy-to-use metadata management systems, significant difficulties in assessing data quality and provenance, and an absence of analytical and visualization approaches that facilitate data integration and harmonization. Challenges and opportunities abound in the sociocultural arena where funders, researchers, and publishers all have a stake in clarifying policies, roles and responsibilities, as well as in incentivizing data sharing. A set of best practices and examples of software tools are presented that can enable research transparency, reproducibility and new knowledge by facilitating idea generation, research planning, data management and the dissemination of data and results.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Michener, William K., Suzie Allard, Amber Budden, Robert B. Cook, Kimberly Douglass, Mike Frame, Steve Kelling, Rebecca Koskela, Carol Tenopir, and David A. Vieglais. "Participatory Design of DataONE—Enabling Cyberinfrastructure for the Biological and Environmental Sciences." Ecological Informatics 11 (2012): 5-15. http://dx.doi.org/10.1016/j.ecoinf.2011.08.007

Michener, William K, and Matthew B. Jones. "Ecoinformatics: Supporting Ecology as a Data-Intensive Science." Trends in Ecology & Evolution 27, no. 2 (2012): 85-93. http://dx.doi.org/10.1016/j.tree.2011.11.016

Michener, William K., Todd Vision, Patricia Cruse, Dave Vieglais, John Kunze, and Greg Janée. "DataONE: Data Observation Network for Earth—Preserving Data and Enabling Innovation in the Biological and Environmental Sciences." D-Lib Magazine 17, no. 1/2 (2011). http://www.dlib.org/dlib/january11/michener/01michener.html

Miksa, Tomasz, Stephan Strodl, and Andreas Rauber. "Process Management Plans." International Journal of Digital Curation 9, no. 1 (2014): 83-97. http://www.ijdc.net/index.php/ijdc/article/view/9.1.83/343

In the era of research infrastructures and big data, sophisticated data management practices are becoming essential building blocks of successful science. Most practices follow a data-centric approach, which does not take into account the processes that created, analysed and presented the data. This fact limits the possibilities for reliable verification of results. Furthermore, it does not guarantee the reuse of research, which is one of the key aspects of credible data-driven science. For that reason, we propose the introduction of the new concept of Process Management Plans, which focus on the identification, description, sharing and preservation of the entire scientific processes. They enable verification and later reuse of result data and processes of scientific experiments. In this paper we describe the structure and explain the novelty of Process Management Plans by showing in what way they complement existing Data Management Plans. We also highlight key differences, major advantages, as well as references to tools and solutions that can facilitate the introduction of Process Management Plans.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Minor, David, Matt Critchlow, Arwen Hutt, Declan Fleming, Mary Linn Bergstrom, and Don Sutton. "Research Data Curation Pilots: Lessons Learned." International Journal of Digital Curation 9, no. 1 (2014): 220-230. http://www.ijdc.net/index.php/ijdc/article/view/9.1.220/354

In the spring of 2011, the UC San Diego Research Cyberinfrastructure (RCI) Implementation Team invited researchers and research teams to participate in a research curation and data management pilot program. This invitation took the form of a campus-wide solicitation. More than two dozen applications were received and, after due deliberation, the RCI Oversight Committee selected five curation-intensive projects. These projects were chosen based on a number of criteria, including how they represented campus research, varieties of topics, researcher engagement, and the various services required. The pilot process began in September 2011, and will be completed in early 2014. Extensive lessons learned from the pilots are being compiled and are being used in the on-going design and implementation of the permanent Research Data Curation Program in the UC San Diego Library.

In this paper, we present specific implementation details of these various services, as well as lessons learned. The program focused on many aspects of contemporary scholarship, including data creation and storage, description and metadata creation, citation and publication, and long term preservation and access. Based on the lessons learned in our processes, the Research Data Curation Program will provide a suite of services from which campus users can pick and choose, as necessary. The program will provide support for the data management requirements from national funding agencies.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Minor, David, Don Sutton, Ardys Kozbial, Brad Westbrook, Michael Burek, and Michael Smorul. "Chronopolis Digital Preservation Network." International Journal of Digital Curation 5, no. 1 (2010). http://www.ijdc.net/index.php/ijdc/article/view/150/212

Mischo, William H., Mary C. Schlembach, and Megan N. O'Donnell. "An Analysis of Data Management Plans in University of Illinois National Science Foundation Grant Proposals." Journal of eScience Librarianship 3, no. 1 (2014): e1060. http://dx.doi.org/10.7191/jeslib.2014.1060

Missier, Paolo, Bertram Ludäscher, Saumen Dey, Michael Wang, Tim McPhillips, Shawn Bowers, Michael Agun, and Ilkay Altintas. "Golden Trail: Retrieving the Data History That Matters from a Comprehensive Provenance Repository." International Journal of Digital Curation 7, no. 1 (2012): 139-150. http://www.ijdc.net/index.php/ijdc/article/view/211/280

Experimental science can be thought of as the exploration of a large research space, in search of a few valuable results. While it is this "Golden Data" that gets published, the history of the exploration is often as valuable to the scientists as some of its outcomes. We envision an e-research infrastructure that is capable of systematically and automatically recording such history—an assumption that holds today for a number of workflow management systems routinely used in e-science. In keeping with our gold rush metaphor, the provenance of a valuable result is a "Golden Trail". Logically, this represents a detailed account of how the Golden Data was arrived at, and technically it is a sub-graph in the much larger graph of provenance traces that collectively tell the story of the entire research (or of some of it).

In this paper we describe a model and architecture for a repository dedicated to storing provenance traces and selectively retrieving Golden Trails from it. As traces from multiple experiments over long periods of time are accommodated, the trails may be sub-graphs of one trace, or they may be the logical representation of a virtual experiment obtained by joining together traces that share common data.

The project has been carried out within the Provenance Working Group of the Data Observation Network for Earth (DataONE) NSF project. Ultimately, our longer-term plan is to integrate the provenance repository into the data preservation architecture currently being developed by DataONE.

This work is licensed under a Creative Commons Attribution License.

Mitchell, Erik T. "Research Support: The New Mission for Libraries." Journal of Web Librarianship 7, no. 1 (2013): 109-113. http://www.tandfonline.com/doi/abs/10.1080/19322909.2013.757930?journalCode=wjwl20#.Ub37e_nfBsk

Mohr, Alicia Hofelich, Josh Bishoff, Carolyn Bishoff, Steven Braun, Christine Storino, and Lisa R. Johnston. "When Data Is a Dirty Word: A Survey to Understand Data Management Needs Across Diverse Research Disciplines." Bulletin of the Association for Information Science and Technology 42, no. 1 (2015): 51-53. https://www.asist.org/publications/bulletin/oct-15/when-data-is-a-dirty-word/

Molloy, Laura. "Digital Curation Skills in the Performing Arts—An Investigation of Practitioner Awareness and Knowledge of Digital Object Management and Preservation." International Journal of Performance Arts and Digital Media 10, no. 1 (2014): 7-20. http://www.tandfonline.com/doi/full/10.1080/14794713.2014.912496

Molloy, Laura, Simon Hodson, Meik Poschen, and Jonathan Tedds. "Gathering Evidence of Benefits: A Structured Approach from the JISC Managing Research Data Programme." International Journal of Digital Curation 8, no. 2 (2013): 123-133. http://www.ijdc.net/index.php/ijdc/article/view/8.2.123

The work of the Jisc Managing Research Data programme is—along with the rest of the UK higher education sector—taking place in an environment of increasing pressure on research funding. In order to justify the investment made by Jisc in this activity—and to help make the case more widely for the value of investing time and money in research data management—individual projects and the programme as a whole must be able to clearly express the resultant benefits to the host institutions and to the broader sector. This paper describes a structured approach to the measurement and description of benefits provided by the work of these projects for the benefit of funders, institutions and researchers. We outline the context of the programme and its work; discuss the drivers and challenges of gathering evidence of benefits; specify benefits as distinct from aims and outputs; present emerging findings and the types of metrics and other evidence which projects have provided; explain the value of gathering evidence in a structured way to demonstrate benefits generated by work in this field; and share lessons learned from progress to date.

This work is licensed under a Creative Commons Attribution License.

Molloy, Laura, and Kellie Snow. "The Data Management Skills Support Initiative: Synthesising Postgraduate Training in Research Data Management." International Journal of Digital Curation 7, no. 2 (2012): 101-109. http://www.ijdc.net/index.php/ijdc/article/view/223/292

This paper will describe the efforts and findings of the JISC Data Management Skills Support Initiative ('DaMSSI'). DaMSSI was co-funded by the JISC Managing Research Data programme and the Research Information Network (RIN), in partnership with the Digital Curation Centre, to review, synthesise and augment the training offerings of the JISC Research Data Management Training Materials ('RDMTrain') projects.

DaMSSI tested the effectiveness of the Society of College, National and University Libraries' Seven Pillars of Information Literacy model (SCONUL, 2011), and Vitae's Researcher Development Framework ('Vitae RDF') for consistently describing research data management ('RDM') skills and skills development paths in UK HEI postgraduate courses.

With the collaboration of the RDMTrain projects, we mapped individual course modules to these two models and identified basic generic data management skills alongside discipline-specific requirements. A synthesis of the training outputs of the projects was then carried out, which further investigated the generic versus discipline-specific considerations and other successful approaches to training that had been identified as a result of the projects' work. In addition we produced a series of career profiles to help illustrate the fact that data management is an essential component—in obvious and not-so-obvious ways—of a wide range of professions.

We found that both models had potential for consistently and coherently describing data management skills training and embedding this within broader institutional postgraduate curricula. However, we feel that additional discipline-specific references to data management skills could also be beneficial for effective use of these models. Our synthesis work identified that the majority of core skills were generic across disciplines at the postgraduate level, with the discipline-specific approach showing its value in engaging the audience and providing context for the generic principles.

Findings were fed back to SCONUL and Vitae to help in the refinement of their respective models, and we are working with a number of other projects, such as the DCC and the EC-funded Digital Curator Vocational Education Europe (DigCurV2) initiative, to investigate ways to take forward the training profiling work we have begun.

This work is licensed under a Creative Commons Attribution License.

Moon, Jeff. "Developing a Research Data Management Service—A Case Study " Partnership: the Canadian Journal of Library and Information Practice and Research 9, no. 1 (2014). https://journal.lib.uoguelph.ca/index.php/perj/article/viewFile/2988/3266

Mooney, Hailey. "A Practical Approach to Data Citation: The Special Interest Group on Data Citation and Development of the Quick Guide to Data Citation." IASSIST Quarterly 37, 1-4 (2013): 71-77. http://iassistdata.org/iq/practical-approach-data-citation-special-interest-group-data-citation-and-development-quick-guide

Mooney, Hailey, and Mark P. Newton. "The Anatomy of a Data Citation: Discovery, Reuse, and Credit." Journal of Librarianship and Scholarly Communication 1, no. 1 (2012). http://dx.doi.org/10.7710/2162-3309.1035

INTRODUCTION Data citation should be a necessary corollary of data publication and reuse. Many researchers are reluctant to share their data, yet they are increasingly encouraged to do just that. Reward structures must be in place to encourage data publication, and citation is the appropriate tool for scholarly acknowledgment. Data citation also allows for the identification, retrieval, replication, and verification of data underlying published studies. METHODS This study examines author behavior and sources of instruction in disciplinary and cultural norms for writing style and citation via a content analysis of journal articles, author instructions, style manuals, and data publishers. Instances of data citation are benchmarked against a Data Citation Adequacy Index. RESULTS Roughly half of journals point toward a style manual that addresses data citation, but the majority of journal articles failed to include an adequate citation to data used in secondary analysis studies. DISCUSSION Full citation of data is not currently a normative behavior in scholarly writing. Multiplicity of data types and lack of awareness regarding existing standards contribute to the problem. CONCLUSION Citations for data must be promoted as an essential component of data publication, sharing, and reuse. Despite confounding factors, librarians and information professionals are well-positioned and should persist in advancing data citation as a normative practice across domains. Doing so promotes a value proposition for data sharing and secondary research broadly, thereby accelerating the pace of scientific research.

This work is licensed under a Creative Commons Attribution 3.0 License.

Moore, Reagan W. "Building Preservation Environments with Data Grid Technology." American Archivist 69, no. 1 (2007): 139-158. http://americanarchivist.org/doi/abs/10.17723/aarc.69.1.176p51l2w5278567

———. "Geospatial Web Services and Geoarchiving: New Opportunities and Challenges in Geographic Information Service." Library Trends 55, no. 2 (2006): 285-303. http://hdl.handle.net/2142/3684

Morris, Steven. Issues in the Appraisal and Selection of Geospatial Data. Washington, DC: National Digital Stewardship Alliance, 2013. http://www.digitalpreservation.gov/ndsa/working_groups/documents/NDSA_AppraisalSelection_report_final102413.pdf

This paper proposes a series of appraisal and selection recommended practices regarding data relevancy, documentation, currency, research and application needs, usability, risk and ease of acquisition that will help organizations make the initial steps to initiate a digital stewardship plan for geospatial information that touches on each point of the information lifecycle.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

———. "The North Carolina Geospatial Data Archiving Project: Challenges and Initial Outcomes." Journal of Map & Geography Libraries 6, no. 1 (2009): 26-44. http://www.tandfonline.com/doi/abs/10.1080/15420350903432507

Morris, Steven, James Tuttle, and Jefferson Essic. "A Partnership Framework for Geospatial Data Preservation in North Carolina." Library Trends 57, no. 3 (2009): 516-540. http://hdl.handle.net/2142/13603

Murillo, Angela P. "Data at Risk Initiative: Examining and Facilitating the Scientific Process in Relation to Endangered Data." Data Science Journal 12 (2014): 207-219 http://datascience.codata.org/articles/abstract/10.2481/dsj.12-048/

Examining the scientific process in relation to endangered data, data reuse, and sharing is crucial in facilitating scientific workflow. Deterioration, format obsolescence, and insufficient metadata for discovery are significant problems leading to loss of scientific data. The research presented in this paper considers these potentially lost data. Four one-hour focus groups and a demographic survey were conducted with 14 scientists to learn about their attitudes toward endangered data, data sharing, data reuse, and their opinions of the DARI inventory. The results indicate that unavailability, lack of context, accessibility issues, and potential endangerment are key concerns to scientists.

This work is licensed under a Creative Commons Attribution 3.0 License.

Murphy, Fiona. "Data and Scholarly Publishing: The Transforming Landscape." Learned Publishing 27, no. 5 (2014): 3-7. http://www.ingentaconnect.com/content/alpsp/lp/2014/00000027/00000005/art00002

———. "An Update on Peer Review and Research Data." Learned Publishing 29, no. 1 (2016): 51-53. http://onlinelibrary.wiley.com/wol1/doi/10.1002/leap.1005/abstract

Musgrave, Simon. "Improving Access to Recorded Language Data." D-Lib Magazine 20, no. 1/2 (2014). http://www.dlib.org/dlib/january14/musgrave/01musgrave.html

National Academy of Sciences Committee on Ensuring the Utility and Integrity of Research Data in a Digital Age. Ensuring the Integrity, Accessibility, and Stewardship of Research Data in the Digital Age. Washington, DC: National Academies Press, 2009. http://www.nap.edu/catalog.php?record_id=12615

National Research Council Committee on Archiving and Accessing Environmental and Geospatial Data at NOAA. Environmental Data Management at NOAA: Archiving, Stewardship, and Access. Washington, DC: National Academies Press, 2007. http://www.nap.edu/catalog.php?record_id=12017

National Science Board. Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century. Washington, DC: National Science Foundation, 2005. http://www.nsf.gov/pubs/2005/nsb0540/

Naum, Alexandra. "Research Data Storage And Management: Library Staff Participation in Showcasing Research Data at the University of Adelaide." The Australian Library Journal 63, no. 1 (2014): 35-44. http://dx.doi.org/10.1080/00049670.2014.890019

Neuroth, Heike, Felix Lohmeier, and Kathleen Marie Smith. "TextGrid—Virtual Research Environment for the Humanities." International Journal of Digital Curation 6, no. 2 (2011): 222-231. http://www.ijdc.net/index.php/ijdc/article/view/193

Nicholl, Natsuko H., Sara M. Samuel, Leena N. Lalwani, Paul F. Grochowski, and Jennifer A. Green. "Resources to Support Faculty Writing Data Management Plans: Lessons Learned from an Engineering Pilot." International Journal of Digital Curation 9, no. 1 (2014): 242-252. http://www.ijdc.net/index.php/ijdc/article/view/9.1.242

Recent years have seen a growing emphasis on the need for improved management of research data. Academic libraries have begun to articulate the conceptual foundations, roles, and responsibilities involved in data management planning and implementation. This paper provides an overview of the Engineering data support pilot at the University of Michigan Library as part of developing new data services and infrastructure. Through this pilot project, a team of librarians had an opportunity to identify areas where the library can play a role in assisting researchers with data management, and has put forth proposals for immediate steps that the library can take in this regard. The paper summarizes key findings from a faculty survey and discusses lessons learned from an analysis of data management plans from accepted NSF proposals. A key feature of this Engineering pilot project was to ensure that these study results will provide a foundation for librarians to educate and assist researchers with managing their data throughout the research lifecycle.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Nicholson, Shawn W., and Terrence B. Bennett. "Data Sharing: Academic Libraries and the Scholarly Enterprise." portal: Libraries and the Academy 11, no. 1 (2011): 505-516. http://muse.jhu.edu/login?auth=0&type=summary&url=/journals/portal_libraries_and_the_academy/v011/11.1.nicholson.html

Nielsen, Hans Jørn, and Birger Hjørland. "Curating Research Data: The Potential Roles of Libraries and Information Professionals." Journal of Documentation 70, no. 2 (2014): 221-240. http://www.emeraldinsight.com/doi/abs/10.1108/JD-03-2013-0034

Niua, Jinfang. "Aggregate Control of Scientific Data." Archives and Records: The Journal of the Archives and Records Association 37, no. 1 (2016): 53-64. http://www.tandfonline.com/doi/abs/10.1080/23257962.2016.1145578

Noonan, Daniel, and Tamar Chute. "Data Curation and the University Archives." The American Archivist 77, no. 1 (2014):201-240. http://hdl.handle.net/1811/62042

Norman, Belinda, and Kate Valentine Stanton. "From Project to Strategic Vision: Taking the Lead in Research Data Management Support at the University of Sydney Library." International Journal of Digital Curation 9, no. 1 (2014): 253-262. http://www.ijdc.net/index.php/ijdc/article/view/9.1.253/357

This paper explores three stories, each occurring a year apart, illustrating an evolution toward a strategic vision for Library leadership in supporting research data management at the University of Sydney. The three stories describe activities undertaken throughout the Seeding the Commons project and beyond, as the establishment of ongoing roles and responsibilities transition the Library from project partner to strategic leader in the delivery of research data management support. Each story exposes key ingredients that characterise research data management support: researcher engagement; partnerships; and the complementary roles of policy and practice.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Norman, Hazel. "Mandating Data Archiving: Experiences from the Frontline." Learned Publishing 27, no. 5 (2014): 35-38. http://www.ingentaconnect.com/content/alpsp/lp/2014/00000027/00000005/art00007

Norton, Hannah F., Michele R. Tennant, Cecilia Botero, and Rolando Garcia-Milian. "Assessment of and Response to Data Needs of Clinical and Translational Science Researchers and Beyond." Journal of eScience Librarianship 5, no. 1 (2016): e1090. http://escholarship.umassmed.edu/jeslib/vol5/iss1/2/

Objective and Setting: As universities and libraries grapple with data management and "big data," the need for data management solutions across disciplines is particularly relevant in clinical and translational science (CTS) research, which is designed to traverse disciplinary and institutional boundaries. At the University of Florida Health Science Center Library, a team of librarians undertook an assessment of the research data management needs of CTS researchers, including an online assessment and follow-up one-on-one interviews.

Design and Methods: The 20-question online assessment was distributed to all investigators affiliated with UF's Clinical and Translational Science Institute (CTSI) and 59 investigators responded. Follow-up in-depth interviews were conducted with nine faculty and staff members.

Results: Results indicate that UF's CTS researchers have diverse data management needs that are often specific to their discipline or current research project and span the data lifecycle. A common theme in responses was the need for consistent data management training, particularly for graduate students; this led to localized training within the Health Science Center and CTSI, as well as campus-wide training. Another campus-wide outcome was the creation of an action-oriented Data Management/Curation Task Force, led by the libraries and with participation from Research Computing and the Office of Research.

Conclusions: Initiating conversations with affected stakeholders and campus leadership about best practices in data management and implications for institutional policy shows the library's proactive leadership and furthers our goal to provide concrete guidance to our users in this area.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Ogburn, Joyce L. "The Imperative for Data Curation." portal: Libraries & the Academy 10, no. 2 (2010): 241-246. http://muse.jhu.edu/login?auth=0&type=summary&url=/journals/portal_libraries_and_the_academy/v010/10.2.ogburn.html

Olendorf, Robert, and Steve Koch. "Beyond the Low Hanging Fruit: Data Services and Archiving at the University of New Mexico." Journal of Digital Information 13, no. 1 (2012). http://journals.tdl.org/jodi/index.php/jodi/article/view/5878

O'Malley, Donna L. "Gaining Traction in Research Data Management Support: A Case Study." Journal of eScience Librarianship 3, no. 1 (2014): e1059. http://dx.doi.org/10.7191/jeslib.2014.1059

Oostdijk, Nelleke, Henk van den Heuvel, and Maaske Treurniet. "The CLARIN-NL Data Curation Service: Bringing Data to the Foreground." International Journal of Digital Curation 8, no. 2 (2013): 134-145. http://www.ijdc.net/index.php/ijdc/article/view/8.2.134/323

After decades in which a great deal of effort was spent on the creation of resources, there are currently several initiatives worldwide that aim to create an interoperable, sustainable research infrastructure. An integral part of such an infrastructure constitutes the resources (data and tools) which researchers in the various disciplines employ. Whether the infrastructure will be successful in supporting the needs of the research communities it intends to cater for depends on a number of factors. One factor is that resources that are or could be relevant to the wider research community are made visible through this infrastructure and, to the greatest extent possible, accessible and usable. In practice, the durable availability of resources is often not properly regulated within research projects.

CLARIN-NL is directed at creating an interoperable language resources infrastructure for the humanities in the Netherlands. The Data Curation Service was established in order to salvage language resources in this field that are threatened to be lost. In the CLARIN context, a great deal of attention is given to standards, formats and intellectual property rights. Consequently, the Data Curation Service (DCS) has a role as mediator in bringing researchers in the field of humanities and existing data centres closer together.

This article consists of two parts: the first part provides the background to the work of the DCS while the second part illustrates the work of the DCS by describing the actual curation of a collection of language learner data.

This work is licensed under a Creative Commons Attribution License.

Palaiologk, Anna S., Anastasios A. Economides, Heiko D. Tjalsma, and Laurents B. Sesink. "An Activity-Based Costing Model for Long-Term Preservation and Dissemination of Digital Research Data: The Case of DANS." International Journal on Digital Libraries 12, no. 4 (2012): 195-214. http://link.springer.com/article/10.1007%2Fs00799-012-0092-1

Palmer, Carole L., Bryan P. Heidorn, Dan Wright, and Melissa H. Cragin. "Graduate Curriculum for Biological Information Specialists: A Key to Integration of Scale in Biology." International Journal of Digital Curation 2, no. 2 (2007): 31-40. http://www.ijdc.net/index.php/ijdc/article/view/42/27

Palmer, Carole L., Nicholas M. Weber, Trevor Muñoz, and Allen H. Renear. "Foundations of Data Curation: The Pedagogy and Practice of 'Purposeful Work' with Research Data." Archive Journal, no. 3 (2013). http://www.archivejournal.net/issue/3/archives-remixed/foundations-of-data-curation-the-pedagogy-and-practice-of-purposeful-work-with-research-data/

Palumbo, Laura B., Ron Jantz, Yu-Hung Lin, Aletia Morgan, Minglu Wang, Krista White, Ryan Womack, Yingting Zhang, and Yini Zhu. "Preparing to Accept Research Data: Creating Guidelines for Librarians." Journal of eScience Librarianship 4, no. 2 (2015): e1080. http://escholarship.umassmed.edu/jeslib/vol4/iss2/1/

Papineau, Diane, and Butch Lazorchak. Geospatial Data Stewardship: Key Online Resources. Washington, DC: National Digital Stewardship Alliance, 2014. http://www.digitalpreservation.gov/ndsa/working_groups/documents/NDSA_Geo-stewardship-key-resources_final030414.pdf

This document lists online resources that highlight key concepts and practices supporting the preservation and stewardship of digital geospatial data and information. GIS practitioners take the initial preservation actions in the decisions they make regarding data creation and management. Librarians, archivists and museum professionals are often called on to support access and the long-term historical and temporal analysis of these same materials. The resources below offer a starting point to methods, tools and approaches across the information lifecycle to assist in understanding current best practices in the stewardship of geospatial data. These resources will be regularly updated at http://www.digitalpreservation.gov/ndsa/working_groups/geo-stewardship-resources.html.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Parham, Susan Wells, Jon Bodnar, and Sara Fuchs. "Supporting Tomorrow's Research: Assessing Faculty Data Curation Needs at Georgia Tech " College & Research Libraries News 73, no. 1 (2012): 10-13. http://crln.acrl.org/content/73/1/10.full.pdf+html

Parham, Susan Wells, and Chris Doty. "NSF DMP Content Analysis: What Are Researchers Saying?" Bulletin of the American Society for Information Science and Technology 39, no. 1 (2012): 37-38. http://doi.org/10.1002/bult.2012.1720390113

Parsons, M., and P. Fox. "Is Data Publication the Right Metaphor?" Data Science Journal 12 (2013): WDS32-WDS46. http://datascience.codata.org/articles/abstract/10.2481/dsj.WDS-042/

International attention to scientific data continues to grow. Opportunities emerge to re-visit long-standing approaches to managing data and to critically examine new capabilities. We describe the cognitive importance of metaphor. We describe several metaphors for managing, sharing, and stewarding data and examine their strengths and weaknesses. We particularly question the applicability of a "publication" approach to making data broadly available. Our preliminary conclusions are that no one metaphor satisfies enough key data system attributes and that multiple metaphors need to co-exist in support of a healthy data ecosystem. We close with proposed research questions and a call for continued discussion.

This work is licensed under a Creative Commons Attribution 3.0 License.

Parsons, Mark A. "Organizational Status of RDA." D-Lib Magazine 20, no. 1/2 (2014). http://www.dlib.org/dlib/january14/parsons/01parsons.html

———. "The Research Data Alliance: Implementing the Technology, Practice and Connections of a Data Infrastructure." Bulletin of the American Society for Information Science and Technology 39, no. 6 (2013): 33-36. http://www.asis.org/Bulletin/Aug-13/AugSep13_Parsons_Berman.html

Parsons, Thomas. "Creating a Research Data Management Service." International Journal of Digital Curation 8, no. 2 (2013): 146-156. http://www.ijdc.net/index.php/ijdc/article/view/8.2.146/324

This paper provides an overview of the elements required to create a sustainable research data management (RDM) service. The paper summarises key learning and lessons learnt from the University of Nottingham's project to create an RDM service for researchers. Collective experiences and learning from three key areas are covered, including: data management requirements gathering and validation, RDM training, and the creation of an RDM website.

This work is licensed under a Creative Commons Attribution License.

Parsons, Thomas, Shirley Grimshaw, and Laurian Williamson. Research Data Management Survey: Report. Nottingham, UK: University of Nottingham, 2013. http://eprints.nottingham.ac.uk/1893/

Partlo, Kristin. "From Data to the Creation of Meaning Part II: Data Librarian as Translator." IASSIST Quarterly 38, no. 2 (2014): 12-15. http://www.iassistdata.org/sites/default/files/iqvol38_2_partlo.pdf

Patel, Manjula, and Alexander Ball. "Challenges and Issues Relating to the Use of Representation Information for the Digital Curation of Crystallography and Engineering Data." International Journal of Digital Curation 3, no. 1 (2008): 76-88. http://www.ijdc.net/index.php/ijdc/article/view/64/43

Peer, Limor, and Ann Green. "Building an Open Data Repository for a Specialized Research Community: Process, Challenges and Lessons." International Journal of Digital Curation 7, no. 1 (2012): 151-162. http://www.ijdc.net/index.php/ijdc/article/view/212/281

In 2009, the Institution for Social and Policy Studies (ISPS) at Yale University began building an open access digital collection of social science experimental data, metadata, and associated files produced by ISPS researchers. The digital repository was created to support the replication of research findings and to enable further data analysis and instruction. Content is submitted to a rigorous process of quality assessment and normalization, including transformation of statistical code into R, an open source statistical software. Other requirements included: (a) that the repository be integrated with the current database of publications and projects publicly available on the ISPS website; (b) that it offered open access to datasets, documentation, and statistical software program files; (c) that it utilized persistent linking services and redundant storage provided within the Yale Digital Commons infrastructure; and (d) that it operated in accordance with the prevailing standards of the digital preservation community. In partnership with Yale's Office of Digital Assets and Infrastructure (ODAI), the ISPS Data Archive was launched in the fall of 2010. We describe the process of creating the repository, discuss prospects for similar projects in the future, and explain how this specialized repository fits into the larger digital landscape at Yale.

This work is licensed under a Creative Commons Attribution License.

Peer, Limor, Ann Green, and Elizabeth Stephenson. "Committing to Data Quality Review." International Journal of Digital Curation 9, no. 1 (2014): 263-291. http://www.ijdc.net/index.php/ijdc/article/view/9.1.263/358

Amid the pressure and enthusiasm for researchers to share data, a rapidly growing number of tools and services have emerged. What do we know about the quality of these data? Why does quality matter? And who should be responsible for data quality? We believe an essential measure of data quality is the ability to engage in informed reuse, which requires that data are independently understandable. In practice, this means that data must undergo quality review, a process whereby data and associated files are assessed and required actions are taken to ensure files are independently understandable for informed reuse. This paper explains what we mean by data quality review, what measures can be applied to it, and how it is practiced in three domain-specific archives. We explore a selection of other data repositories in the research data ecosystem, as well as the roles of researchers, academic libraries, and scholarly journals in regard to their application of data quality measures in practice. We end with thoughts about the need to commit to data quality and who might be able to take on those tasks.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Pejša, Stanislav, Shirley J. Dyke, and Thomas J. Hacker. "Building Infrastructure for Preservation and Publication of Earthquake Engineering Research Data." International Journal of Digital Curation 9, no. 2 (2014): 83-97. http://www.ijdc.net/index.php/ijdc/article/view/9.2.83/371

The objective of this paper is to showcase the progress of the earthquake engineering community during a decade-long effort supported by the National Science Foundation in the George E. Brown Jr., Network for Earthquake Engineering Simulation (NEES). During the four years that NEES network operations have been headquartered at Purdue University, the NEEScomm management team has facilitated an unprecedented cultural change in the ways research is performed in earthquake engineering. NEES has not only played a major role in advancing the cyberinfrastructure required for transformative engineering research, but NEES research outcomes are making an impact by contributing to safer structures throughout the USA and abroad. This paper reflects on some of the developments and initiatives that helped instil change in the ways that the earthquake engineering and tsunami community share and reuse data and collaborate in general.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Peng, Ge, Nancy A. Ritchey, Kenneth S. Casey, Edward J. Kearns, Jeffrey L. Privette, Drew Saunders, Philip Jones, Tom Maycock, and Steve Ansari. "Scientific Stewardship in the Open Data and Big Data Era—Roles and Responsibilities of Stewards and Other Major Product Stakeholders." D-Lib Magazine 22, no. 5/6 (2016). http://www.dlib.org/dlib/may16/peng/05peng.html

Pepe, Alberto, Alyssa Goodman, August Muench, Merce Crosas, and Christopher Erdmann. " How Do Astronomers Share Data? Reliability and Persistence of Datasets Linked in AAS Publications and a Qualitative Study of Data Practices among US Astronomers." PLoS ONE 9, no. 8 (2014): e104798. http://dx.doi.org/10.1371/journal.pone.0104798

We analyze data sharing practices of astronomers over the past fifteen years. An analysis of URL links embedded in papers published by the American Astronomical Society reveals that the total number of links included in the literature rose dramatically from 1997 until 2005, when it leveled off at around 1500 per year. The analysis also shows that the availability of linked material decays with time: in 2011, 44% of links published a decade earlier, in 2001, were broken. A rough analysis of link types reveals that links to data hosted on astronomers' personal websites become unreachable much faster than links to datasets on curated institutional sites. To gauge astronomers' current data sharing practices and preferences further, we performed in-depth interviews with 12 scientists and online surveys with 173 scientists, all at a large astrophysical research institute in the United States: the Harvard-Smithsonian Center for Astrophysics, in Cambridge, MA. Both the in-depth interviews and the online survey indicate that, in principle, there is no philosophical objection to data-sharing among astronomers at this institution. Key reasons that more data are not presently shared more efficiently in astronomy include: the difficulty of sharing large data sets; over reliance on non-robust, non-reproducible mechanisms for sharing data (e.g. emailing it); unfamiliarity with options that make data-sharing easier (faster) and/or more robust; and, lastly, a sense that other researchers would not want the data to be shared. We conclude with a short discussion of a new effort to implement an easy-to-use, robust, system for data sharing in astronomy, at theastrodata.org, and we analyze the uptake of that system to-date.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Pepe, Alberto, Matthew Mayernik, Christine L. Borgman, and Herbert Van de Sompel. "From Artifacts to Aggregations: Modeling Scientific Life Cycles on the Semantic Web." Journal of the American Society for Information Science and Technology 61, no. 3 (2010): 567-582. http://arxiv.org/abs/0906.2549

Pepler, Sam, and Sarah Callaghan. "Twenty Years of Data Management in the British Atmospheric Data Centre." International Journal of Digital Curation 10, no. 2 (2015): 23-32. http://www.ijdc.net/index.php/ijdc/article/view/10.2.23

The British Atmospheric Data Centre (BADC) has existed in its present form for 20 years, having been formally created in 1994. It evolved from the GDF (Geophysical Data Facility), a SERC (Science and Engineering Research Council) facility, as a result of research council reform where NERC (Natural Environment Research Council) extended its remit to cover atmospheric data below 10km altitude. With that change the BADC took on data from many other atmospheric sources and started interacting with NERC research programmes.

The BADC has now hit early adulthood. Prompted by this milestone, we examine in this paper whether the data centre is creaking at the seams or is looking forward to the prime of its life, gliding effortlessly into the future. Which parts of it are bullet proof and which parts are held together with double-sided sticky tape? Can we expect to see it in its present form in another twenty years' time?

To answer these questions, we examine the interfaces, technology, processes and organisation used in the provision of data centre services by looking at three snapshots in time, 1994, 2004 and 2014, using metrics and reports from the time to compare and contrasts the services using BADC. The repository landscape has changed massively over this period and has moved the focus for technology and development as the broader community followed emerging trends, standards and ways of working. The incorporation of these new ideas has been both a blessing and a curse, providing the data centre staff with plenty of challenges and opportunities.

We also discuss key data centre functions including: data discovery, data access, ingestion, data management planning, preservation plans, agreements/licences and data policy, storage and server technology, organisation and funding, and user management. We conclude that the data centre will probably still exist in some form in 2024 and that it will most likely still be reliant on a file system. However, the technology delivering this service will change and the host organisation and funding routes may vary.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Peters, Christie, and Anita Riley Dryden. "Assessing the Academic Library's Role in Campus-Wide Research Data Management: A First Step at the University of Houston." Science & Technology Libraries 30, no. 4 (2011): 387-403. http://www.tandfonline.com/doi/full/10.1080/0194262X.2011.626340#abstract

Peters, Christie, and Porcia Vaughn. "Initiating Data Management Instruction to Graduate Students at the University of Houston Using the New England Collaborative Data Management Curriculum." Journal of eScience Librarianship 3, no. 1 (2014): e1064. http://dx.doi.org/10.7191/jeslib.2014.1064

Pinfield,Stephen, Andrew M. Cox, and Jen Smith. "Research Data Management and Libraries: Relationships, Activities, Drivers and Influences." PLoS ONE 9, no. 12 (2014): e114734. http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0114734

The management of research data is now a major challenge for research organisations. Vast quantities of born-digital data are being produced in a wide variety of forms at a rapid rate in universities. This paper analyses the contribution of academic libraries to research data management (RDM) in the wider institutional context. In particular it: examines the roles and relationships involved in RDM, identifies the main components of an RDM programme, evaluates the major drivers for RDM activities, and analyses the key factors influencing the shape of RDM developments. The study is written from the perspective of library professionals, analysing data from 26 semi-structured interviews of library staff from different UK institutions. This is an early qualitative contribution to the topic complementing existing quantitative and case study approaches. Results show that although libraries are playing a significant role in RDM, there is uncertainty and variation in the relationship with other stakeholders such as IT services and research support offices. Current emphases in RDM programmes are on developments of policies and guidelines, with some early work on technology infrastructures and support services. Drivers for developments include storage, security, quality, compliance, preservation, and sharing with libraries associated most closely with the last three. The paper also highlights a 'jurisdictional' driver in which libraries are claiming a role in this space. A wide range of factors, including governance, resourcing and skills, are identified as influencing ongoing developments. From the analysis, a model is constructed designed to capture the main aspects of an institutional RDM programme. This model helps to clarify the different issues involved in RDM, identifying layers of activity, multiple stakeholders and drivers, and a large number of factors influencing the implementation of any initiative. Institutions may usefully benchmark their activities against the data and model in order to inform ongoing RDM activity.

This work is licensed under a Creative Commons Attribution License.

Pink, Catherine. "Meeting the Data Management Compliance Challenge: Funder Expectations and Institutional Reality." International Journal of Digital Curation 8, no. 2 (2013): 157-171. http://www.ijdc.net/index.php/ijdc/article/view/8.2.157/325

In common with many global research funding agencies, in 2011 the UK Engineering and Physical Sciences Research Council (EPSRC) published its Policy Framework on Research Data along with a mandate that institutions be fully compliant with the policy by May 2015. The University of Bath has a strong applied science and engineering research focus and, as such, the EPSRC is a major funder of the university's research. In this paper, the Jisc-funded Research360 project shares its experience in developing the infrastructure required to enable a research-intensive institution to achieve full compliance with a particular funder's policy, in such a way as to support the varied data management needs of both the University of Bath and its external stakeholders. A key feature of the Research360 project was to ensure that after the project's completion in summer 2013 the newly developed data management infrastructure would be maintained up to and beyond the EPSRC's 2015 deadline. Central to these plans was the 'University of Bath Roadmap for EPSRC', which was identified as an exemplar response by the EPSRC. This paper explores how a roadmap designed to meet a single funder's requirements can be compatible with the strategic goals of an institution. Also discussed is how the project worked with Charles Beagrie Ltd to develop a supporting business case, thus ensuring implementation of these long-term objectives. This paper describes how two new data management roles, the Institutional Data Scientist and Technical Data Coordinator, have contributed to delivery of the Research360 project and the importance of these new types of cross-institutional roles for embedding a new data management infrastructure within an institution. Finally, the experience of developing a new institutional data policy is shared. This policy represents a particular example of the need to reconcile a funder's expectations with the needs of individual researchers and their collaborators.

This work is licensed under a Creative Commons Attribution License.

Piorun, Mary E., Donna Kafel, Tracey Leger-Hornby, Siamak Najafi, Elaine R. Martin, Paul Colombo, and Nancy R. LaPelle. "Teaching Research Data Management: An Undergraduate/Graduate Curriculum." Journal of eScience Librarianship 1, no. 1 (2012): e1003. http://dx.doi.org/10.7191/jeslib.2012.1003

Piwowar, Heather A., and Wendy W. Chapman. "Public Sharing of Research Datasets: A Pilot Study of Associations." Journal of Informetrics 4, no. 2 (2010): 148-156. http://dx.doi.org/10.1016/j.joi.2009.11.010

Piwowar, Heather A., Roger S. Day, and Douglas B. Fridsma. "Sharing Detailed Research Data Is Associated with Increased Citation Rate." PLoS ONE 2, no, (2007): e308. http://dx.doi.org/10.1371/journal.pone.0000308

Background

Sharing research data provides benefit to the general scientific community, but the benefit is less obvious for the investigator who makes his or her data available.

Principal Findings

We examined the citation history of 85 cancer microarray clinical trial publications with respect to the availability of their data. The 48% of trials with publicly available microarray data received 85% of the aggregate citations. Publicly available data was significantly (p?=?0.006) associated with a 69% increase in citations, independently of journal impact factor, date of publication, and author country of origin using linear regression.

Significance

This correlation between publicly available data and increased literature impact may further motivate investigators to share their detailed research data.

This work is licensed under a Creative Commons Attribution License.

Plale, Beth. "Synthesis of Working Group and Interest Group Activity One Year into the Research Data Alliance." D-Lib Magazine 20, no. 1/2 (2014). http://www.dlib.org/dlib/january14/plale/01plale.html

Plale, Beth, Bin Cao, Chathura Herath, and Yiming Sun. "Data Provenance for Preservation of Digital Geoscience Data." Geological Society of America Special Papers 482 (2011): 125-137. http://dx.doi.org/10.1130/2011.2482(11)

Plale, Beth, Robert H. McDonald, Kavitha Chandraseka, Inna Kouper, Stacy Konkiel, Margaret L. Hedstrom, James Myers, and Praveen Kumar. "SEAD Virtual Archive: Building a Federation of Institutional Repositories for Long-Term Data Preservation in Sustainability Science." International Journal of Digital Curation 8, no. 2 (2014): 172-180. http://www.ijdc.net/index.php/ijdc/article/view/8.2.172/335

Major research universities are grappling with their response to the deluge of scientific data emerging through research by their faculty. Many are looking to their libraries and the institutional repositories for a solution. Scientific data introduces substantial challenges that the document-based institutional repository may not be suited to deal with. The Sustainable Environment-Actionable Data (SEAD) Virtual Archive (VA) specifically addresses the challenges of 'long tail' scientific data. In this paper, we propose requirements, policy and architecture to support not only the preservation of scientific data today using institutional repositories, but also rich access to data and their use into the future.

This work is licensed under a Creative Commons Attribution License.

Poole, Alex H. "How Has Your Science Data Grown? Digital Curation and the Human Factor: A Critical Literature Review." Archival Science 15, no. 2, (2015): 101-139. http://dx.doi.org/10.1007/s10502-014-9236-y

———. "Now is the Future Now? The Urgency of Digital Curation in the Digital Humanities." Digital Humanities Quarterly 7, no. 2 (2013). http://www.digitalhumanities.org/dhq/vol/7/2/000163/000163.html

Porcal-Gonzalo, Maria C. "A Strategy for the Management, Preservation, and Reutilization of Geographical Information Based on the Lifecycle of Geospatial Data: An Assessment and a Proposal Based on Experiences from Spain and Europe." Journal of Map & Geography Libraries 11, no. 3 (2015) 289-329. http://dx.doi.org/10.1080/15420353.2015.1064054

Pouchard, Line, Andrew Woolf, and David Bernholdt. "Data Grid Discovery and Semantic Web Technologies for the Earth Sciences." International Journal on Digital Libraries 5, no. 2 (2005): 72-83. http://link.springer.com/article/10.1007/s00799-004-0085-9

Pronk, Tessa E., Paulien H. Wiersma, Anne van Weerden, and Feike Schieving. "A Game Theoretic Analysis of Research Data Sharing." PeerJ 3 2015): e1242. https://doi.org/10.7717/peerj.1242

Prost, Hélène, Cécile Malleret, and Joachim Schöpfel. "Hidden Treasures: Opening Data in PhD Dissertations in Social Sciences and Humanities." Journal of Librarianship and Scholarly Communication 3, no. 2 (2015): eP1230. http://doi.org/10.7710/2162-3309.1230

PURPOSE The paper provides empirical evidence on research data submitted together with PhD dissertations in social sciences and humanities. APPROACH We conducted a survey on nearly 300 print and electronic dissertations in social sciences and humanities from the University of Lille 3 (France), submitted between 1987 and 2013. FINDINGS After a short overview on open access to electronic dissertations, on small data in dissertations, on data management and curation, and on the challenge for academic libraries, the paper presents the results of the survey. Special attention is paid to the size of the research data in appendices, to their presentation and link to the text, to their sources and typology, and to their potential for further research. Methodological shortfalls of the study are discussed, and barriers to open data (metadata, structure, format) and legal questions (privacy, third-party rights) are addressed. The conclusion provides some recommendations for the assistance and advice to PhD students in managing and depositing their research data. PRACTICAL IMPLICATIONS Our survey can be helpful for academic libraries to develop assistance and advice for PhD students in managing their research data in collaboration with the research structures and the graduate schools. ORIGINALITY There is a growing body of research papers on data management and curation. Produced along with PhD dissertations, little is known about the characteristics of this material, in particular in social sciences and humanities and the impact on the role of academic libraries.

This work is licensed under a Creative Commons Attribution 4.0 License.

Pryor, Graham. "Attitudes and Aspirations in a Diverse World: The Project StORe Perspective on Scientific Repositories." International Journal of Digital Curation 2, no. 1 (2007): 135-144. http://www.ijdc.net/index.php/ijdc/article/view/32/21

———. "A Maturing Process of Engagement: Raising Data Capabilities in UK Higher Education." International Journal of Digital Curation 8, no. 2 (2013): 181-193. http://www.ijdc.net/index.php/ijdc/article/view/8.2.181/326

In the spring of 2011, the UK's Digital Curation Centre (DCC) commenced a programme of outreach designed to assist individual universities in their development of aptitude for managing research data. This paper describes the approaches taken, covering the context in which these institutional engagements have been discharged and examining the aims, methodology and processes employed. It also explores what has worked and why, as well as the pitfalls encountered, including example outcomes and identifiable or predicted impact. Observing how the research data landscape is constantly undergoing change, the paper concludes with an indication of the steps being taken to refit the DCC institutional engagement to the evolving needs of higher education.

This work is licensed under a Creative Commons Attribution License.

——— "Multi-Scale Data Sharing in the Life Sciences: Some Lessons for Policy Makers." International Journal of Digital Curation 4, no. 3 (2009): 71-82. http://www.ijdc.net/index.php/ijdc/article/view/135/178

Pryor, Graham, ed. Managing Research Data. London: Facet Publishing, 2012. http://www.facetpublishing.co.uk/title.php?id=7562

Pryor, Graham, and Martin Donnelly. "Skilling Up to Do Data: Whose Role, Whose Responsibility, Whose Career?" International Journal of Digital Curation 4, no. 2 (2009): 158-170. http://www.ijdc.net/index.php/ijdc/article/view/126/133

Pryor, Graham, Sarah Jones, and Angus Whyte. Delivering Research Data Management Services. London: Facet Publishing, 2013. http://www.facetpublishing.co.uk/title.php?id=049337

Qin, Jian, and John D'ignazio. "The Central Role of Metadata in a Science Data Literacy Course." Journal of Library Metadata 10, no. 2/3 (2010): 188-204. http://www.tandfonline.com/doi/full/10.1080/19386389.2010.506379

Raboin, Regina, Rebecca C. Reznik-Zellen, and Dorothea Salo. "Forging New Service Paths: Institutional Approaches to Providing Research Data Management Services." Journal of eScience Librarianship 1, no. 3 (2012): e1021. http://dx.doi.org/10.7191/jeslib.2012.1021

Ragon, Bart. "The Political Economy of Federally Sponsored Data." Journal of eScience Librarianship 2, no. 2 (2013): e1050. http://dx.doi.org/10.7191/jeslib.2013.1050

Rajasekar, Arcot, Reagan Moore, Mike Wan, and Wayne Schroeder. "Policy-Based Distributed Data Management Systems." Journal of Digital Information 11, no. 1 (2010). http://journals.tdl.org/jodi/index.php/jodi/article/view/756

Rambo, Neil. Research Data Management Roles for Libraries. New York: Ithaka S+R, 2015. http://dx.doi.org/10.18665/sr.274643

Ramírez, Marisa L. "Whose Role Is It Anyway?: A Library Practitioner's Appraisal of the Digital Data Deluge." Bulletin of the American Society for Information Science & Technology 37, no. 5 (2011): 21-23. http://www.asis.org/Bulletin/Jun-11/JunJul11_Ramirez.html

Ray, Joyce M., ed. Research Data Management: Practical Strategies for Information Professionals. West Lafayette, IN: Purdue University Press 2014. http://www.thepress.purdue.edu/titles/format/9781557536648

Read, Kevin B., Alisa Surkis, Catherine Larson, Aileen McCrillis, Alice Graff, Joey Nicholson, and Juanchan Xu. "Starting the Data Conversation: Informing Data Services at an Academic Health Sciences Library." Journal of the Medical Library Association 10, no. 3 (2015): 131-135. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4511052/

Recker, Astrid, Stefan Müller, Jessica Trixa, and Natascha Schumann. "Paving the Way for Data-Centric, Open Science: An Example From the Social Sciences." Journal of Librarianship and Scholarly Communication 3, no. 2 (2015): eP1227. http://doi.org/10.7710/2162-3309.1227

INTRODUCTION Data has moved into the spotlight as an important scholarly output that should be shared with the scientific community for replication and re-use in new contexts. This has a direct impact on libraries, archives, and other service providers in the data curation and access landscape. DESCRIPTION OF PROJECT The GESIS Data Archive for the Social Sciences (DAS) has been curating and disseminating social science research data since 1960. The article presents tools, services, and strategies developed by the DAS to support the research community in adequately responding to the legal, ethical, and practical challenges that the transformation towards data-centric, open science presents. These include GESIS's Secure Data Center, the data publication platform "datorium" and a recent project to create a georeferencing service for survey data. LESSONS LEARNED The experiences gained through these activities show that getting involved-now, rather than further down the road-pays off in that it allows service providers to actively shape the ongoing transformation. At the same time, by cooperating with suitable partners, the effort and investment of resources can be kept at a manageable level for individual organizations.

This work is licensed under a Creative Commons Attribution 4.0 License.

Reed, Robyn B. "Diving into Data: Planning a Research Data Management Event." Journal of eScience Librarianship 4, no. 1 (2015): e1071. http://escholarship.umassmed.edu/jeslib/vol4/iss1/5/

Reilly, Michele, and Anita R. Dryden. "Building an Online Data Management Plan Tool." Journal of Librarianship and Scholarly Communication 1, no. 3 (2013): eP1066. http://doi.org/10.7710/2162-3309.1066

Following the 2011 announcement by the National Science Foundation (NSF) that it would begin requiring Data Management Plans with every funding application, the University of Houston Libraries explored ways to support our campus researchers in meeting this requirement. A small team of librarians built an online tool using a Drupal module. The tool includes informational content, an interactive questionnaire, and an extensive FAQ to meet diverse researcher needs. This easily accessible and locally maintained tool allows us to provide a high level of personalized service to our researchers.

This work is licensed under a Creative Commons Attribution License.

Reilly, Susan, Wouter Schallier, Sabine Schrimpf, Eefke Smit, and Max Wilkinson. Report on Integration of Data and Publications. The Hague: Alliance for Permanent Access, 2011. https://core.ac.uk/download/files/324/30437753.pdf

Scholarly communication is the foundation of modern research where empirical evidence is interpreted and communicated as published hypothesis driven research. Many current and recent reports highlight the impact of advancing technology on modern research and consequences this has on scholarly communication. As part of the ODE project this report sought to coalesce current though and opinions from numerous and diverse sources to reveal opportunities for supporting a more connected and integrated scholarly record. Four perspectives were considered, those of the Researcher who generates or reuses primary data, Publishers who provide the mechanisms to communicate research activities and Libraries & Data enters who maintain and preserve the evidence that underpins scholarly communication and the published record. This report finds the landscape fragmented and comple, where competing interests can sometimes confuse and confound requirements, needs and expectations. Equally the report identifies clear opportunity for all stakeholders to directly enable a more joined up and vital scholarly record of modern research.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Renear, Allen H., Carole L. Palmer, and John Unsworth. Extending Data Curation to the Humanities: Curiculum Development and Recruiting. Urbana-Champaign: Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign, 2013. http://hdl.handle.net/2142/42628

Rinehart, Amanda K. "Getting Emotional about Data: The Soft Side of Data Management Services." College & Research Libraries News 76, no. 8 (2015): 437-440. http://crln.acrl.org/content/76/8/437.full

Reznik-Zellen, Rebecca C., Jessica Adamick, and Stephen McGinty. "Tiers of Research Data Support Services." Journal of eScience Librarianship 1, no. 1 (2012): e1002. http://dx.doi.org/10.7191/jeslib.2012.1002

Ribeiro, Cristina, Maria Eugénia, and Matos Fernandes. "Data Curation at U. Porto: Identifying Current Practices across Disciplinary Domains." IASSIST Quarterly 35, no. 4 (2011): 14-17. http://www.iassistdata.org/iq/data-curation-uporto-identifying-current-practices-across-disciplinary-domains

Rice, Robin. "Research Data MANTRA: A Labour of Love." Journal of eScience Librarianship 3, no. 1 (2014): e1056. http://dx.doi.org/10.7191/jeslib.2014.1056

Rice, Robin, Çuna Ekmekcioglu, Jeff Haywood, Sarah Jones, Stuart Lewis, Stuart Macdonald, and Tony Weir. "Implementing the Research Data Management Policy: University of Edinburgh Roadmap." International Journal of Digital Curation 8, no. 2 (2013): 194-204. http://www.ijdc.net/index.php/ijdc/article/view/8.2.194/327

This paper discusses work to implement the University of Edinburgh Research Data Management (RDM) policy by developing the services needed to support researchers and fulfil obligations within a changing national and international setting. This is framed by an evolving Research Data Management Roadmap and includes a governance model that ensures cooperation amongst Information Services (IS) managers and oversight by an academic-led steering group. IS has taken requirements from research groups and IT professionals, and at the request of the steering group has conducted pilot work involving volunteer research units within the three colleges to develop functionality and presentation for the key services. The first pilots cover three key services: the data store, a customisation of the Digital Curation Centre's DMPonline tool, and the data repository. The paper will report on the plans, achievements and challenges encountered while we attempt to bring the University of Edinburgh RDM Roadmap to fruition.

This work is licensed under a Creative Commons Attribution License.

Rice, Robin, and Jeff Haywood. "Research Data Management Initiatives at University of Edinburgh." International Journal of Digital Curation 6, no. 2 (2011): 232-244. http://www.ijdc.net/index.php/ijdc/article/view/194/259

Richards, Julian D. "Digital Preservation and Access." European Journal of Archaeology 5, no. 3 (2002): 343-366. http://eja.sagepub.com/content/5/3/343.abstract

Rimkus, Kyle, Thomas Padilla, Tracy Popp, and Greer Martin. "Digital Preservation File Format Policies of ARL Member Libraries: An Analysis." D-Lib Magazine 20, no. 3/4 (2014). http://www.dlib.org/dlib/march14/rimkus/03rimkus.html

Roche, Dominique G., Robert Lanfear, Sandra A. Binning, Tonya M. Haff, Lisa E. Schwanz, Kristal E. Cain, Hanna Kokko, Michael D. Jennions, and Loeske E. B. Kruuk. "Troubleshooting Public Data Archiving: Suggestions to Increase Participation." PLOS Biology 12, no. 1 (2014): e1001779. http://dx.doi.org/10.1371/journal.pbio.1001779

An increasing number of publishers and funding agencies require public data archiving (PDA) in open-access databases. PDA has obvious group benefits for the scientific community, but many researchers are reluctant to share their data publicly because of real or perceived individual costs. Improving participation in PDA will require lowering costs and/or in-creasing benefits for primary data collectors. Small, simple changes can enhance existing measures to ensure that more scientific data are properly archived and made publicly available: (1) facilitate more flexible embargoes on archived data, (2) encourage communication between data generators and re-users, (3) disclose data re-use ethics, and (4) encourage increased recognition of publicly archived data.

This work is licensed under a Creative Commons Attribution License.

Rumsey, Sally, and Neil Jefferies. "Challenges in Building an Institutional Research Data Catalogue." International Journal of Digital Curation 8, no. 2 (2013): 205-214. http://www.ijdc.net/index.php/ijdc/article/view/8.2.205/328

The University of Oxford is preparing systems and services to enable members of the university to manage research data produced by its scholars. Much of the work has been carried out under the Jisc-funded Damaro project. This project draws together existing nascent services, adds new systems and services to 'fill the gaps' and provides a wide-ranging infrastructure. Development comprises four parallel strands: endorsement of a university research data management policy; training and guidance in research data management; technical infrastructure; and future sustainability. A key element of the technical infrastructure is DataFinder, a catalogue of Oxford research data outputs. DataFinder's core purposes are to record the existence of Oxford datasets, enable their discovery, and provide details of their location. DataFinder will record metadata about Oxford research data, irrespective of location, discipline or format, and is viewed by the university as a crucial hub for the university's Research Data Management (RDM) infrastructure.

This work is licensed under a Creative Commons Attribution License.

———. "DataFinder: A Research Data Catalogue for Oxford." Ariadne, no. 71 (2013). http://www.ariadne.ac.uk/issue71/rumsey-jefferies

Sallans, Andrew, and Martin Donnelly. "DMP Online and DMPTool: Different Strategies towards a Shared Goal." International Journal of Digital Curation 7, no. 2 (2012): 123-129. http://www.ijdc.net/index.php/ijdc/article/view/225/294

This paper provides a comparative discussion of the strategies employed in the UK's DMP Online tool and the US's DMPTool, both designed to provide a structured environment for research data management planning (DMP) with explicit links to funder requirements. Following the Sixth International Digital Curation Conference, held in Chicago in December 2010, a number of US institutions partnered with the Digital Curation Centre's DMP Online team to learn from their experiences while developing a US counterpart. DMPTool arrived in beta in August 2011 and released a production version in November 2011. This joint paper will compare and contrast use cases, organizational and national/cultural characteristics that have influenced the development decisions, outcomes achieved so far, and planned future developments.

This work is licensed under a Creative Commons Attribution License.

Sands, Ashley E., Christine L. Borgman, Sharon Traweek, and Laura A. Wynholds. "We're Working On It: Transferring the Sloan Digital Sky Survey from Laboratory to Library." International Journal of Digital Curation 9, no. 2 (2014): 98-110. http://www.ijdc.net/index.php/ijdc/article/view/9.2.98/372

This article reports on the transfer of a massive scientific dataset from a national laboratory to a university library, and from one kind of workforce to another. We use the transfer of the Sloan Digital Sky Survey (SDSS) archive to examine the emergence of a new workforce for scientific research data management. Many individuals with diverse educational backgrounds and domain experience are involved in SDSS data management: domain scientists, computer scientists, software and systems engineers, programmers, and librarians. These types of positions have been described using terms such as research technologist, data scientist, e-science professional, data curator, and more. The findings reported here are based on semi-structured interviews, ethnographic participant observation, and archival studies from 2011-2013.

The library staff conducting the data storage and archiving of the SDSS archive faced two performance problems. The preservation specialist and the system administrator worked together closely to discover and implement solutions to the slow data transfer and verification processes. The team overcame these slow-downs by problem solving, working in a team, and writing code. The library team lacked the astronomy domain knowledge necessary to meet some of their preservation and curation goals.

The case study reveals the variety of expertise, experience, and individuals essential to the SDSS data management process. A variety of backgrounds and educational histories emerge in the data managers studied. Teamwork is necessary to bring disparate expertise together, especially between those with technical and domain education. The findings have implications for data management education, policy and relevant stakeholders.

This article is part of continuing research on Knowledge Infrastructures.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Sapp Nelson, Megan. "Data Management Outreach to Junior Faculty Members: A Case Study." Journal of eScience Librarianship 4, no. 1 (2015): e1076. http://escholarship.umassmed.edu/jeslib/vol4/iss1/8/

Savage, Caroline J., and Andrew J. Vickers. "Empirical Study of Data Sharing by Authors Publishing in PLoS Journals." PLoS ONE 4, no. 9 (2009): e7078. http://dx.doi.org/10.1371/journal.pone.0007078

Background

Many journals now require authors share their data with other investigators, either by depositing the data in a public repository or making it freely available upon request. These policies are explicit, but remain largely untested. We sought to determine how well authors comply with such policies by requesting data from authors who had published in one of two journals with clear data sharing policies.

Methods and Findings

We requested data from ten investigators who had published in either PLoS Medicine or PLoS Clinical Trials. All responses were carefully documented. In the event that we were refused data, we reminded authors of the journal's data sharing guidelines. If we did not receive a response to our initial request, a second request was made. Following the ten requests for raw data, three investigators did not respond, four authors responded and refused to share their data, two email addresses were no longer valid, and one author requested further details. A reminder of PLoS's explicit requirement that authors share data did not change the reply from the four authors who initially refused. Only one author sent an original data set.

Conclusions

We received only one of ten raw data sets requested. This suggests that journal policies requiring data sharing do not lead to authors making their data sets available to independent investigators.

This work is licensed under a Creative Commons Creative Commons Attribution License.

Sayogoa, Djoko Sigit, and Theresa A. Pard. "Exploring the Determinants of Scientific Data Sharing: Understanding the Motivation to Publish Research Data." Government Information Quarterly 30, no. S1 (2013): S19-S31. http://www.sciencedirect.com/science/article/pii/S0740624X12001529

Scaramozzino, Jeanine Marie, Marisa L. Ramírez, and Karen J. McGaughey. "A Study of Faculty Data Curation Behaviors and Attitudes at a Teaching-Centered University." College & Research Libraries 73, no. 4 (2012): 349-365. http://crl.acrl.org/content/73/4/349.full.pdf+html

Schirrwagen, Jochen, Paolo Manghi, Natalia Manola, Lukasz Bolikowski, Najla Rettberg, and Birgit Schmidt. "Data Curation in the OpenAIRE Scholarly Communication Infrastructure." Information Standards Quarterly 25, no. 3 (2013): 13-19. http://www.niso.org/publications/isq/2013/v25no3/schirrwagen

Schmidt, Birgit, and Jens Dierkes. "New Alliances for Research and Teaching Support: Establishing the Göttingen eResearch Alliance." Program 49, no. 4 (2015): 461-474. http://dx.doi.org/10.1108/PROG-02-2015-0020

Schmidt, Birgit, Birgit Gemeinholzer, and Andrew Treloar. "Open Data in Global Environmental Research: The Belmont Forum's Open Data Survey." PLoS ONE 11, no. 1 (2016): e0146695. http://dx.doi.org/10.1371/journal.pone.0146695

This paper presents the findings of the Belmont Forum's survey on Open Data which targeted the global environmental research and data infrastructure community. It highlights users' perceptions of the term "open data", expectations of infrastructure functionalities, and barriers and enablers for the sharing of data. A wide range of good practice examples was pointed out by the respondents which demonstrates a substantial uptake of data sharing through e-infrastructures and a further need for enhancement and consolidation. Among all policy responses, funder policies seem to be the most important motivator. This supports the conclusion that stronger mandates will strengthen the case for data sharing.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Schopf, Jennifer M., and Steven Newhouse. "User Priorities for Data: Results from SUPER." International Journal of Digital Curation 2, no. 1 (2007): 149-155. http://www.ijdc.net/index.php/ijdc/article/view/34/23

Schopfel, Joachim, Stéphane Chaudiron, Bernard Jacquemin, Hélène Prost, Marta Severo, and Florence Thiault. "Open Access to Research Data in Electronic Theses and Dissertations: An Overview." Library Hi Tech 32, no. 4 (2014): 612-627. http://www.emeraldinsight.com/doi/abs/10.1108/LHT-06-2014-0058

Schubert, Carolyn, Yasmeen Shorish, Paul Frankel, and Kelly Giles. "The Evolution of Research Data: Strategies for Curation and Data Management." Library Hi Tech News 30, no. 6 (2013): 1-6. http://www.emeraldinsight.com/journals.htm?articleid=17094319

Schumacher, Jaime, and Drew VandeCreek. "Intellectual Capital at Risk: Data Management Practices and Data Loss by Faculty Members at Five American Universities." International Journal of Digital Curation 10, no. 2 (2015): 96-109. http://www.ijdc.net/index.php/ijdc/article/view/10.2.96

A study of 56 professors at five American universities found that a majority had little understanding of principles, well-known in the field of data curation, informing the ongoing administration of digital materials and chose to manage and store work-related data by relying on the use of their own storage devices and cloud accounts. It also found that a majority of them had experienced the loss of at least one work-related digital object that they considered to be important in the course of their professional career. Despite such a rate of loss, a majority of respondents expressed at least a moderate level of confidence that they would be able to make use of their digital objects in 25 years. The data suggest that many faculty members are unaware that their data is at risk. They also indicate a strong correlation between faculty members' digital object loss and their data management practices. University professors producing digital objects can help themselves by becoming aware that these materials are subject to loss. They can also benefit from awareness and use of better personal data management practices, as well as participation in university-level programmatic digital curation efforts and the availability of more readily accessible, robust infrastructure for the storage of digital materials.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Schumann, Natascha. "Tried and Trusted: Experiences with Certification Processes at the GESIS Data Archive." IASSIST Quarterly 36, no. 3/4 (2012): 24-27. http://www.iassistdata.org/iq/tried-and-trusted-experiences-certification-processes-gesis-data-archive-0.

Schumann, Natascha, and Reiner Mauer. "The GESIS Data Archive for the Social Sciences: A Widely Recognised Data Archive on its Way." International Journal of Digital Curation 8, no. 2 (2013): 215-222. http://www.ijdc.net/index.php/ijdc/article/view/8.2.215/329

This paper describes initial experiences in evaluating an established data archive with a long-standing commitment to preservation and dissemination of social science research data against recently formulated standards for trustworthy digital archives. As stakeholders need to be sure that the data they produce, use or fund is treated according to common standards, the GESIS Data Archive decided to start a process of audit and certification within the European Framework of Certification and Audit, starting with the Data Seal of Approval (DSA). This paper gives an overview of workflows within the archive and illustrates some of the steps necessary to obtain the DSA as well as to optimize some of its services. Finally, a short appraisal of the method of the DSA is made.

This work is licensed under a Creative Commons Attribution License.

Schumann, Natascha, and Astrid Recker. "De-mystifying OAIS compliance: Benefits and Challenges of Mapping the OAIS Reference Model to the GESIS Data Archive." IASSIST Quarterly 36, no. 2 (2012): 6-11. http://www.iassistdata.org/iq/de-mystifying-oais-compliance-benefits-and-challenges-mapping-oais-reference-model-gesis-data-arc

Schweers, Stefan, Katharina Kinder-kurlanda, Stefan Müller, and Pascal Siegers. "Conceptualizing a Spatial Data Infrastructure for the Social Sciences: An Example from Germany." Journal of Map & Geography Libraries 12, no. 1 (2016): 100-126. http://dx.doi.org/10.1080/15420353.2015.1100152

Searle, Samantha, Malcolm Wolski, Natasha Simons, and Joanna Richardson. "Librarians as Partners in Research Data Service Development at Griffith University." Program 49, no. 4 (2015): 440-460. http://dx.doi.org/10.1108/PROG-02-2015-0013

Shadbolt, Anna, Leo Konstantelos, Liz Lyon, and Marieke Guy. "Delivering Innovative RDM Training: The immersiveInformatics Pilot Programme." International Journal of Digital Curation 9, no. 1 (2014): 313-323. http://www.ijdc.net/index.php/ijdc/article/view/9.1.313/360

This paper presents the findings, lessons learned and next steps associated with the implementation of the immersiveInformatics pilot: a distinctive research data management (RDM) training programme designed in collaboration between UKOLN Informatics and the Library at the University of Melbourne, Australia. The pilot aimed to equip a broad range of academic and professional staff roles with RDM skills as a key element of capacity and capability building within a single institution.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Shaffer, Christopher J. "The Role of the Library in the Research Enterprise." Journal of eScience Librarianship 2, no. 1 (2013): e1043. http://dx.doi.org/10.7191/jeslib.2013.1043

Shankar, Kalpana. "For Want of a Nail: Three Tropes in Data Curation." Preservation, Digital Technology & Culture 44, no. 4 (2016): 161-170. http://www.degruyter.com/view/j/pdtc.2015.44.issue-4/pdtc-2015-0019/pdtc-2015-0019.xml

Shaon, Arif, Sarah Callaghan, Bryan Lawrence, Brian Matthews, Timothy Osborn, Colin Harpham, and Andrew Woolf. "Opening Up Climate Research: A Linked Data Approach to Publishing Data Provenance." International Journal of Digital Curation 7, no. 1 (2012): 163-173. http://www.ijdc.net/index.php/ijdc/article/view/213/282

Traditionally, the formal scientific output in most fields of natural science has been limited to peer-reviewed academic journal publications, with less attention paid to the chain of intermediate data results and their associated metadata, including provenance. In effect, this has constrained the representation and verification of the data provenance to the confines of the related publications. Detailed knowledge of a dataset's provenance is essential to establish the pedigree of the data for its effective re-use, and to avoid redundant re-enactment of the experiment or computation involved. It is increasingly important for open-access data to determine their authenticity and quality, especially considering the growing volumes of datasets appearing in the public domain. To address these issues, we present an approach that combines the Digital Object Identifier (DOI)—a widely adopted citation technique—with existing, widely adopted climate science data standards to formally publish detailed provenance of a climate research dataset as an associated scientific workflow. This is integrated with linked-data compliant data re-use standards (e.g. OAI-ORE) to enable a seamless link between a publication and the complete trail of lineage of the corresponding dataset, including the dataset itself.

This work is licensed under a Creative Commons Attribution License.

Shaon, Arif, and Andrew Woolf. "Long-Term Preservation for Spatial Data Infrastructures: A Metadata Framework and Geo-portal Implementation." D-Lib Magazine 17, no. 9/10 (2012). http://www.dlib.org/dlib/september11/shaon/09shaon.html

Shen, Yi. "Research Data Sharing and Reuse Practices of Academic Faculty Researchers: A Study of the Virginia Tech Data Landscape." International Journal of Digital Curation 10, no. 2 (2015): 157-175. http://www.ijdc.net/index.php/ijdc/article/view/10.2.157

This paper presents the results of a research data assessment and landscape study in the institutional context of Virginia Tech to determine the data sharing and reuse practices of academic faculty researchers. Through mapping the level of user engagement in "openness of data," "openness of methodologies and workflows," and "reuse of existing data," this study contributes to the current knowledge in data sharing and open access, and supports the strategic development of institutional data stewardship. Asking faculty researchers to self-reflect sharing and reuse from both data producers' and data users' perspectives, the study reveals a significant gap between the rather limited sharing activities and the highly perceived reuse or repurpose values regarding data, indicating that potential values of data for future research are lost right after the original work is done. The localized and sporadic data management and documentation practices of researchers also contribute to the obstacles they themselves often encounter when reusing existing data.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Shorish, Yasmeen. "Data Curation Is for Everyone! The Case for Master's and Baccalaureate Institutional Engagement with Data Curation." Journal of Web Librarianship 6, no. 4 (2012): 263-273. http://www.tandfonline.com/doi/full/10.1080/19322909.2012.729394

Si, Li, Xiaozhe Zhuang, Wenming Xing, and Weining Guo. "The Cultivation of Scientific Data Specialists: Development of LIS Education Oriented to E-science Service Requirements." Library Hi Tech 31, no. 4 (2013): 700-724. http://www.emeraldinsight.com/journals.htm?issn=0737-8831&volume=31&issue=4&articleid=17099953&show=html

Simons, Natasha. "Implementing DOIs for Research Data." D-Lib Magazine 18, no. 5/6 (2012). http://dlib.org/dlib/may12/simons/05simons.html

Simons, Natasha, Karen Visser, and Sam Searle. "Growing Institutional Support for Data Citation: Results of a Partnership Between Griffith University and the Australian National Data Service." D-Lib Magazine 19, no. 11/12 (2013). http://www.dlib.org/dlib/november13/simons/11simons.html

Smit, Eefke. "Eloise and Abelard: Why Data and Publications Belong Together." D-Lib Magazine 17, no. 1/2 (2011). http://www.dlib.org/dlib/january11/smit/01smit.html

Smith, Jordan W., William S. Slocumb, Charlynne Smith, and Jason Matney. "A Needs-Assessment Process for Designing Geospatial Data Management Systems within Federal Agencies." Journal of Map & Geography Libraries 11, no. 2 (2015): 226-244. http://dx.doi.org/10.1080/15420353.2015.1048035

Soehner, Catherine, Catherine Steeves, and Jennifer Ward. E-science and Data Support Services: A Study of ARL Member Institutions. Washington, DC: Association of Research Libraries, 2010. http://www.arl.org/storage/documents/publications/escience-report-2010.pdf

South, David M. "Data Preservation in High Energy Physics " Journal of Physics: Conference Series 331 (2011). http://arxiv.org/abs/1101.3186

Stamatoplos, Anthony, Tina Neville, and Deborah Henry. "Analyzing the Data Management Environment in a Master's-level Institution." The Journal of Academic Librarianship 42, no. 2 (2016): 109-190. http://dx.doi.org/10.1016/j.acalib.2015.11.004

Starr, Joan, Eleni Castro, Mercè Crosas, Michel Dumontier, Robert R. Downs, Ruth Duerr, Laurel L. Haak, Melissa Haendel, Ivan Herman, Simon Hodson, Joe Hourclé, John Ernest Kratz, Jennifer Lin, Lars Holm Nielsen, Amy Nurnberger, Stefan Proel, Andreas Rauber, Simone Sacchi, Arthur Smith, Mike Taylor, and Tim Clark. "Achieving Human and Machine Accessibility of Cited Data in Scholarly Publications." PeerJ Computer Science 1 (2015): e1. http://dx.doi.org/10.7717/peerj-cs.1

Starr, Joan, and Angela Gastl. "isCitedBy: A Metadata Scheme for DataCite." D-Lib Magazine 17, no. 1/2 (2011). http://www.dlib.org/dlib/january11/starr/01starr.html

Starr, Joan, Perry Willett, Lis Federer, Horning Claudia, and Mary Linn Bergstrom. "A Collaborative Framework for Data Management Services: The Experience of the University of California." Journal of eScience Librarianship 1, no. 2 (2012): e1014. http://dx.doi.org/10.7191/jeslib.2012.1014

Steeleworthy, Michael. "Research Data Management and the Canadian Academic Library: An Organizational Consideration of Data Management and Data Stewardship." Partnership: the Canadian Journal of Library and Information Practice and Research 9, no. 1 (2014). https://journal.lib.uoguelph.ca/index.php/perj/article/view/2990#.VZLGkEaoB30

Steinhart, Gail. "DataStaR: A Data Sharing and Publication Infrastructure to Support Research." Agricultural Information Worldwide: An International Journal for the Information Specialists in Agriculture, Natural Resources, and the Environment 4, no. 1 (2011). http://hdl.handle.net/1813/15035

———. "Libraries as Distributors of Geospatial Data: Data Management Policies as Tools for Managing Partnerships." Library Trends 55, no. 2 (2006): 264-284. http://hdl.handle.net/2142/3689

Steinhart, Gail, Eric Chen, Florio Arguillas, Dianne Dietrich, and Stefan Kramer. "Prepared to Plan? A Snapshot of Researcher Readiness to Address Data Management Planning Requirements." Journal of eScience Librarianship 1, no. 2 (2012): e1008. http://dx.doi.org/10.7191/jeslib.2012.1008

Steinhart, Gail, Dianne Dietrich, and Ann Green. "Establishing Trust in a Chain of Preservation: The TRAC Checklist Applied to a Data Staging Repository (DataStaR)." D-Lib Magazine 16, no. 9/10 (2009). http://www.dlib.org/dlib/september09/steinhart/09steinhart.html

Strasser, C. A., and S. E. Hampton. "The Fractured Lab Notebook: Undergraduates and Ecological Data Management Training in the United States." Ecosphere 3, no. 12 (2012): art116. http://dx.doi.org/10.1890/es12-00139.1

Data management is a timely and increasingly important topic for ecologists. Recent funder mandates requiring data management plans, combined with the data deluge that faces scientists, make education about data management critical for any future ecologist. In this study, we surveyed instructors of general ecology courses at 48 major institutions in the United States. We chose instructors at institutions that are likely to train future ecologists, and therefore, are most likely to influence the trajectory of data management education in this field. The survey queried instructors about institution and course characteristics, the extent to which data-related topics are included in their courses, the barriers to their teaching these topics, and their own personal beliefs and values associated with data management and stewardship. We found that, in general, data management topics are not being covered in undergraduate ecology courses for a wide range of reasons. Most often, instructors cited a lack of time and a lack of resources as barriers to teaching data management. Although data are used for instruction at some point in the majority of the courses surveyed, good data management practices and a thorough understanding of the importance of data stewardship are not being taught. We offer potential explanations for this and suggestions for improvement.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Strasser, Carly. Research Data Management: A Primer Publication of the National Information Standards Organization. Baltimore, MD: NISO, 2015. http://www.niso.org/apps/group_public/download.php/15375/PrimerRDM-2015-0727.pdf

Strasser, Carly, Stephen Abrams, and Patricia Cruse. " DMPTool 2: Expanding Functionality for Better Data Management Planning." International Journal of Digital Curation 9, no. 1 (2014): 324-330. http://www.ijdc.net/index.php/ijdc/article/view/9.1.324/361

Scholarly researchers today are increasingly required to engage in a range of data management planning activities to comply with institutional policies, or as a precondition for publication or grant funding. The latter is especially true in the U.S. in light of the recent White House Office of Science and Technology Policy (OSTP) mandate aimed at maximizing the availability of all outputs—data as well as the publications that summarize them—resulting from federally-funded research projects.

To aid researchers in creating effective data management plans (DMPs), a group of organizations—California Digital Library, DataONE, Digital Curation Centre, Smithsonian Institution, University of Illinois Urbana-Champaign, and University of Virginia Library—collaborated on the development of the DMPTool, an online application that helps researchers create data management plans. The DMPTool provides detailed guidance, links to general and institutional resources, and walks a researcher through the process of generating a comprehensive plan tailored to specific DMP requirements. The uptake of the DMPTool has been positive: to date, it has been used by over 6,000 researchers from 800 institutions, making use of more than 20 requirements templates customized for funding bodies.

With support from the Alfred P. Sloan Foundation, project partners are now engaged in enhancing the features of the DMPTool. The second version of the tool has enhanced functionality for plan creators and institutional administrators, as well as a redesigned user interface and an open RESTful application programming interface (API).

New administrative functions provide the means for institutions to better support local research activities. New capabilities include support for plan co-ownership; workflow provisions for internal plan review; simplified maintenance and addition of DMP requirements templates; extensive capabilities for the customization of guidance and resources by local institutional administrators; options for plan visibility; and UI refinements based on user feedback and focus group testing. The technical work undertaken for the DMPTool Version 2 has been accompanied by a new governance structure and the growth of a community of engaged stakeholders who will form the basis for a sustainable path forward for the DMPTool as it continues to play an important role in research data management activities.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Strasser, Carly, John Kunze, Stephen Abrams, and Patricia Cruse. "DataUp: A Tool to Help Researchers Describe and Share Tabular Data." F1000Research 3, no. 6 (2014). http://dx.doi.org/10.12688/f1000research.3-6.v2

Scientific datasets have immeasurable value, but they lose their value over time without proper documentation, long-term storage, and easy discovery and access. Across disciplines as diverse as astronomy, demography, archeology, and ecology, large numbers of small heterogeneous datasets (i.e., the long tail of data) are especially at risk unless they are properly documented, saved, and shared. One unifying factor for many of these at-risk datasets is that they reside in spreadsheets. In response to this need, the California Digital Library (CDL) partnered with Microsoft Research Connections and the Gordon and Betty Moore Foundation to create the DataUp data management tool for Microsoft Excel. Many researchers creating these small, heterogeneous datasets use Excel at some point in their data collection and analysis workflow, so we were interested in developing a data management tool that fits easily into those work flows and minimizes the learning curve for researchers. The DataUp project began in August 2011. We first formally assessed the needs of researchers by conducting surveys and interviews of our target research groups: earth, environmental, and ecological scientists. We found that, on average, researchers had very poor data management practices, were not aware of data centers or metadata standards, and did not understand the benefits of data management or sharing. Based on our survey results, we composed a list of desirable components and requirements and solicited feedback from the community to prioritize potential features of the DataUp tool. These requirements were then relayed to the software developers, and DataUp was successfully launched in October 2012.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Sturges, Paul, Marianne Bamkin, Jane H.S. Anders, Bill Hubbard, Azhar Hussain, and Melanie Heeley. "Research Data Sharing: Developing a Stakeholder-Driven Model for Journal Policies." Journal of the Association for Information Science and Technology 66, no. 12 (2015): 2445-2455. http://eprints.nottingham.ac.uk/3185/

Surkis, Alisa, Aileen McCrillis, Richard McGowan, Jeffrey Williams, Brian L. Schmidt, Markus Hardt, and Neil Rambo. "Informationist Support for a Study of the Role of Proteases and Peptides in Cancer Pain." Journal of eScience Librarianship 2, no. 1 (2013): e1029. http://dx.doi.org/10.7191/jeslib.2013.1029

Swanson, Juleah, and Amanda K. Rinehart. "Data in Context: Using Case Studies to Generate A Common Understanding of Data in Academic Libraries." The Journal of Academic Librarianship 42, no. 1 (2016): 97-101. http://dx.doi.org/10.1016/j.acalib.2015.11.005

Sweeney, Latanya, Mercè Crosas, and Michael Bar-Sinai. "Sharing Sensitive Data with Confidence: The Datatags System." Technology Science. (16 October 2015). http://techscience.org/a/2015101601

Sweetkind, Julie, Mary Lynette Larsgaard, and Tracey Erwin. "Digital Preservation of Geospatial Data." Library Trends 55, no. 2 (2006): 304-314. http://hdl.handle.net/2142/3690

Tananbaum, Greg. Implementing an Open Data Policy: A Primer for Research Funders. Washington, DC: Scholarly Publishing and Academic Resources Coalition, 2013. http://sparcopen.org/our-work/implementing-an-open-data-policy/

Tarver, Hannah, and Mark Phillips. "Integrating Image-based Research Datasets into an Existing Digital Repository Infrastructure." Cataloging & Classification Quarterly 51, no. 1-3 (2013): 238-250. http://www.tandfonline.com/doi/full/10.1080/01639374.2012.732203

Tenopir, Carol, Suzie Allard, Kimberly Douglass, Arsev Umur Aydinoglu, Lei Wu, Eleanor Read, Maribeth Manoff, and Mike Frame. "Data Sharing by Scientists: Practices and Perceptions." PLoS ONE 6, no. 6 (2011): e21101. http://www.plosone.org/article/info:doi/10.1371/journal.pone.0021101

Background

Scientific research in the 21st century is more data intensive and collaborative than in the past. It is important to study the data practices of researchers—data accessibility, discovery, re-use, preservation and, particularly, data sharing. Data sharing is a valuable part of the scientific method allowing for verification of results and extending research from prior results.

Methodology/Principal Findings

A total of 1329 scientists participated in this survey exploring current data sharing practices and perceptions of the barriers and enablers of data sharing. Scientists do not make their data electronically available to others for various reasons, including insufficient time and lack of funding. Most respondents are satisfied with their current processes for the initial and short-term parts of the data or research lifecycle (collecting their research data; searching for, describing or cataloging, analyzing, and short-term storage of their data) but are not satisfied with long-term data preservation. Many organizations do not provide support to their researchers for data management both in the short- and long-term. If certain conditions are met (such as formal citation and sharing reprints) respondents agree they are willing to share their data. There are also significant differences and approaches in data management practices based on primary funding agency, subject discipline, age, work focus, and world region.

Conclusions/Significance

Barriers to effective data sharing and preservation are deeply rooted in the practices and culture of the research process as well as the researchers themselves. New mandates for data management plans from NSF and other federal agencies and world-wide attention to the need to share and preserve data could lead to changes. Large scale programs, such as the NSF-sponsored DataNET (including projects like DataONE) will both bring attention and resources to the issue and make it easier for scientists to apply sound data management principles.

This work is licensed under a Creative Commons Attribution License.

Tenopir, Carol, Ben Birch, and Suzie Allard. Academic Libraries and Research Data Services: Current Practices and Plans for the Future. Chicago: Association of College and Research Libraries, 2012. http://www.ala.org/acrl/sites/ala.org.acrl/files/content/publications/whitepapers/Tenopir_Birch_Allard.pdf

Tenopir, Carol, Dane Hughes, Suzie Allard, Mike Frame, Ben Birch, Lynn Baird, Robert Sandusky, Madison Langseth, and Andrew Lundeen. "Research Data Services in Academic Libraries: Data Intensive Roles for the Future?" Journal of eScience Librarianship 4, no. 2 (2015): e1085. http://escholarship.umassmed.edu/jeslib/vol4/iss2/4/

Tenopir, Carol, Robert J. Sandusky, Suzie Allard, and Ben Birch. "Academic Librarians and Research Data Services: Preparation and Attitudes." IFLA Journal 39, no. 1 (2013): 70-78. http://www.ifla.org/files/assets/hq/publications/ifla-journal/ifla-journal-39-1_2013.pdf

———. "Research Data Management Services in Academic Research Libraries and Perceptions of Librarians." Library & Information Science Research 36, no. 2 (2014): 84-90. http://www.sciencedirect.com/science/article/pii/S0740818814000255

Thessen, Anne E., and David J. Patterson. "Data Issues in the Life Sciences." ZooKeys 150 (2011): 15-51. http://dx.doi.org/10.3897/zookeys.150.1766

Thoegersen, Jennifer L. "Examination of Federal Data Management Plan Guidelines." Journal of eScience Librarianship 4, no. 1 (2015): e1072. http://escholarship.umassmed.edu/jeslib/vol4/iss1/1/

Toups, Megan, and Michael Hughes. "When Data Curation Isn't: A Redefinition for Liberal Arts Universities." Journal of Library Administration 53, no. 4 (2013): 223-233. http://digitalcommons.trinity.edu/lib_faculty/36/

Treloar, Andrew. "Design and Implementation of the Australian National Data Service." International Journal of Digital Curation 4, no. 1 (2009): 125-137. http://www.ijdc.net/index.php/ijdc/article/view/107/83

———. "The Research Data Alliance: Globally Co-ordinated Action against Barriers to Data Publishing and Sharing." Learned Publishing 27, no. 5 (2014): 9-13. http://www.ingentaconnect.com/content/alpsp/lp/2014/00000027/00000005/art00003

Treloar, Andrew, David Groenewegen, and Cathrine Harboe-Ree. "The Data Curation Continuum: Managing Data Objects in Institutional Repositories." D-Lib Magazine 13, no. 9/10 (2007). http://www.dlib.org/dlib/september07/treloar/09treloar.html

Treloar, Andrew, and Ross Wilkinson. "Access to Data for eResearch: Designing the Australian National Data Service Discovery Services." International Journal of Digital Curation 3, no. 2 (2008): 151-158. http://www.ijdc.net/index.php/ijdc/article/view/95/66

Trimble, Leanne, Cheryl Woods, Francine Berish, Daniel Jakubek, and Sarah Simpkin. "Collaborative Approaches to the Management of Geospatial Data Collections in Canadian Academic Libraries: A Historical Case Study." Journal of Map & Geography Libraries 11, no. 3 (2015): 330-358. http://ir.lib.uwo.ca/wlpub/47/

Tsoi, Ah Chung, Jeff McDonell, Andrew Treloar, and Ian Atkinson. "Dataset Acquisition, Accessibility, Annotation, E-Research Technologies (DART) Project." International Journal on Digital Libraries 7, no. 1/2 (2007): 53-55. http://link.springer.com/article/10.1007/s00799-007-0019-4

Tuyl, Steve Van, and Gabrielle Michalek. "Assessing Research Data Management Practices of Faculty at Carnegie Mellon University." Journal of Librarianship and Scholarly Communication 3, no. 3 (2015): eP1258. http://doi.org/10.7710/2162-3309.1258

INTRODUCTION Recent changes to requirements for research data management by federal granting agencies and by other funding institutions have resulted in the emergence of institutional support for these requirements. At CMU, we sought to formalize assessment of research data management practices of researchers at the institution by launching a faculty survey and conducting a number of interviews with researchers. METHODS We submitted a survey on research data management practices to a sample of faculty including questions about data production, documentation, management, and sharing practices. The survey was coupled with in-depth interviews with a subset of faculty. We also make estimates of the amount of research data produced by faculty. RESULTS Survey and interview results suggest moderate level of awareness of the regulatory environment around research data management. Results also present a clear picture of the types and quantities of data being produced at CMU and how these differ among research domains. Researchers identified a number of services that they would find valuable including assistance with data management planning and backup/storage services. We attempt to estimate the amount of data produced and shared by researchers at CMU. DISCUSSION Results suggest that researchers may need and are amenable to assistance with research data management. Our estimates of the amount of data produced and shared have implications for decisions about data storage and preservation. CONCLUSION Our survey and interview results have offered significant guidance for building a suite of services for our institution.

This work is licensed under a Creative Commons Attribution 4.0 License.

Ulbricht, Damian, Kirsten Elger, Roland Bertelmann, and Jens Klump. "panMetaDocs, eSciDoc, and DOIDB—An Infrastructure for the Curation and Publication of File-Based Datasets for GFZ Data Services." ISPRS International Journal of Geo-Information 5, no. 3 (2016): 25. http://dx.doi.org/10.3390/ijgi5030025

The GFZ German Research Centre for Geosciences is the national laboratory for Geosciences in Germany. As part of the Helmholtz Association, providing and maintaining large-scale scientific infrastructures are an essential part of GFZ activities. This includes the generation of significant volumes and numbers of research data, which subsequently become source materials for data publications. The development and maintenance of data systems is a key component of GFZ Data Services to support state-of-the-art research. A challenge lies not only in the diversity of scientific subjects and communities, but also in different types and manifestations of how data are managed by research groups and individual scientists. The data repository of GFZ Data Services provides a flexible IT infrastructure for data storage and publication, including minting of digital object identifiers (DOI). It was built as a modular system of several independent software components linked together through Application Programming Interfaces (APIs) provided by the eSciDoc framework. Principal application software are panMetaDocs for data management and DOIDB for logging and moderating data publications activities. Wherever possible, existing software solutions were integrated or adapted. A summary of our experiences made in operating this service is given. Data are described through comprehensive landing pages and supplementary documents, like journal articles or data reports, thus augmenting the scientific usability of the service.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Ure, Jenny, Tasneem Irshad, Janet Hanley, Angus Whyte, Claudia Pagliari, Hilary Pinnock, and Brian McKinstry. "Curating Complex, Dynamic and Distributed Data: Telehealth as a Laboratory for Strategy." International Journal of Digital Curation 6, no. 2 (2011): 128-145. http://www.ijdc.net/index.php/ijdc/article/view/187/267

Valentino, Maura, and Michael Boock. "Data Management Services in Academic Libraries: A Case Study at Oregon State University." Practical Academic Librarianship: The International Journal of the SLA Academic Division 5, no. 2 (2015): 77-91. https://journals.tdl.org/pal/index.php/pal/article/view/7001

Libraries have been asked to provide many new services over the past several decades. This paper aims to show how data management services were incorporated into the services that Oregon State University provides to faculty and graduate students. The lessons learned are general and applicable to any research institute that needs to manage data or help others with managing data.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.

Van den Eynde, Veerle, Louise Corti, Matthew Woollard, Libby Bishop, and Laurence Horton. Managing and Sharing Data: Best Practice for Researchers. Colchester, UK: UK Data Archive, 2011. http://www.data-archive.ac.uk/media/2894/managingsharing.pdf

van Deventer, Martie, Heila Pienaar. "Research Data Management in a Developing Country: A Personal Journey." International Journal of Digital Curation 10, no. 2 (2015): 33-47. http://www.ijdc.net/index.php/ijdc/article/view/10.2.33

This paper explores our own journey to get to grips with research data management (RDM). It also mentions the overlap between our own 'journeys' and that of the country. We share the lessons that we learnt along the way—the most important lesson being that you can learn many wonderful and valuable RDM lessons from the international trend setters, but in the end you need to get your hands dirty and get the work done yourself. You must, within the set parameters, implement the RDM practice that is both appropriate and acceptable for and to your own set of researchers—who may be conducting research in a context that may be very dissimilar to that of international peers.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Van Horik, René, and Dirk Roorda. "Migration to Intermediate XML for Electronic Data (MIXED): Repository of Durable File Format Conversions." International Journal of Digital Curation 6, no. 2 (2011): 245-252. http://www.ijdc.net/index.php/ijdc/article/view/195/260

Vardigan, Mary, Darrell Donakowski, Pascal Heus, Sanda Ionescu, and Julia Rotondo. "Creating Rich, Structured Metadata: Lessons Learned in the Metadata Portal Project." IASSIST Quarterly 38, no. 3 (2014): 15-20. http://www.iassistdata.org/sites/default/files/iqvol38_3_vardigan.pdf

Vardigan, Mary, Pascal Heus, and Wendy Thomas. "Data Documentation Initiative: Toward a Standard for the Social Sciences." International Journal of Digital Curation 3, no. 1 (2008): 107-113. http://www.ijdc.net/index.php/ijdc/article/view/66/45

Vardigan, Mary, and Cole Whiteman. "ICPSR Meets OAIS: Applying the OAIS Reference Model to the Social Science Archive Context." Archival Science 7, no. 1 (2007): 73-87. http://link.springer.com/article/10.1007%2Fs10502-006-9037-z

Varvel, Virgil E., Jr., and Yi Shen. "Data Management Consulting at The Johns Hopkins University." New Review of Academic Librarianship 19, no. 3 (2013): 224-245. http://www.tandfonline.com/doi/abs/10.1080/13614533.2013.768277

Verbaan, E., and A.M. Cox. "Occupational Sub-Cultures, Jurisdictional Struggle and Third Space: Theorising Professional Service Responses to Research Data Management." The Journal of Academic Librarianship 40, no. 3-4 (2014): 211-219. http://dx.doi.org/10.1016/j.acalib.2014.02.008

Viglas, Stratis. "Data Provenance and Trust." Data Science Journal 12 (2013): GRDI58-GRDI64. http://datascience.codata.org/articles/abstract/10.2481/dsj.GRDI-010/

The Oxford Dictionary defines provenance as "the place of origin, or earliest known history of something." The term, when transferred to its digital counterpart, has morphed into a more general meaning. It is not only used to refer to the origin of a digital artefact but also to its changes over time. By changes in this context we may not only refer to its digital snapshots but also to the processes that caused and materialised the change. As an example, consider a database record r created at point in time t0; an update u to that record at time t1 causes it to have a value r'. In terms of provenance, we do not only want to record the snapshots (t0, r) and (t1, r') but also the transformation u that when applied to (t0, r) results in (t1, r'), that is u(t0, r) = (t1, r').

This work is licensed under a Creative Commons Attribution 3.0 License.

Vitale, Cynthia R. H., Brianna Marshall, and Amy Nurnberger. "You're in Good Company: Unifying Campus Research Data Services." Bulletin of the Association for Information Science and Technology 41, no. 6 (2015): 26-28. https://www.asist.org/publications/bulletin/aug-2015/unifying-campus-research-data-services/

Vlaeminck, Sven. "Data Management in Scholarly Journals and Possible Roles for Libraries—Some Insights from EDaWaX." LIBER Quarterly 23, no. 1 (2013): 48-79. http://liber.library.uu.nl/index.php/lq/article/view/URN%3ANBN%3ANL%3AUI%3A10-1-114595

In this paper we summarize the findings of an empirical study conducted by the EDaWaX-Project. 141 economics journals were examined regarding the quality and extent of data availability policies that should support replications of published empirical results in economics. This paper suggests criteria for such policies that aim to facilitate replications. These criteria were also used for analysing the data availability policies we found in our sample and to identify best practices for data policies of scholarly journals in economics. In addition, we also evaluated the journals' data archives and checked the percentage of articles associated with research data. To conclude, an appraisal as to how scientific libraries might support the linkage of publications to underlying research data in cooperation with researchers, editors, publishers and data centres is presented.

This work is licensed under a Creative Commons Attribution 4.0 License.

Vlaeminck, Sven, and Gert G. Wagner. "On the Role of Research Data Centres in the Management of Publication-Related Research Data " LIBER Quarterly 23, no. 4 (2014): 336-357. http://liber.library.uu.nl/index.php/lq/article/view/9356

This paper summarizes the findings of an analysis of scientific infrastructure service providers (mainly from Germany but also from other European countries). These service providers are evaluated with regard to their potential services for the management of publication-related research data in the field of social sciences, especially economics. For this purpose we conducted both desk research and an online survey of 46 research data centres (RDCs), library networks and public archives; almost 48% responded to our survey. We find that almost three-quarters of all respondents generally store externally generated research data—which also applies to publication-related data. Almost 75% of all respondents also store and host the code of computation or the syntax of statistical analyses. If self-compiled software components are used to generate research outputs, only 40% of all respondents accept these software components for storing and hosting. Eight out of ten institutions also take specific action to ensure long-term data preservation. With regard to the documentation of stored and hosted research data, almost 70% of respondents claim to use the metadata schema of the Data Documentation Initiative (DDI); Dublin Core is used by 30 percent (multiple answers were permitted). Almost two-thirds also use persistent identifiers to facilitate citation of these datasets. Three in four also support researchers in creating metadata for their data. Application programming interfaces (APIs) for uploading or searching datasets currently are not yet implemented by any of the respondents. Least common is the use of semantic technologies like RDF.

Concluding, the paper discusses the outcome of our survey in relation to Research Data Centres (RDCs) and the roles and responsibilities of publication-related data archives for journals in the fields of social sciences.

This work is licensed under a Creative Commons Attribution 4.0 License.

Waddington, Simon, Jun Zhang, Gareth Knight, Mark Hedges, Jens Jensen, and Roger Downing. "Kindura: Repository Services for Researchers Based on Hybrid Clouds " Journal of Digital Information 13, no. 1 (2012). http://journals.tdl.org/jodi/index.php/jodi/article/view/5877

Walling, David, and Maria Esteva. "Automating the Extraction of Metadata from Archaeological Data Using iRods Rules." International Journal of Digital Curation 6, no. 2 (2011): 253-264. http://www.ijdc.net/index.php/ijdc/article/view/196/261

Wallis, Jillian. "Data Producers Courting Data Reusers: Two Cases from Modeling Communities." International Journal of Digital Curation 9, no. 1 (2014): 98-109. http://www.ijdc.net/index.php/ijdc/article/view/9.1.98/344

Data sharing is a difficult process for both the data producer and the data reuser. Both parties are faced with more disincentives than incentives. Data producers need to sink time and resources into adding metadata for data to be findable and usable, and there is no promise of receiving credit for this effort. Making data available also leaves data producers vulnerable to being scooped or data misuse. Data reusers also need to sink time and resources into evaluating data and trying to understand them, making collecting their own data a more attractive option. In spite of these difficulties, some data producers are looking for new ways to make data sharing and reuse a more viable option. This paper presents two cases from the surface and climate modeling communities, where researchers who produce data are reaching out to other researchers who would be interested in reusing the data. These cases are evaluated as a strategy to identify ways to overcome the challenges typically experienced by both data producers and data reusers. By working together with reusers, data producers are able to mitigate the disincentives and create incentives for sharing data. By working with data producers, data reusers are able to circumvent the hurdles that make data reuse so challenging.

This work is licensed under a Creative Commons Attribution 2.0 UK: England & Wales License.

Wallis, Jillian C., Christine L. Borgman, Matthew S. Mayernik, and Alberto Pepe. "Moving Archival Practices Upstream: An Exploration of the Life Cycle of Ecological Sensing Data in Collaborative Field Research." International Journal of Digital Curation 3, no. 1 (2008): 114-126. http://www.ijdc.net/index.php/ijdc/article/view/67/46

Wallis, Jillian C., Elizabeth Rolando, and Christine L. Borgman. "If We Share Data, Will Anyone Use Them? Data Sharing and Reuse in the Long Tail of Science and Technology." PLoS ONE 8, no. 7 (2013): e67332. http://www.plosone.org/article/info:doi/10.1371/journal.pone.0067332

Research on practices to share and reuse data will inform the design of infrastructure to support data collection, management, and discovery in the long tail of science and technology. These are research domains in which data tend to be local in character, minimally structured, and minimally documented. We report on a ten-year study of the Center for Embedded Network Sensing (CENS), a National Science Foundation Science and Technology Center. We found that CENS researchers are willing to share their data, but few are asked to do so, and in only a few domain areas do their funders or journals require them to deposit data. Few repositories exist to accept data in CENS research areas. Data sharing tends to occur only through interpersonal exchanges. CENS researchers obtain data from repositories, and occasionally from registries and individuals, to provide context, calibration, or other forms of background for their studies. Neither CENS researchers nor those who request access to CENS data appear to use external data for primary research questions or for replication of studies. CENS researchers are willing to share data if they receive credit and retain first rights to publish their results. Practices of releasing, sharing, and reusing of data in CENS reaffirm the gift culture of scholarship, in which goods are bartered between trusted colleagues rather than treated as commodities.

This work is licensed under a Creative Commons Attribution License.

Walton, David, Roy Lowry, and Sarah Callaghan. "Data Citation and Publication by NERC's Environmental Data Centres." Ariadne, no. 68 (2012). http://www.ariadne.ac.uk/issue68/callaghan-et-al

Wang, Wei Min, Tobias Göpfert, and Rainer Stark. "Data Management in Collaborative Interdisciplinary Research Projects—Conclusions from the Digitalization of Research in Sustainable Manufacturing." ISPRS International Journal of Geo-Information 5, no. 4 (2016): 41. http://dx.doi.org/10.3390/ijgi5040041

As research topics become increasingly complex, large scale interdisciplinary research projects are commonly established to foster cross-disciplinary cooperation and to utilize potential synergies. In the case of the Collaborative Research Center (CRC) 1026, 19 individual projects from different disciplines are brought together to investigate perspectives and solutions for sustainable manufacturing. Beside overheads regarding the coordination of activities and communication, such interdisciplinary projects are also facing challengs regarding data management. For exchange and combination of research results, data from individual projects have to be stored systematically, categorized, and linked according to the logical interrelations of the involved disciplinary knowledge domains. In the CRC 1026, the project for information infrastructure observed and analysed collaboration practices and developed IT-supported solutions to facilitate and foster research collaboration. Data management measures in this period were mainly focused on building a shared conceptual framework, and the organization of task related data. For the former aspect, an ontology basesd apporach was developed and prototypically implemented. For the latter aspect, a message board integrated task management system was developed and applied.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Wanga, Minglu, and Bonnie L. Fonga. "Embedded Data Librarianship: A Case Study of Providing Data Management Support for a Science Department." Science & Technology Libraries 34, no. 3 (2015): 228-240. https://rucore.libraries.rutgers.edu/rutgers-lib/47849/

Ward, Catharine, Lesley Freiman, Sarah Jones, Laura Molloy, and Kellie Snow. "Making Sense: Talking Data Management with Researchers." International Journal of Digital Curation 6, no. 2 (2011): 265-273. http://www.ijdc.net/index.php/ijdc/article/view/197/262

Weber, Andreas, and Claudia Piesche. "Requirements on Long-Term Accessibility and Preservation of Research Results with Particular Regard to Their Provenance." ISPRS International Journal of Geo-Information 5, no. 4 (2016): 49. http://dx.doi.org/10.3390/ijgi5040049

Since important national and international funders of research projects require statements on the long-term accessibility of research results, many new solutions appeared to fulfil these demands. The solutions are implemented on various scopes, starting from specific solutions for one research group up to solutions with a national focus (i.e., the RADAR project). While portals for globally standardized research data (e.g., climate data) are available, there is currently no provision for the large amount of data resulting from specialized research in individual research foci, the so called long-tail of sciences. In this article we describe the considerations regarding the implementation of a local research data repository for the Collaborative Research Centre (CRC) 840. The main focus will be on the examination of requirements for, and an agenda of, a possible technical implementation. Requirements were derived from a more theoretical examination of similar projects and relevant literature, diverse discussions with researchers and project leaders, by analysis of existing publication data, and finally the prototypical implementation with refining iterations. Notably, the discussions with the researchers lead to new features going beyond the challenges of the mere long-term preservation of research data. Besides the need for an infrastructure that permits long-term preservation and retrieval of research data, our system will allow the reconstruction of the complete provenance of published research results. This requirement is a serious diversification of the problem, because it creates the need to qualify additional transformation data, describing the transformation process from primary research data to research results.

This work is licensed under a Creative Commons Attribution 4.0 International License.

Weber, Nicholas M., Carole L. Palmer, and Tiffany C. Chao. "Current Trends and Future Directions in Data Curation Research and Education." Journal of Web Librarianship 6, no. 4 (2012): 305-320. http://www.tandfonline.com/doi/full/10.1080/19322909.2012.730358

Weber, Nicholas M., Andrea K. Thomer, Matthew S. Mayernik, Bob Dattore, Zaihua Ji, and Steve Worley. "The Product and System Specificities of Measuring Curation Impact." International Journal of Digital Curation 8, no. 2 (2013): 223-234. http://www.ijdc.net/index.php/ijdc/article/view/8.2.223/330

Using three datasets archived at the National Center for Atmospheric Research (NCAR), we describe the creation of a 'data usage index' for curation-specific impact assessments. Our work is focused on quantitatively evaluating climate and weather data used in earth and space science research, but we also discuss the application of this approach to other research data contexts. We conclude with some proposed future directions for metric-based work in data curation.

This work is licensed under a Creative Commons Attribution License.

Weller, Travis, and Amalia Monroe-Gulick. "Differences in the Data Practices, Challenges, and Future Needs of Graduate Students and Faculty Members." Journal of eScience Librarianship 4, no. 1 (2015): e1070. http://escholarship.umassmed.edu/jeslib/vol4/iss1/2/

———. "Understanding Methodological and Disciplinary Differences in the Data Practices of Academic Researchers." Library Hi Tech 32., no. 3 (2014): 467-482. https://kuscholarworks.ku.edu/handle/1808/15171

Wessels, Bridgette, Rachel L. Finn, Peter Linde, Paolo Mazzetti, Stefano Nativi, Susan Riley, Rod Smallwood, Mark J. Taylor, Victoria Tsoukala, Kush Wadhwa, and Sally Wyatt. "Issues in the Development of Open Access to Research Data." Prometheus: Critical Studies in Innovation 32, no. 1 (2014): 49-66. http://www.tandfonline.com/doi/abs/10.1080/08109028.2014.956505

Westra, Brian. "Data Services for the Sciences: A Needs Assessment." Ariadne, no. 64 (2010). http://www.ariadne.ac.uk/issue64/westra/

Westra, Brian, Marisa Ramirez, Susan Wells Parham, and Jeanine Marie Scaramozzino. "Science and Technology Resources on the Internet: Selected Internet Resources on Digital Research Data Curation." Issues in Science and Technology Librarianship, no. 63 (2010). http://www.istl.org/10-fall/internet2.html

Wheeler, Jonathan, and Karl Benedict. "Functional Requirements Specification for Archival Asset Management: Identification and Integration of Essential Properties of Services-Oriented Architecture Products." Journal of Map & Geography Libraries 11, no. 2 (2015): 155-179. http://dx.doi.org/10.1080/15420353.2015.1035474

White, Hollie C. "Considering Personal Organization: Metadata Practices of Scientists." Journal of Library Metadata 10, no. 2/3 (2010): 156-172. http://www.tandfonline.com/doi/full/10.1080/19386389.2010.506396

———. "Descriptive Metadata for Scientific Data Repositories: A Comparison of Information Scientist and Scientist Organizing Behaviors." Journal of Library Metadata 14, no. 1 (2014): 24-51. http://www.tandfonline.com/doi/abs/10.1080/19386389.2014.891896

White, Wendy, Dorothy Byatt, and Steve Hitchcock. DataPool: Final Report Southampton, UK: University of Southampton, 2013. http://eprints.soton.ac.uk/id/eprint/352949

Whitlock, Michael C. "Data Archiving in Ecology and Evolution: Best Practices." Trends in Ecology & Evolution 26, no. 2 (20110: 61-65. http://dx.doi.org/10.1016/j.tree.2010.11.006

Whitmire, Amanda L. "Implementing a Graduate-Level Research Data Management Course: Approach, Outcomes, and Lessons Learned." Journal of Librarianship and Scholarly Communication 3, no. 2 (2015): eP1246. http://doi.org/10.7710/2162-3309.1246

INTRODUCTION As data-driven research becomes the norm, practical knowledge in data stewardship is critical for researchers. Despite its growing importance, formal education in research data management (RDM) is rare at the university level. Academic librarians are now playing a leadership role in developing and providing RDM training and support to faculty and graduate students. This case study describes the development and implementation of a new, credit-bearing course in RDM for graduate students from all disciplines. DESCRIPTION OF PROGRAM The purpose of the course was to enable students to acquire foundational knowledge and skills in RDM that would support long-term habits in the planning, management, preservation, and sharing of research data. The pedagogical approach for the course combined outcomescentered course design with active learning techniques. Periodic course assessment was performed through anonymous student surveys, with the objective of gauging course efficacy and quality, and to obtain suggested modifications or improvements. These assessment results indicated that the course content and scope were appropriate and that the active learning approach was effective. Assessments of student learning demonstrated that all major learning objectives were achieved. NEXT STEPS Information derived from the student surveys was used to determine how the course could be modified to improve student experience and the overall quality of the course and the instruction.

This work is licensed under a Creative Commons Attribution 4.0 License.

Whitmire, Amanda L., Michael Boock, and Shan C. Sutton. "Variability in Academic Research Data Management Practices: Implications for Data Services Development from a Faculty Survey " Program 49, no. 4 (2015): 382-407. https://ir.library.oregonstate.edu/xmlui/handle/1957/57240

Whyte, Angus, Dominic Job, Stephen Giles, and Stephen Lawrie. "Meeting Curation Challenges in a Neuroimaging Group." International Journal of Digital Curation 3, no. 1 (2008): 171-181. http://www.ijdc.net/index.php/ijdc/article/view/74/53

Whyte, Angus, and Graham Pryor. "Open Science in Practice: Researcher Perspectives and Participation." International Journal of Digital Curation 6, no. 1 (2011): 199-213. http://www.ijdc.net/index.php/ijdc/article/view/173/241

Wiley, Christie A. "An Analysis of Datasets within Illinois Digital Environment for Access to Learning and Scholarship (IDEALS), the University of Illinois Urbana-Champaign Repository." Journal of eScience Librarianship 4, no. 2 (2015): e1081. http://escholarship.umassmed.edu/jeslib/vol4/iss2/3/

———. "Metadata Use in Research Data Management." Bulletin of the American Society for Information Science and Technology 40, no. 6 (2014): 38-40. http://www.asis.org/Bulletin/Aug-14/AugSep14_Wiley.pdf

Williams, Sarah C. "Data Practices in the Crop Sciences: A Review of Selected Faculty Publications." Journal of Agricultural & Food Information 13, no. 4 (2012): 308-325. http://www.tandfonline.com/doi/full/10.1080/10496505.2012.717846

———. "Data Sharing Interviews with Crop Sciences Faculty: Why They Share Data and How the Library Can Help." Issues in Science and Technology Librarianship, no. 72 (2013). http://www.istl.org/13-spring/refereed2.html

———. "Gathering Feedback from Early-Career Faculty: Speaking with and Surveying Agricultural Faculty Members about Research Data." Journal of eScience Librarianship 2, no. 2 (2013): e1048. http://dx.doi.org/10.7191/jeslib.2013.1048

———. "Using a Bibliographic Study to Identify Faculty Candidates for Data Services." Science & Technology Libraries 32, no. 2 (2013): 202-209. http://www.tandfonline.com/doi/abs/10.1080/0194262X.2013.774622?journalCode=wstl20#.Ub3-SfnfBsk

Willis, Craig, Jane Greenberg, and Hollie White. "Analysis and Synthesis of Metadata Goals for Scientific Data." Journal of the American Society for Information Science and Technology 63, no. 8 (2012): 1505-1520. http://scholarship.law.duke.edu/faculty_scholarship/2713/

Willmes, Christian, Daniel Kürner, and Georg Bareth. "Building Research Data Management Infrastructure using Open Source Software." Transactions in GIS, 18 (2014): 496-509. http://dx.doi.org/10.1111/tgis.12060

Wilson, Andrew. "How Much Is Enough: Metadata for Preserving Digital Data." Journal of Library Metadata 10, no. 2/3 (2010): 205-217. http://www.tandfonline.com/doi/full/10.1080/19386389.2010.506395

Wilson, James A. J., Michael A. Fraser, Luis Martinez-Uribe, Paul Jeffreys, Meriel Patrick, Asif Akram, and Tahir Mansoori. "Developing Infrastructure for Research Data Management at the University of Oxford." Ariadne, no. 65 (2010). http://www.ariadne.ac.uk/issue65/wilson-et-al

Wilson, James A. J., and Paul Jeffreys. "Towards a Unified University Infrastructure: The Data Management Roll-Out at the University of Oxford." International Journal of Digital Curation 8, no. 2 (2013): 235-246. http://www.ijdc.net/index.php/ijdc/article/view/8.2.235/331

Since presenting a paper at the International Digital Curation Conference 2010 conference entitled 'An Institutional Approach to Developing Research Data Management Infrastructure', the University of Oxford has come a long way in developing research data management (RDM) policy, tools and training to address the various phases of the research data lifecycle. Work has now begun on integrating these various elements into a unified infrastructure for the whole university, under the aegis of the Data Management Roll-out at Oxford (Damaro) Project.

This paper will explain the process and motivation behind the project, and describes our vision for the future. It will also introduce the new tools and processes created by the university to tie the individual RDM components together. Chief among these is the 'DataFinder'—a hierarchically-structured metadata cataloguing system which will enable researchers to search for and locate research datasets hosted in a variety of different datastores from institutional repositories, through Web 2 services, to filing cabinets standing in department offices. DataFinder will be able to pull and associate research metadata from research information databases and data management plans, and is intended to be CERIF compatible. DataFinder is being designed so that it can be deployed at different levels within different contexts, with higher-level instances harvesting information from lower-level instances enabling, for example, an academic department to deploy one instance of DataFinder, which can then be harvested by another at an institutional level, which can then in turn be harvested by another at a national level.

The paper will also consider the requirements of embedding tools and training within an institution and address the difficulties of ensuring the sustainability of an RDM infrastructure at a time when funding for such endeavours is limited. Our research shows that researchers (and indeed departments) are at present not exposed to the true costs of their (often suboptimal) data management solutions, whereas when data management services are centrally provided the full costs are visible and off-putting. There is, therefore, the need to sell the benefits of centrally-provided infrastructure to researchers. Furthermore, there is a distinction between training and services that can be most effectively provided at the institutional level, and those which need to be provided at the divisional or departmental level in order to be relevant and applicable to researchers. This is being addressed in principle by Oxford's research data management policy, and in practice by the planning and piloting aspects of the Damaro Project.

This work is licensed under a Creative Commons Attribution License.

Wilson, James A. J., Luis Martinez-Uribe, Michael A. Fraser, and Paul Jeffreys. "An Institutional Approach to Developing Research Data Management Infrastructure." International Journal of Digital Curation 6, no. 2 (2011): 274-287. http://www.ijdc.net/index.php/ijdc/article/view/198/263

Witt, Michael. "Co-designing, Co-developing, and Co-implementing an Institutional Data Repository Service." Journal of Library Administration 52, no. 2 (2012): 172-188. http://docs.lib.purdue.edu/lib_fsdocs/6/

———. "Institutional Repositories and Research Data Curation in a Distributed Environment." Library Trends 57, no. 2 (2008): 191-201. http://hdl.handle.net/2142/10680

Woolfrey, H. "Innovations for the Curation and Sharing of African Social Survey Data." Data Science Journal 12 (2013): WDS185-WDS188. http://datascience.codata.org/articles/abstract/10.2481/dsj.WDS-031/

A substantial amount of data is collected through surveys conducted in Africa by national statistics offices, international donor organisations, research institutions, and the private sector. Data management at African national statistics offices is hampered by limited resources. An option for data curation in African countries is the establishment of dedicated institutions for data preservation and dissemination, such as survey data archives, and research data centres. DataFirst, at the University of Cape Town, has established an African data service and is helping to improve African data curation practices through providing data, promoting free curation tools, and undertaking data management training in African countries.

This work is licensed under a Creative Commons Attribution 3.0 License.

Wright, Andrea. "Electronic Resources for Developing Data Management Skills and Data Management Plans." Journal of Electronic Resources in Medical Libraries 13, no. 1 (2016): 43-48. http://www.tandfonline.com/doi/abs/10.1080/15424065.2016.1146640?journalCode=werm20&

Wright, Sarah J., Wendy A. Kozlowski, Dianne Dietrich, Huda J. Khan, and Gail S. Steinhart. "Using Data Curation Profiles to Design the Datastar Dataset Registry." D-Lib Magazine 19, no. 7/8 (2013). http://www.dlib.org/dlib/july13/wright/07wright.html

Wright, Stephanie, Amanda Whitmire, Lisa Zilinski, and David Minor. "Collaboration and Tension between Institutions and Units Providing Data Management Support." Bulletin of the American Society for Information Science and Technology 40, no. 6 (2014): 18-21. https://www.asis.org/Bulletin/Aug-14/AugSep14_WrightEtAl.pdf

Wynholds, Laura. "Linking to Scientific Data: Identity Problems of Unruly and Poorly Bounded Digital Objects." International Journal of Digital Curation 6, no. 1 (2011): 214-225. http://www.ijdc.net/index.php/ijdc/article/view/174/242

Xia, Jingfeng. "Mandates and the Contributions of Open Genomic Data." Publications 1, no. 3 (2013): 99-112. http://dx.doi.org/10.3390/publications1030099

Yang, Yanyan, Omer F. Rana, David W. Walker, Roy Williams, Christos Georgousopoulos, Massimo Caffaro, and Giovanni Aloisio. "An Agent Infrastructure for On-Demand Processing of Remote-Sensing Archives." International Journal on Digital Libraries 5, no. 2 (2005): 120-132. http://link.springer.com/article/10.1007/s00799-003-0054-8

Yoon, Ayoung. "End Users' Trust in Data Repositories: Definition and Influences on Trust Development." Archival Science 14, no. 1 (2014): 17-34. http://link.springer.com/article/10.1007/s10502-013-9207-8

Yoon, Ayoung, and Helen Tibbo. "Examination of Data Deposit Practices in Repositories with the OAIS Model." IASSIST Quarterly 35, no. 4 (2011): 6-13. http://www.iassistdata.org/downloads/iqvol35_tibbo.pdf

Zborowski, Mary. "Data Management Activities of Canada's National Science Library—2010 Update and Prospective." Data Science Journal 9 (2011): 100-106. http://datascience.codata.org/articles/abstract/10.2481/dsj.009-026/

NRC-CISTI serves Canada as its National Science Library (as mandated by Canada's Parliament in 1924) and also provides direct support to researchers of the National Research Council of Canada (NRC). By reason of its mandate, vision, and strategic positioning, NRC-CISTI has been rapidly and effectively mobilizing Canadian stakeholders and resources to become a lead player on both the Canadian national and international scenes in matters relating to the organization and management of scientific research data. In a previous communication (CODATA International Conference, 2008), the orientation of NRC-CISTI towards this objective and its short- and medium-term plans and strategies were presented. Since then, significant milestones have been achieved. This paper presents NRC-CISTI's most recent activities in these areas, which are progressing well alongside a strategic organizational redesign process that is realigning NRC-CISTI's structure, mission, and mandate to better serve its clients. Throughout this transformational phase, activities relating to data management remain vibrant.

This work is licensed under a Creative Commons Attribution 3.0 License.

Zenk-Möltgen, Wolfgang, and Greta Lepthien. "Data Sharing in Sociology Journals." Online Information Review 38, no. 6 (2014): 709-722. http://www.emeraldinsight.com/doi/abs/10.1108/OIR-05-2014-0119

Zilinski, Lisa, David Scherer, Darcy Bullock, Deborah Horton, Courtney Matthews. "Evolution of Data Creation, Management, Publication, and Curation in the Research Process." Transportation Research Record: Journal of the Transportation Research Board 2414 (2014): 9-19. http://dx.doi.org/10.3141/2414-02

Zilinski, Lisa D., Amy Barton, Tao Zhan, Line Pouchard, and Pete Pascuzzi. "RDAP Review: Research Data Integration in the Purdue Libraries." Bulletin of the Association for Information Science and Technology 42, no. 2 (2016): 33-37. https://www.asist.org/publications/bulletin/decemberjanuary-2016/rdap-review-research-data-integration-in-the-purdue-libraries/

Zilinski, Lisa D., Abigail Gobens, and Kristin Briney. "University Data Policies and Library Data Services: Who Owns Your Data?" Bulletin of the Association for Information Science and Technology 41, no. 6 (2015): 32-34. https://www.asist.org/publications/bulletin/aug-2015/who-owns-your-data/

Zimmerman, Ann. "Not by Metadata Alone: The Use of Diverse Forms of Knowledge to Locate Data for Reuse." International Journal on Digital Libraries 7, no. 1/2 (2007): 5-16. http://link.springer.com/article/10.1007/s00799-007-0015-8

Note on the Inclusion of Abstracts

Abstracts are included in this bibliography if the work is under a Creative Commons Attribution License (BY and national/international variations), a Creative Commons public domain dedication (CC0), or a Creative Commons Public Domain Mark and this is clearly indicated in the work.

If the version of the Creative Commons Attribution License is indicated, this information is included. Otherwise, the fact that it is under a Creative Commons Attribution License is noted.

Abstracts for works under the following types of Creative Commons Licenses (and their national/international variations) are not included:

  • Attribution-NoDerivs
  • Attribution-NonCommercial
  • Attribution-NonCommercial-NoDerivs
  • Attribution-NonCommercial-ShareAlike
  • Attribution-ShareAlike

If there is a Creative Commons symbol in the article that does not indicate what license is applicable and it is not linked to a Creative Commons license, the abstract for the work is not included.

See the Creative Commons' Frequently Asked Questions for a discussion of how documents under different Creative Commons licenses can be combined.

Related Bibliographies and Webliographies

Bailey, Charles W., Jr. Digital Curation and Preservation Bibliography 2010. Houston: Digital Scholarship, 2011. http://digital-scholarship.org/dcpb/dcpb2010.htm

———. Digital Curation Bibliography: Preservation and Stewardship of Scholarly Works. Houston: Digital Scholarship, 2012. http://digital-scholarship.org/dcbw/dcb.htm

———. Digital Curation Bibliography: Preservation and Stewardship of Scholarly Works, 2012 Supplement Houston: Digital Scholarship, 2013. http://digital-scholarship.org/dcbw/s1/dcbw-s1.htm

———. Digital Curation Resource Guide Houston: Digital Scholarship, 2012. http://digital-scholarship.org/dcrg/dcrg.htm

About the Author

Charles W. Bailey, Jr. is a digital artist and the publisher of Digital Scholarship.

Bailey transforms photographs into digital artworks using specialized Photoshop plug-ins and art programs, such as Alien Skin Snap Art 4 and Topaz Impression. He primarily creates digital oil and impasto paintings and charcoal, oil pastel, and pastel drawings. He has made over 370 digital artworks freely available on 500px, Flickr (primary site), Google+, and other social media sites, providing detailed information about how each artwork was created. They have been viewed over four million times.

Bailey has over 30 years of information and instructional technology experience, including 24 years of managerial experience in academic libraries. From 2004 to 2007, he was the Assistant Dean for Digital Library Planning and Development at the University of Houston Libraries. From 1987 to 2003, he served as Assistant Dean/Director for Systems at the University of Houston Libraries.

Previously, he served as Head, Systems and Research Services at the Health Sciences Library, The University of North Carolina at Chapel Hill; Systems Librarian at the Milton S. Eisenhower Library, The Johns Hopkins University; User Documentation Specialist at the OCLC Online Computer Library Center; and Media Library Manager at the Learning Resources Center, SUNY College at Oswego.

Bailey has discussed his career in an interview in Preservation, Digital Technology & Culture. See Bailey's vita for more details.

Bailey has been an open access publisher for over 25 years. In 1989, Bailey established PACS-L, a discussion list about public-access computers in libraries, and The Public-Access Computer Systems Review, the first open access journal in the field of library and information science. He served as PACS-L Moderator until November 1991 and as Editor-in-Chief of The Public-Access Computer Systems Review until the end of 1996.

In 1990, Bailey and Dana Rooks established Public-Access Computer Systems News, an electronic newsletter, and Bailey co-edited this publication until 1992.

In 1992, he founded the PACS-P mailing list for announcing the publication of selected e-serials, and he moderated this list until 2007.

In 1996, he established the Scholarly Electronic Publishing Bibliography (SEPB), an open access book that was updated 80 times.

In 2001, he added the Scholarly Electronic Publishing Weblog, which announces relevant new publications, to SEPB.

In 2001, he was selected as a team member of Current Cites, and he has subsequently been a frequent contributor of reviews to this monthly e-serial.

In 2005, he published the Open Access Bibliography: Liberating Scholarly Literature with E-prints and Open Access Journals with the Association of Research Libraries (also a website).

In 2005, Bailey established Digital Scholarship (http://digital-scholarship.org/), which provides information and commentary about digital copyright, digital curation, digital repository, open access, scholarly communication, and other digital information issues. Digital Scholarship's digital publications are open access. Its publications are under Creative Commons licenses.

At that time, he also established DigitalKoans, a weblog that covers the same topics as Digital Scholarship.

From April 2005 through May 2016, Digital Scholarship had over 16.6 million visitors from 232 counties, over 80.2 million file requests, and over 58.5 million page views. Excluding spiders, there were over 10 million visitors from 232 counties, over 46.8 million file requests, and over 26.3 million page views.

During this period, Bailey published the following books and book supplements: the Scholarly Electronic Publishing Bibliography: 2008 Annual Edition (2009), Digital Scholarship 2009 (2010), Transforming Scholarly Publishing through Open Access: A Bibliography (2010), the Scholarly Electronic Publishing Bibliography 2010 (2011), the Digital Curation and Preservation Bibliography 2010 (2011), the Institutional Repository and ETD Bibliography 2011 (2011), the Digital Curation Bibliography: Preservation and Stewardship of Scholarly Works (2012), and the Digital Curation Bibliography: Preservation and Stewardship of Scholarly Works, 2012 Supplement (2013).

He also published and updated the following bibliographies and webliographies as websites with links to freely available works: the Scholarly Electronic Publishing Bibliography (1996-2011), the Electronic Theses and Dissertations Bibliography (2005-2012), the Google Books Bibliography (2005-2011), the Institutional Repository Bibliography (2009-2011), the Open Access Journals Bibliography (2010), the Digital Curation and Preservation Bibliography (2010-2011), the E-science and Academic Libraries Bibliography (2011), the Digital Curation Resource Guide (2012), the Research Data Curation Bibliography (2012-2016), the Altmetrics Bibliography (2013), and the Transforming Peer Review Bibliography (2014).

In 2011, he established the LinkedIn Digital Curation Group.

For more details, see the "Digital Scholarship Publications Overview."

In 2010, Bailey was given a Best Content by an Individual Award by The Charleston Advisor. In 2003, he was named as one of Library Journal's "Movers & Shakers." In 1993, he was awarded the first LITA/Library Hi Tech Award For Outstanding Communication for Continuing Education in Library and Information Science.

In 1973, Bailey won a Wallace Stevens Poetry Award. He is the author of The Cave of Hypnos: Early Poems, which includes several poems that won that award.

Bailey has written over 30 papers about digital copyright, expert systems, institutional repositories, open access, scholarly communication, and other topics.

He has served on the editorial boards of Information Technology and Libraries, Library Software Review, and Reference Services Review.

He holds master's degrees in information and library science and instructional media and technology.

You can follow Bailey at these URLs:

His e-mail address is cb at digital-scholarship.org.