Digital Curation & Digital Preservation – Page 30

Here's an excerpt:

Finding or discovering data is a necessary precursor to being able to reuse data, although relatively little large-scale empirical evidence exists about how researchers discover, make sense of and (re)use data for research. This study presents evidence from the largest known survey investigating how researchers discover and use data that they do not create themselves. We examine the data needs and discovery strategies of respondents, propose a typology for data (re)use and probe the role of social interactions and other research practices in data discovery, with the aim of informing the design of community-centric solutions and policies.

Research Data Curation Bibliography, Version 10 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"Practical Application of a Data Stewardship Maturity Matrix for the NOAA OneStop Project"

Ge Peng et al. have published "Practical Application of a Data Stewardship Maturity Matrix for the NOAA OneStop Project" in the Data Science Journal.

Here's an excerpt:

Assessing the stewardship maturity of individual datasets is an essential part of ensuring and improving the way datasets are documented, preserved, and disseminated to users. It is a critical step towards meeting U.S. federal regulations, organizational requirements, and user needs. However, it is challenging to do so consistently and quantifiably. The Data Stewardship Maturity Matrix (DSMM), developed jointly by NOAA's National Centers for Environmental Information (NCEI) and the Cooperative Institute for Climate and Satellites–North Carolina (CICS-NC), provides a uniform framework for consistently rating stewardship maturity of individual datasets in nine key components: preservability, accessibility, usability, production sustainability, data quality assurance, data quality control/monitoring, data quality assessment, transparency/traceability, and data integrity. So far, the DSMM has been applied to over 800 individual datasets that are archived and/or managed by NCEI, in support of the NOAA's OneStop Data Discovery and Access Framework Project. As a part of the OneStop-ready process, tools, implementation guidance, workflows, and best practices are developed to assist the application of the DSMM and described in this paper. The DSMM ratings are also consistently captured in the ISO standard-based dataset-level quality metadata and citable quality descriptive information documents, which serve as interoperable quality information to both machine and human end-users. These DSMM implementation and integration workflows and best practices could be adopted by other data management and stewardship projects or adapted for applications of other maturity assessment models.

Research Data Curation Bibliography, Version 10 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"After the Digital Revolution: Working with Emails and Born-Digital Records in Literary and Publishers’ Archives"

https://doi.org/10.1080/01576895.2019.1640555

Paywall Article: "Negotiating the Born-Digital: A Problem of Search"

https://doi.org/10.1080/01576895.2019.1640753

"Skills, Standards, and Sapp Nelson’s Matrix: Evaluating Research Data Management Workshop Offerings"

Philip Espinola Coombs et al. have published "Skills, Standards, and Sapp Nelson's Matrix: Evaluating Research Data Management Workshop Offerings" in the Journal of eScience Librarianship.

Here's an excerpt:

Here's an excerpt from the announcement:

Objective: To evaluate library workshops on their coverage of data management topics.

Methods: We used a modified version of Sapp Nelson’s Competency Matrix for Data Management Skills, a matrix of learning goals organized by data management competency and complexity level, against which we compared our educational materials: slide decks and worksheets. We examined each of the educational materials against the 333 learning objectives in our modified version of the Matrix to determine which of the learning objectives applied.

Conclusions: We found it necessary to change certain elements of the Matrix’s structure to increase its clarity and functionality: reinterpreting the behaviors, shifting the organization from the three domains of Bloom’s taxonomy to increasing complexity solely within the cognitive domain, as well as creating a comprehensive identifier schema. We appreciated the Matrix for its specificity of learning objectives, its organizational structure, the comprehensive range of competencies included, and its ease of use. On the whole, the Matrix is a useful instrument for the assessment of data management programming.

Research Data Curation Bibliography, Version 10 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"Building A National Research Data Management Course for Health Information Professionals"

Jessica Van Der Volgen and Shirley Zhao have published "Building A National Research Data Management Course for Health Information Professionals" in the Journal of eScience Librarianship.

Here's an excerpt:

Background: In August 2017 the National Network of Libraries of Medicine Training Office (NTO) was awarded an administrative supplement from the National Library of Medicine (NLM) to create training for librarians in biomedical and health research data management (RDM). The primary goal of the training was to enable information professionals to initiate or enhance RDM at their institutions.

Case Presentation: An eight-week online course was developed to address key concepts in RDM. Each module was organized around measurable learning objectives using existing subject resources, such as readings, tutorials, and videos. Within each module, an expert in the field co-facilitated relevant discussions, created and graded a practical assignment, and answered questions. Thirty-eight participants were selected for this initial cohort. Mentors were assigned to each participant for guidance in completing a required project action plan to further their RDM goals at their institution. The course was evaluated through pre- and post-tests and an online questionnaire.

Results: Thirty participants successfully completed the online course work and project, and gathered at the National Institutes of Health for a Capstone Summit. Students demonstrated improved knowledge of RDM concepts between the pre- and post-tests. Most students also self-reported increased skill and confidence. Practical assignments with individual feedback from experienced data librarians were the most valued aspect of the course. Time to complete each module was underestimated.

Conclusions: The initial offering of this training program improved the RDM skills and knowledge of participants and enabled students to add or enhance services at their institutions. Further investigations are necessary to determine the longer-term impact on the individuals and their libraries. While many of the participants will need additional training to become part of the data-ready workforce of health information professionals, completing this training is an important step in their professional development.

Research Data Curation Bibliography, Version 10 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

Paywall Article: "The Ownership and Control of Online Photos and Game Data: Patterns, Trends, and Keeping Pace with Evolving Circumstances"

https://doi.org/10.1109/JCDL.2019.00022

Paywall Article: "Digital Curation at Work: Modeling Workflows for Digital Archival Materials"

https://doi.org/10.1109/JCDL.2019.00016

"Digitization and the Future of Natural History Collections"

Brandon Hedrick et al. have self-archived "Digitization and the Future of Natural History Collections."

Here's an excerpt:

Natural history collections (NHCs) are the foundation of historical baselines for assessing anthropogenic impacts on biodiversity. Along these lines, the online mobilization of specimens via digitization–the conversion of specimen data into accessible digital content–has greatly expanded the use of NHC collections across a diversity of disciplines. We broaden the current vision of digitization (Digitization 1.0)–whereby specimens are digitized within NHCs–to include new approaches that rely on digitized products rather than the physical specimen (Digitization 2.0). Digitization 2.0 builds upon the data, workflows, and infrastructure produced by Digitization 1.0 to create digital-only workflows that facilitate digitization, curation, and data linkages, thus returning value to physical specimens by creating new layers of annotation, empowering a global community, and developing automated approaches to advance biodiversity discovery and conservation. These efforts will transform large-scale biodiversity assessments to address fundamental questions including those pertaining to critical modern issues of global change.

Research Data Curation Bibliography, Version 10 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"Our Journey to Digital Curation of the Jeghers Medical Index"

https://dx.doi.org/10.5195%2Fjmla.2017.47

"The MASi Repository Service—Comprehensive, Metadata-Driven and Multi-Community Research Data Management"

Richard Grunzke et al. have published "The MASi Repository Service—Comprehensive, Metadata-Driven and Multi-Community Research Data Management" in Future Generation Computer Systems.

Here's an excerpt:

Here, we present the architecture and developments of the Metadata Management for Applied Sciences (MASi) project that is currently building a comprehensive research data management service. MASi extends the existing KIT Data Manager framework by a generic metadata programming interface and a generic graphical web interface. Furthermore, MASi is OAI compliant and supports the OAI-PMH protocol while providing support for provenance information using ProvONE, a well-established and accepted provenance model. To illustrate the practical applicability of the MASi service, we present the adoption of initial use cases within geography, chemistry and digital humanities.

Research Data Curation Bibliography, Version 10 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"UC San Diego Library and Texas Digital Library Collaborate to Transform Digital Preservation Possibilities for Sensitive Data"

https://library.ucsd.edu/news-events/uc-san-diego-library-and-texas-digital-library-collaborate-to-transform-digital-preservation-possibilities-for-sensitive-data/

"Building Open Access to Research (OAR) Data Infrastructure at NIST"

Gretchen Greene, Raymond Plante, and Robert Hanisch have published "Building Open Access to Research (OAR) Data Infrastructure at NIST" in Data Science Journal.

Here's an excerpt:

As a National Metrology Institute (NMI), the USA National Institute of Standards and Technology (NIST) scientists, engineers and technology experts conduct research across a full spectrum of physical science domains. NIST is a non-regulatory agency within the U.S. Department of Commerce with a mission to promote U.S. innovation and industrial competitiveness by advancing measurement science, standards, and technology in ways that enhance economic security and improve our quality of life. NIST research results in the production and distribution of standard reference materials, calibration services, and datasets. These are generated from a wide range of complex laboratory instrumentation, expert analyses, and calibration processes. In response to a government open data policy, and in collaboration with the broader research community, NIST has developed a federated Open Access to Research (OAR) scientific data infrastructure aligned with FAIR (Findable, Accessible, Interoperable, Reusable) data principles. Through the OAR initiatives, NIST's Material Measurement Laboratory Office of Data and Informatics (ODI) recently released a new scientific data discovery portal and public data repository. These science-oriented applications provide dissemination and public access for data from across the broad spectrum of NIST research disciplines, including chemistry, biology, materials science (such as crystallography, nanomaterials, etc.), physics, disaster resilience, cyberinfrastructure, communications, forensics, and others. NIST’s public data consist of carefully curated Standard Reference Data, legacy high valued data, and new research data publications. The repository is thus evolving both in content and features as the nature of research progresses. Implementation of the OAR infrastructure is key to NIST’s role in sharing high integrity reproducible research for measurement science in a rapidly changing world.

Research Data Curation Bibliography, Version 10 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"The Lives and After Lives of Data"

Christine L. Borgman has published "The Lives and After Lives of Data" in the Harvard Data Science Review.

Here's an excerpt:

The most elusive term in data science is 'data.' While often treated as objects to be computed upon, data is a theory-laden concept with a long history. Data exist within knowledge infrastructures that govern how they are created, managed, and interpreted. By comparing models of data life cycles, implicit assumptions about data become apparent. In linear models, data pass through stages from beginning to end of life, which suggest that data can be recreated as needed. Cyclical models, in which data flow in a virtuous circle of uses and reuses, are better suited for irreplaceable observational data that may retain value indefinitely. In astronomy, for example, observations from one generation of telescopes may become calibration and modeling data for the next generation, whether digital sky surveys or glass plates. The value and reusability of data can be enhanced through investments in knowledge infrastructures, especially digital curation and preservation. Determining what data to keep, why, how, and for how long, is the challenge of our day.

Research Data Curation Bibliography, Version 10 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

Research Data Curation Bibliography, Version 10 PDF Released

Digital Scholarship has released a PDF of the Research Data Curation Bibliography, Version 10.

Created from the HTML file, this unpaginated PDF with basic formatting makes it easier to print the lengthy bibliography.

Research Data Curation Bibliography, Version 10 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"Digital Curation at Work: Modeling Workflows for Digital Archival Materials"

Colin Post et al. have self-archived "Digital Curation at Work: Modeling Workflows for Digital Archival Materials."

Here's an excerpt:

This paper describes and compares digital curation workflows from 12 cultural heritage institutions that vary in size, nature of digital collections, available resources, and level of development of digital curation activities.

Research Data Curation Bibliography, Version 9 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

Share–Publish–Store–Preserve. Methodologies, Tools and Challenges for 3D Use in Social Sciences and Humanities

Anas Alaoui M'Darhri et al. have self-archived "Share–Publish–Store–Preserve. Methodologies, Tools and Challenges for 3D Use in Social Sciences and Humanities."

Here's an excerpt:

Through this White Paper, which gathers contributions from experts of 3D data as well as professionals concerned with the interoperability and sustainability of 3D research data, the PARTHENOS project aims at highlighting some of the current issues they have to face, with possible specific points according to the discipline, and potential practices and methodologies to deal with these issues. During the workshop, several tools to deal with these issues have been introduced and confronted with the participants experiences, this White Paper now intends to go further by also integrating participants feedbacks and suggestions of potential improvements. Therefore, even if the focus is put on specific tools, the main goal is to contribute to the development of standardized good practices related to the sharing, publication, storage and long-term preservation of 3D data.

Research Data Curation Bibliography, Version 9 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"Supporting Web Archiving via Web Packaging"

Sawood Alam et al. have self-archived "Supporting Web Archiving via Web Packaging."

Here's an excerpt:

We describe challenges related to web archiving, replaying archived web resources, and verifying their authenticity. We show that Web Packaging has significant potential to help address these challenges and identify areas in which changes are needed in order to fully realize that potential.

Research Data Curation Bibliography, Version 9 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"Curation as ‘Interoperability With the Future’: Preserving Scholarly Research Software in Academic Libraries"

https://doi.org/10.1002/asi.24244

Software Citation Implementation Challenges

Daniel S. Katz et al. have self-archived "Software Citation Implementation Challenges."

Here's an excerpt:

The main output of the FORCE11 Software Citation working group (this https URL) was a paper on software citation principles (this https URL) published in September 2016. This paper laid out a set of six high-level principles for software citation (importance, credit and attribution, unique identification, persistence, accessibility, and specificity) and discussed how they could be used to implement software citation in the scholarly community. In a series of talks and other activities, we have promoted software citation using these increasingly accepted principles. At the time the initial paper was published, we also provided guidance and examples on how to make software citable, though we now realize there are unresolved problems with that guidance. The purpose of this document is to provide an explanation of current issues impacting scholarly attribution of research software, organize updated implementation guidance, and identify where best practices and solutions are still needed.

Research Data Curation Bibliography, Version 9 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"The Administrative Load of Sharing Sensitive Data—Challenges and Solutions?"

Kirsty Merrett et al. have published "The Administrative Load of Sharing Sensitive Data—Challenges and Solutions?" in the International Journal of Digital Curation.

Here's an excerpt:

Sharing data openly has become a straightforward process at the University of Bristol. The University's top funders mandate or recommend data sharing as a condition of funding, and many publishers require access to research data to enable results of published articles to be verified. The University has provided a dedicated data repository to support this since 2015, and demand for open publication has risen steadily since its inception. However, an increasing number of requests for sharing data relate to data that has ethical, legal or commercial sensitivities and so cannot be published openly.

Rather than discuss the wide-ranging ethical implications of data sharing, this practice paper will focus on the secure sharing of sensitive data that has ethical approval and, where required, has the necessary consent in place, from the perspective of an institution that has already decided to undertake the work inherent in sharing sensitive data. The specific purpose is to detail the workflow and administrative tasks integral in this and to highlight the types of challenges encountered.

Research Data Curation Bibliography, Version 9 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"From Passive to Active, from Generic to Focussed: How Can an Institutional Data Archive Remain Relevant in a Rapidly Evolving Landscape?"

Maria J. Cruz et al. have published "From Passive to Active, From Generic to Focussed: How Can an Institutional Data Archive Remain Relevant in a Rapidly Evolving Landscape? " in the International Journal of Digital Curation.

Here's an excerpt:

Founded in 2008 as an initiative of the libraries of three of the four technical universities in the Netherlands, the 4TU.Centre for Research Data (4TU.Research Data) has provided a fully operational, cross-institutional, long-term archive since 2010, storing data from all subjects in applied sciences and engineering. Presently, over 90% of the data in the archive is geoscientific data coded in netCDF (Network Common Data Form)—a data format and data model that, although generic, is mostly used in climate, ocean and atmospheric sciences. In this practice paper, we explore the question of how 4TU.Research Data can stay relevant and forward-looking in a rapidly evolving research data management landscape. In particular, we describe the motivation behind this question and how we propose to address it.

Research Data Curation Bibliography, Version 9 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

Category: Digital Curation & Digital Preservation

"Lost or Found? Discovering Data Needed for Research"

"1.71 Billion Websites Now Exist; Most Are Dead"

"The Australian Research Data Commons"

"Lost or Found? Discovering Data Needed for Research"

"Practical Application of a Data Stewardship Maturity Matrix for the NOAA OneStop Project"

"After the Digital Revolution: Working with Emails and Born-Digital Records in Literary and Publishers’ Archives"

Paywall Article: "Negotiating the Born-Digital: A Problem of Search"

"Skills, Standards, and Sapp Nelson’s Matrix: Evaluating Research Data Management Workshop Offerings"

"Building A National Research Data Management Course for Health Information Professionals"

Paywall Article: "The Ownership and Control of Online Photos and Game Data: Patterns, Trends, and Keeping Pace with Evolving Circumstances"

Paywall Article: "Digital Curation at Work: Modeling Workflows for Digital Archival Materials"

"Digitization and the Future of Natural History Collections"

"Our Journey to Digital Curation of the Jeghers Medical Index"

"The MASi Repository Service—Comprehensive, Metadata-Driven and Multi-Community Research Data Management"

"UC San Diego Library and Texas Digital Library Collaborate to Transform Digital Preservation Possibilities for Sensitive Data"

"Building Open Access to Research (OAR) Data Infrastructure at NIST"

"The Lives and After Lives of Data"

Research Data Curation Bibliography, Version 10 PDF Released

"Digital Curation at Work: Modeling Workflows for Digital Archival Materials"

Share–Publish–Store–Preserve. Methodologies, Tools and Challenges for 3D Use in Social Sciences and Humanities

"Supporting Web Archiving via Web Packaging"

"Curation as ‘Interoperability With the Future’: Preserving Scholarly Research Software in Academic Libraries"

Software Citation Implementation Challenges

"The Administrative Load of Sharing Sensitive Data—Challenges and Solutions?"

"From Passive to Active, from Generic to Focussed: How Can an Institutional Data Archive Remain Relevant in a Rapidly Evolving Landscape?"