Scholarly Communication Librarianship and Open Knowledge


Scholarly Communication Librarianship and Open Knowledge is an open textbook and practitioner’s guide that collects theory, practice, and case studies from nearly 80 experts in scholarly communication and open education. Divided into three parts:

  • What is Scholarly Communication?
  • Scholarly Communication and Open Culture
  • Voices from the Field: Perspectives, Intersections, and Case Studies

The book delves into the economic, social, policy, and legal aspects of scholarly communication as well as open access, open data, open education, and open science and infrastructure. Practitioners provide insight into the relationship between university presses and academic libraries, defining collection development as operational scholarly communication, and promotion and tenure and the challenge for open access.

https://bit.ly/SCLAOK

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

The State of Open Data 2023


Support is not making its way to those who need it

Almost three-quarters of respondents had never received support with making their data openly available.

One size does not fit all

Variations in responses from different subject expertise and geographies highlight a need for a more nuanced approach to research data management support globally.

Challenging stereotypes

Are later career academics really opposed to progress? The results of the 2023 survey indicate that career stage is not a significant factor in open data awareness or support levels.

Credit is an ongoing issue

For eight years running, our survey has revealed a recurring concern among researchers: the perception that they don’t receive sufficient recognition for openly sharing their data.

AI awareness hasn’t translated to action

For the first time, this year we asked survey respondents to indicate if they were using ChatGPT or similar AI tools for data collection, processing and metadata creation.

https://tinyurl.com/mr28u6tt

| Research Data Publication and Citation Bibliography | Research Data Sharing and Reuse Bibliography | Research Data Curation and Management Bibliography | Digital Scholarship |

Paywall: "Cutting the Gordian (Workload) Knot? Adding Data Services to Academic Library Public Services"


The library and information science literature wants two irreconcilable things out of its workload data: 1) aggregate comparable data to document and measure use of libraries and its value; and 2) accurate descriptions to document and measure the individual work done by librarians. . . . We propose here to change the question asked: how can we achieve a reasonable balance of workload within a group of librarians? . . . The goal was to answer a common and longstanding question: we are in continual process of assessing what needs to be done and how/where to shift workloads, but how do we know we’re doing it in a reasonable and fair way beyond anecdotes and intuitions? We developed a weighted measure of public services workload in order to assess and track and assign a) areas of declining workload, b) areas of increasing workload (data services), and c) a balance between library divisions contributing to public services.

https://doi.org/10.1016/j.acalib.2023.102801

| Research Data Publication and Citation Bibliography | Research Data Sharing and Reuse Bibliography | Research Data Curation and Management Bibliography | Digital Scholarship |

"Structural Elements and Spheres of Expertise: Creating a Healthy Ecosystem for Cultural Data Initiatives"


While technology affords creation of digital collections, and promises access to all, the reality is that many cultural data collections exist in a precarious ecosystem, where erratic funding, fragmented support, and disconnected expertise threaten their continued existence. As a significant branch of the broader information ecosystem, cultural data collections range in size and scope, from national institutions to bespoke local collections supported by individuals. This exploratory, qualitative study engaged cultural data experts in Australia, Canada, and the United Kingdom to map the broad cultural data ecosystem and to identify opportunities for healthier growth. The development and maintenance of cultural data collections requires integration across the spheres of expertise of creators, curators, subject matter experts, information science, and computing and technology. The foundational structural elements of the ecosystem include funding, policies, access to existing data, community context, and technological infrastructure. The key elements of a healthy data ecosystem are clarity of purpose, user-focused design, sustainability, allied coproduction, and reciprocal interconnection. A healthier cultural data ecosystem means more collections and initiatives will have positive impacts for research, knowledge, and diverse communities, contributing positively to the broader information ecosystem and to society, at large.

https://doi.org/10.1002/asi.24849

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

NYU: "From Silos to Community: The Path to a Holistic Digital Preservation Policy"


While New York University Libraries has a long history of and commitment to digital collecting and preservation efforts, the institution did not have any policies governing the services and activities of digital preservation prior to 2022. This paper details the creation of a holistic digital preservation policy statement, with contributors from across ten functional units at NYU Libraries. The policy was grounded in the Libraries’ mission and values–including deep commitments to inclusion, diversity, belonging, equity, and accessibility–and drew on themes crafted by all members of the group to ensure their work was represented in the statement. The success of the policy group was rooted in its intentional formation and processes that acknowledged the distributed nature of digital preservation and emphasized the creation of a community of practice. Further, it laid the foundation for a more complete suite of preservation policies and forward-looking conversations about how to enact ethical and sustainable stewardship in digital collecting, access, and preservation practices

https://www.ideals.illinois.edu/items/128309

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Training to Act FAIR: A Pre-post Study on Teaching FAIR Guiding Principles to (Future) Researchers in Higher Education."


Before FAIR training, 81.1% of students suggest scientific actions not in line with the FAIR guiding principles. However, after the training, there is a 3.75-fold increase in scientific actions that adhere to these principles. Interestingly, the training does not significantly impact how students justify FAIR actions. The study observes a positive correlation between the presence of university legal frameworks on FAIR guiding principles and students’ inclination towards FAIR training. It explicates safe space, participation, motivation, usefulness, and satisfaction as the five highest-rated learning factors in FAIR training.

https://doi.org/10.21203/rs.3.rs-3409769/v1

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Dissemination Effect of Data Papers on Scientific Datasets"


This study aims to investigate the citation practices associated with data papers and to explore the role of data papers in disseminating scientific datasets. . . . The findings indicate a consistent growth in the number of biomedical data journals published in recent years, with data papers gaining attention and recognition as both publications and data sources. Although the use of data papers as citation sources for data remains relatively rare, there has been a steady increase in data paper citations for data utilization through formal data citations.

https://doi.org/10.1002/asi.24843

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Repository Staff Perspectives on the Benefits Of Trustworthy Digital Repository Certification"


This paper reports on the results from a qualitative study that asks whether and how staff members from TRAC certified repositories find value in the audit and certification process. While some interviewees found certification valuable, others argued that the costs outweighed the benefits or expressed ambivalence towards certification. Findings indicate that TRAC certification offered both internal and external benefits, such as improved documentation, accountability, transparency, communication, and standards, but there were concerns about high costs, implementation problems, and lack of objective evaluation criteria.

https://tinyurl.com/bddmuwjy

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"FAIR EVA: Bringing Institutional Multidisciplinary Repositories into the FAIR Picture"


The FAIR Principles are a set of good practices to improve the reproducibility and quality of data in an Open Science context. Different sets of indicators have been proposed to evaluate the FAIRness of digital objects, including datasets that are usually stored in repositories or data portals. However, indicators like those proposed by the Research Data Alliance are provided from a high-level perspective that can be interpreted and they are not always realistic to particular environments like multidisciplinary repositories. This paper describes FAIR EVA, a new tool developed within the European Open Science Cloud context that is oriented to particular data management systems like open repositories, which can be customized to a specific case in a scalable and automatic environment. It aims to be adaptive enough to work for different environments, repository software and disciplines, taking into account the flexibility of the FAIR Principles. As an example, we present DIGITAL.CSIC repository as the first target of the tool, gathering the particular needs of a multidisciplinary institution as well as its institutional repository.

https://doi.org/10.1038/s41597-023-02652-8

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Understanding the Value of Curation: A Survey of Researcher Perspectives of Data Curation Services from Six Us Institutions"


Data curation encompasses a range of actions undertaken to ensure that research data are fit for purpose and available for discovery and reuse, and can help to improve the likelihood that data is more FAIR (Findable, Accessible, Interoperable, and Reusable). The Data Curation Network (DCN) has taken a collaborative approach to data curation, sharing curation expertise across a network of partner institutions and data repositories, and enabling those member institutions to provide expert curation for a wide variety of data types and discipline-specific datasets. This study sought to assess the satisfaction of researchers who had received data curation services, and to learn more about what curation actions were most valued by researchers. By surveying researchers who had deposited data into one of six academic generalist data repositories between 2019–2021, this study set out to collect feedback on the value of curation from the researchers themselves. A total of 568 researchers were surveyed; 42% (238) responded. Respondents were positive in their evaluation of the importance and value of curation, indicating that the participants not only value curation services, but are largely satisfied with the services provided. An overwhelming majority 97% of researchers agreed that data curation adds value to the data sharing process, 96% agreed it was worth the effort, and 90% felt more confident sharing their data due to the curation process. We share these results to provide insights into researchers’ perceptions and experience of data curation, and to contribute evidence of the positive impact of curation on repository depositors. From the perspective of researchers we surveyed, curation is worth the effort, increases their comfort with data sharing, and makes data more findable, accessible, interoperable, and reusable.

https://doi.org/10.1371/journal.pone.0293534

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

Digital Preservation Coalition: The Global ‘Bit List’ of Endangered Digital Species


A free-to-access and open resource for digital preservation advocacy, the DPC’s Global Bit List of Endangered Digital Species (or Bit List for short) is a community-sourced list of at-risk digital materials which is revised every two years. Entries to the list are nominated by the community, who are at the forefront of digital preservation efforts, and reviewed by international organizations which represent global expertise in the preservation of the listed digital species.

https://tinyurl.com/jevyn9mj

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"The Impacts of Changes in Journal Data Policies: A Cross-disciplinary Survey"


This discipline-specific survey of journal DSP and SMP highlighted the increasing adoption rates and rankings of DSP over time. Furthermore, the findings suggest that DSP adoption may have a notable impact on the increase in JIF. The adoption of DSP by journals may be associated with the increased attention and credibility of the articles.

https://doi.org/10.1002/pra2.924

| Research Data Publication and Citation Bibliography | Research Data Sharing and Reuse Bibliography | Research Data Curation and Management Bibliography | Digital Scholarship |

"Results of the 2022 NDSA Web Archiving Survey Report Now Available"


The Web Archiving Survey Working Group is excited to announce the publication of the results of the 2022 Web Archiving Survey. The 2022 survey builds upon surveys previously conducted in 2017, 2016, 2013, and 2011 — and though previous surveys were focused on the United States, the 2022 survey was open to international audiences as well. The 26-question survey was completed by 190 respondents over a 10-day period. This report details the outcomes of the web archiving survey that was distributed to various local, national, and international professional organizations and topical groups in October 2022. Topics discussed included archiving policies, tools and services, and access and discovery.

https://tinyurl.com/mpjjdfse

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"How Can Open Data Sharing Policies Be More Attentive to Qualitative Researchers?"


The expected and prescriptive ways of preparing data are a key part of the problem. These are governed largely by quantitative data management strategies. Qualitative data is the outcome of personal interactions between researchers and participants. Yet, data sharing guidance is seldom attentive to the co-constructed nature of qualitative material. "The identities of researchers and what they reflexively reveal of themselves, how they interact with participants, their techniques and approaches and the messiness of qualitative work are laid bare within the artefacts of qualitative data" (Weller 2023: 9). This can make researchers especially vulnerable to personal and professional scrutiny in a way that survey and other quantitative researchers are not.

https://tinyurl.com/2fpr82vr

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

Paywall: "The New Information Retrieval Problem: Data Availability"


In this paper, we discuss a method for exploring and locating datasets made available by scientists from federally funded projects in the US. The data pathways method was tested on federal awards. Here we describe the method and the results from analyzing fifty federal awards granted by the National Science Foundation to pursue data resources and their availability in publications, data repositories, or institutional repositories. The data pathways approach contributes to the development of a practical approach on availability that captures the current ways in which data are accessible from federally funded science projects –ranging from institutional repositories, journal data deposit, PI and project web pages, and science data platforms, among other found possibilities

https://doi.org/10.1002/pra2.796

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

Paywall: "’Garbage Bags Full of Files: Exploring Sociotechnical Perceptions of Formats within the Recovery and Reuse of Scientific Data"


This paper explores the social and technical perceptions of physical and digital formats as they relate to work in the recovery and reuse of scientific data, specifically historical, archival, and defunct data sources. . . . Based on 23 qualitative interviews with practitioners conducting data recovery and reuse, ranging from marine biologists to data librarians, we study how they understand, engage with, and utilize formats within their data curation work. . . . The paper focuses on practitioner perceptions of formats around the following themes: how practitioners’ historical relationships to certain challenging formats inform their ongoing curation practices; the importance of contexts in prioritizing or ignoring formats within scientific curation work; and how formats reveal larger sociotechnical issues.

https://doi.org/10.1002/pra2.798

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Conceptualizing Slow Curation"


The pressure to do things quickly is a constant in our professional lives as data curators. But what if we slowed down our work? Taking inspiration from the Slow Movement and its various sub-genres, we propose the idea of Slow Curation, specifically for the application of curating research data. Data curation is the process of reviewing datasets to prepare them for sharing, use, and reuse. We have identified areas where Slow can be embraced in the Data Curation Network’s CURATE(D) model. We also advocate for a few ways, outside of curation, where data curators can collaboratively advocate for Slowing down our work for the better. Join us as we practice radical self-care and advocate for our communities by embracing Slow and easing our way out of Busy culture.

https://doi.org/10.7191/jeslib.740

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Implementation of a Federated Information System by Means of Reuse of Research Data Archived in Research Data Repositories"


At universities, research data is increasingly stored in research data repositories according to a data management plan (DMP) and thus made available for further use. The challenge of reusing hundreds, thousands, or millions of data sets is to obtain an overview of the data in a short period of time and to search through all the data. The high variability of the formats used to store research data requires a new approach to data reusability that focuses on the visualisation and searchability of archived research data, which can also be combined with each other. In this article, we present a practical DMP that describes how information systems can be created on demand by reusing research data archived in research data repositories and how these systems can be merged into a federated information system. As a result, in our projects, information systems have been created in minutes or a couple of hours with few resources. The initial effort to create a federated system remains; however, this allows federated searches to be performed. Extending a federated system to include other information systems can then be accomplished by making a few configurations and manageable adjustments to the source code.

https://doi.org/10.5334/dsj-2023-039

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Connecting Fragmented Support on Campus: Growing Research Data Services Programs Through Collaboration"


Research data services are provided by multiple units across and beyond the library, which is why communication and collaboration are paramount to building support for researchers. By exploring how Research Data Services (RDS) programs can function in the fragmented landscape of research support on campuses, we outline the role of collaboration in building programs. In this paper, we discuss building an RDS program by emphasizing three strategies for collaboration: collaborating within the library, collaborating across campus, and collaborating externally with those without direct ties to your organization. The aim of this paper is to offer attainable examples and strategies for building collaborations across campuses for libraries that have small or nascent RDS programs—how to approach and cultivate partnerships, how to set realistic goals, and how to work holistically within the fragmented academy.

https://tinyurl.com/9hbz49df

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

Paywall: "DMPFrame: A Conceptual Metadata Framework for Data Management Plans"


We have examined 12 open-source DMP tools, in particular, to evaluate the metadata adopted by these tools. The current study spots and highlights the gaps in the DMP metadata management in DMP tools and suggests DMPFrame as a conceptual framework addressing such gaps to improve the existing tools’ DMP metadata management practices. Based on the examined DMP tool’s metadata elements analysis and mapping, DMPFrame manages DMP metadata under 6 categories, namely, contributors, project, funding, organization, DMP, and output. The current study also suggests a systematic workflow that DMP tools could incorporate for metadata creation for DMPs.

https://doi.org/10.1080/19386389.2023.2268474

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"The Rise of Open Science: Tracking the Evolution and Perceived Value of Data and Methods Link-Sharing Practices"


In recent years, funding agencies and journals increasingly advocate for open science practices (e.g. data and method sharing) to improve the transparency, access, and reproducibility of science. However, quantifying these practices at scale has proven difficult. In this work, we leverage a large-scale dataset of 1.1M papers from arXiv that are representative of the fields of physics, math, and computer science to analyze the adoption of data and method link-sharing practices over time and their impact on article reception. To identify links to data and methods, we train a neural text classification model to automatically classify URL types based on contextual mentions in papers. We find evidence that the practice of link-sharing to methods and data is spreading as more papers include such URLs over time. Reproducibility efforts may also be spreading because the same links are being increasingly reused across papers (especially in computer science); and these links are increasingly concentrated within fewer web domains (e.g. Github) over time. Lastly, articles that share data and method links receive increased recognition in terms of citation count, with a stronger effect when the shared links are active (rather than defunct). Together, these findings demonstrate the increased spread and perceived value of data and method sharing practices in open science.

https://arxiv.org/abs/2310.03193

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

Kristin Briney: The Research Data Management Workbook


The Research Data Management Workbook is made up of a collection of exercises for researchers to improve their data management. The Workbook contains exercises across the data lifecycle, though the range of activities is not comprehensive. Instead, exercises focus on discrete practices within data management that are structured and can be reproduced by any researcher.

The book is divided into chapters, loosely by phases of the data lifecycle, with one or more exercises in each chapter. Every exercise comes with a description of its value within data management, instructions on how to do the exercise, original source of the exercise (when applicable), and the exercise itself.

https://tinyurl.com/2p8sk5xd

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Data Curation in Interdisciplinary and Highly Collaborative Research"


This paper provides a systematic analysis of publications that discuss data curation in interdisciplinary and highly collaborative research (IHCR). Using content analysis methodology, it examined 159 publications and identified patterns in definitions of interdisciplinarity, projects’ participants and methodologies, and approaches to data curation. The findings suggest that data is a prominent component in interdisciplinarity. In addition to crossing disciplinary and other boundaries, IHCR is defined as curating and integrating heterogeneous data and creating new forms of knowledge from it. Using personal experiences and descriptive approaches, the publications discussed challenges that data curation in IHCR faces, including an increased overhead in coordination and management, lack of consistent metadata practices, and custom infrastructure that makes interoperability across projects, domains, and repositories difficult. The paper concludes with suggestions for future research.

https://doi.org/10.2218/ijdc.v17i1.835

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

Scholarly Communication Librarianship and Open Knowledge


The book consists of three parts. Part I offers definitions of scholarly communication and scholarly communication librarianship and provides an introduction to the social, economic, technological, and policy/legal pressures that underpin and shape scholarly communication work in libraries. These pressures, which have framed ACRL’s understanding of scholarly communication for the better part of the past two decades, have unsettled many foundational assumptions and practices in the field, removing core pillars of scholarly communication as it was practiced in the twentieth century. These pressures have also cleared fresh ground, and scholarly communication practitioners have begun to seed the space with values and practices designed to renew and often improve the field. Part II begins with an introduction to "open," the core response to the pressures described in part I. This part offers a general overview of the idea of openness in scholarly communication followed by chapters on different permutations and practices of open, each edited by a recognized expert of these areas with authors of their selection. Amy Buckland edited chapter 2.1, "Open Access." Brianna Marshall edited chapter 2.2, "Open Data." Lillian Hogendoorn edited chapter 2.3, "Open Education." Micah Vandegrift edited chapter 2.4, "Open Science and Infrastructure." Each of them brought on incredible expertise through contributors whom they identified, through both original contributions and repurposing existing openly licensed work, which is something we want to model where possible. Part III consists of twenty-four concise perspectives, intersections, and case studies from practicing librarians and closely related stakeholders, which we hope will stimulate discussion and reflection on theory and implications for practice. In every single case, we’re really excited by the editors and authors and the ideas they bring to the whole. Each contribution features light pedagogical apparatuses like suggested further reading, discussion or reflection prompts, and potential activities. It’s all available for free and openly licensed with a Creative Commons Attribution Non-Commercial (CC BY-NC) license, so anyone is encouraged to grab whatever parts are useful and to adapt and repurpose and improve them to meet specific course goals and student needs within the confines of the license.

https://bit.ly/SCLAOK

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |