"PreprintMatch: A Tool for Preprint to Publication Detection Shows Global Inequities in Scientific Publication"


Preprints, versions of scientific manuscripts that precede peer review, are growing in popularity. They offer an opportunity to democratize and accelerate research, as they have no publication costs or a lengthy peer review process. Preprints are often later published in peer-reviewed venues, but these publications and the original preprints are frequently not linked in any way. To this end, we developed a tool, PreprintMatch, to find matches between preprints and their corresponding published papers, if they exist. This tool outperforms existing techniques to match preprints and papers, both on matching performance and speed. PreprintMatch was applied to search for matches between preprints (from bioRxiv and medRxiv), and PubMed. The preliminary nature of preprints offers a unique perspective into scientific projects at a relatively early stage, and with better matching between preprint and paper, we explored questions related to research inequity. We found that preprints from low income countries are published as peer-reviewed papers at a lower rate than high income countries (39.6% and 61.1%, respectively), and our data is consistent with previous work that cite a lack of resources, lack of stability, and policy choices to explain this discrepancy. Preprints from low income countries were also found to be published quicker (178 vs 203 days) and with less title, abstract, and author similarity to the published version compared to high income countries. Low income countries add more authors from the preprint to the published version than high income countries (0.42 authors vs 0.32, respectively), a practice that is significantly more frequent in China compared to similar countries. Finally, we find that some publishers publish work with authors from lower income countries more frequently than others.

https://doi.org/10.1371/journal.pone.0281659

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Is Writing a Book Chapter Still a Waste of Time?"


How has digital open access transformed academic communication for the better? LSE Press’s Editor in Chief, Patrick Dunleavy, explores the impact of chapters in edited books. Once the Cinderella of academic publishing, doomed to obscurity under paywall books’ formal and de facto access restrictions, chapters in books are, thanks to digital open access, once again rivalling journal articles in their visibility to academic communities, their usefulness as teaching resources, and in their ability to tackle innovative and state-of-the-art topics.

bit.ly/3KYRMq6

"Revisiting Methodology for Identifying Open Access Advantages"


This study revisited the methodology for identifying the effects of open access and revealed the causes for contradictory conclusions using four indices for journals that transitioned from subscription to open access. . . . Although the aggregated data of the eight journals indicated that open access had a positive effect, the effect varied across journals. A few journals produced different results between the two citation scores as well as between citation scores and number of citations or articles. Furthermore, a publisher’s choice of which journal to shift to open access influenced their performance after the shift.

https://doi.org/10.1007/s12109-023-09946-0

"Ten Lessons for Data Sharing with a Data Commons"


A data commons is a cloud-based data platform with a governance structure that allows a community to manage, analyze and share its data. Data commons provide a research community with the ability to manage and analyze large datasets using the elastic scalability provided by cloud computing and to share data securely and compliantly, and, in this way, accelerate the pace of research. Over the past decade, a number of data commons have been developed and we discuss some of the lessons learned from this effort.

https://doi.org/10.1038/s41597-023-02029-x

Paywall: "Open Data and the 2023 NIH Data Management and Sharing Policy"


As the largest public funder of biomedical research in the world, the National Institutes of Health’s (NIH) new Data Management and Sharing (DMS) Policy is a large step toward shifting the culture of medical research toward a broader sharing of scientific data. . . . This article will serve as a primer on open data, data sharing, the NIH’s DMS Policy and its implications, and how librarians can support researchers in this landscape.

https://doi.org/10.1080/02763869.2023.2168103

"Lack of Sustainability Plans for Preprint Services Risks Their Potential to Improve Science"


Despite successfully building a revenue model that shares the burden between Cornell University, the Simons Foundation and several members and supporters, arXiv’s “funding is still outpaced by [their] growth” – the server hosts over 2 million preprints already and is growing by 10% each year. And while arXiv has been supporting more and more scholars to share and discover preprints, the team behind it has been through significant changes in leadership and is dealing with the urgent need to modernize their 30-year-old technology. As a former Executive Director of arXiv noted, “[arXiv’s success] may not last forever”. Similarly, the recent news that Chan Zuckerberg Initiative has renewed its financial support for the leading preprint servers in biology and medicine, bioRxiv and medRxiv is welcome relief, but this support is temporary, and the team must find a way to continue in the long run.

bit.ly/3y745Ji

"The Importance of Copyright and Shared Norms for Credit in Open Educational Resources"


Open Educational Resources (OER) are reducing barriers to education while allowing creators the opportunity to share their work with the world and continue owning copyright of their work. To support new authors and adaptors in the OER space, we provide an overview of common considerations that creators and adaptors of OER should make with respect to issues related to copyright in the context of OER. Further, and importantly, a challenge in the OER space is ensuring that original creators receive appropriate credit for their work, while also respecting the credit of those who have adapted work. Thus, in addition to providing important considerations when it comes to the creation of open access works, we propose shared norms for ensuring appropriate attribution and credit for creators and adaptors of OER.

https://doi.org/10.3389/feduc.2022.1069388

"China and Open Access"


In December 2022, the International Association of STM Publishers and the China Association for Science and Technology (CAST) released a report: Open Access Publishing in China. The report is openly available in both English and Chinese. This interview with Mark Robertson, consultant to the STM Association on the project, highlights the findings of the report and their implications for the scholarly publishing industry as well as providing background on the STM/CAST collaboration.

bit.ly/3kHUuW5

Paywall: "An Investigation of Gold Open Access Publications of STEM Faculty at a Public University in the United States"


This study investigated Gold Open Access journal publication by science and engineering faculty at the authors’ university from 2013 to 2022. Specifically, did Gold Open Access (OA) by these faculty increase, and did the publication rate vary between disciplines? The authors found that Gold OA publication increased by 176% over the past 10 years, and that an important factor was the Libraries’ creation of an Open Access Publishing Fund in 2017.

https://doi.org/10.1080/0194262X.2023.2175103

"The Future of the Monograph in the Arts, Humanities and Social Sciences: Publisher Perspectives on a Transitioning Format"


A web-based survey of academic publishers was undertaken in 2021 by a team at Oxford International Centre for Publishing into the state of monograph publication in the arts, humanities, and social sciences. 25 publishing organisations responded, including many of the larger presses, representing approximately 75% of monograph output. Responses to the survey showed that the Covid 19 pandemic has accelerated the existing trend from print to digital dissemination and that Open Access (OA) titles receive substantially greater levels of usage than those published traditionally. Responses also showed that for most publishers OA publication stands at under 25% of output and that fewer than 10% of authors enquire about OA publication options. Continuing problem areas highlighted by respondents were the clearing of rights for OA publication and the standardisation of title and usage metadata. All responding organisations confirmed that they expect to be publishing monographs in ten years’ time, but that they anticipate the format and/or the model will be different, with open access expected to play a key part in the future, perhaps in the context of a mixed economy of OA and ‘toll access’ publication.

https://doi.org/10.1007/s12109-023-09937-1

"Research Data Management Needs Assessment for Social Sciences Graduate Students: A Mixed Methods Study"


The complexity and privacy issues inherent in social science research data makes research data management (RDM) an essential skill for future researchers. Data management training has not fully addressed the needs of graduate students in the social sciences. To address this gap, this study used a mixed methods design to investigate the RDM awareness, preparation, confidence, and challenges of social science graduate students. A survey measuring RDM preparedness and training needs was completed by 98 graduate students in a school of education at a research university in the southern United States. Then, interviews exploring data awareness, knowledge of RDM, and challenges related to RDM were conducted with 10 randomly selected graduate students. All participants had low confidence in using RDM, but United States citizens had higher confidence than international graduate students. Most participants were not aware of on-campus RDM services, and were not familiar with data repositories or data sharing. Training needs identified for social science graduate students included support with data documentation and organization when collaborating, using naming procedures to track versions, data analysis using open access software, and data preservation and security. These findings are significant in highlighting the topics to cover in RDM training for social science graduate students. Additionally, RDM confidence and preparation differ between populations so being aware of the backgrounds of students taking the training will be essential for designing student-centered instruction.

https://doi.org/10.1371/journal.pone.0282152

"Changes in the Absolute Numbers and Proportions of Open Access Articles from 2000 to 2021 Based on the Web of Science Core Collection: A Bibliometric Study"


Purpose:

The ultimate goal of current open access (OA) initiatives is for library services to use OA resources. This study aimed to assess the infrastructure for OA scholarly information services by tabulating the number and proportion of OA articles in a literature database.

Method:

We measured the absolute numbers and proportions of OA articles at different time points across various disciplines based on the Web of Science (WoS) database.

Results:

The number (proportion) of available OA articles between 2000 and 2021 in the WoS database was 12 million (32.4%). The number (proportion) of indexed OA articles in 1 year was 0.15 million (14.6%) in 2000 and 1.5 million (48.0%) in 2021. The proportion of OA by subject categories in the cumulative data was the highest in the multidisciplinary category (2000–2021, 79%; 2021, 89%), high in natural sciences (2000–2021, 21%–46%; 2021, 41%–62%) and health and medicine (2000–2021, 37%–40%; 2021, 52%–60%), and low in social sciences and others (2000–2021, 23%–32%; 2021, 36%–44%), engineering (2000–2021, 17%–33%; 2021, 31%–39%) and humanities and arts (2000–2021, 11%–22%; 2021, 28%–38%).

Conclusion:

Our study confirmed that increasingly many OA research papers have been published in the last 20 years, and the recent data show considerable promise for better services in the future. The proportions of OA articles differed among scholarly disciplines, and designing library services necessitates several considerations with regard to the customers’ demands, available OA resources, and strategic approaches to encourage the use of scholarly OA articles.

https://doi.org/10.6087/kcse.296

"Librarians and Academic Libraries’ Role in Promoting Open Access: What Needs to Change?"


Profound changes due to Open-Access (OA) publications lead to organizational changes in universities and libraries. This study examined Israeli librarians’ perceptions regarding their role and the academic library’s role in promoting OA-publication, including the barriers, challenges, needs and requirements necessary to promote OA publishing. Lack of a budget for OA-agreements and cooperation with university management, and researchers’ unawareness of OA were among the most prominent barriers. Librarians see great importance in their role of advising researchers regarding OA. However, they insisted on a regulated OA-policy at the national and institutional levels, which would strengthen their status as change-leaders of the OA-movement.

https://doi.org/10.31235/osf.io/shqnv

| Research Data Publication and Citation Bibliography | Research Data Sharing and Reuse Bibliography | Research Data Curation and Management Bibliography | Digital Scholarship |

"Benefits of Open Access (OA) to Researchers from Lower-Income Countries: Tracing Evidence through an Analysis of Reference Patterns"


Making scientific literature freely available to everyone is a main objective of the open access (OA) movement. This may be of particular importance to researchers in lower-income countries, where access to literature is often hindered by high subscription costs. This study addresses this issue by analyzing reference lists of the world’s output of scientific publications over time. The core issues addressed include whether researchers from lower-income countries refer to fewer previous publications when they publish and how this pattern develops over time. Moreover, whether researchers from lower-income countries rely more on literature that is openly available through different OA routes than other researchers is explored. The study shows that the proportion of OA references increases over time for all publications and country groups. However, the main finding is that publications from lower-income countries have a higher growth rate of OA references. This suggests that an increase in OA publishing has been particularly beneficial to researchers in lower-income countries.

https://doi.org/10.31235/osf.io/ecgzh

"How and Why Do Researchers Reference Data? A Study of Rhetorical Features and Functions of Data References in Academic Articles"


Data reuse is a common practice in the social sciences. While published data play an essential role in the production of social science research, they are not consistently cited, which makes it difficult to assess their full scholarly impact and give credit to the original data producers. Furthermore, it can be challenging to understand researchers’ motivations for referencing data. Like references to academic literature, data references perform various rhetorical functions, such as paying homage, signaling disagreement, or drawing comparisons. This paper studies how and why researchers reference social science data in their academic writing. We develop a typology to model relationships between the entities that anchor data references, along with their features (access, actions, locations, styles, types) and functions (critique, describe, illustrate, interact, legitimize). We illustrate the use of the typology by coding multidisciplinary research articles (n=30) referencing social science data archived at the Inter-university Consortium for Political and Social Research (ICPSR). We show how our typology captures researchers’ interactions with data and purposes for referencing data. Our typology provides a systematic way to document and analyze researchers’ narratives about data use, extending our ability to give credit to data that support research.

https://arxiv.org/abs/2302.08477

"Library Futures Releases Policy Paper: Digital Ownership for Libraries and the Public"


In response, Library Futures recommends policymakers adopt an approach of digital ownership that extends the current paradigm for print works and allow libraries to both maintain the benefits of print collections and innovate even further toward providing new methods of access, preservation, and education by creating new lending models, equitizing access for underserved communities, and contributing to a more democratic balance. To that end, we have outlined some approaches to solving this issue through structural, community-based, and technical means:

  • Legal reform: This can include judicial remedies through the courts, legislative action on the part of Congress, or regulatory intervention by an authority such as the Federal Trade Commission.
  • Collective action: Community intervention can be a powerful way to act concertedly to stand against entities that are prohibiting libraries from exercising their rights, such as boycotts and grassroots action, state legislative initiatives, and the collective use of incentives and accountability measures for publishers.
  • Library-owned infrastructure: The library community can build its own infrastructure to ensure that it is oriented towards the needs of their users and provides libraries with the choice to own their digital content. This is not without its challenges (practical and resource-wise), but sustainable infrastructure can put control of digital content back into the hands of libraries and users.

Policy Paper

https://www.libraryfutures.net/post/digital-ownership-for-libraries-and-the-public

Penn State: "University Libraries Expands Open Access Support via 3 New BTAA [Big Ten Academic Alliance] Agreements"


The agreements with Wiley, Institute of Physics (IOP) and Microbiology Society cover OA publishing charges for Penn State corresponding authors publishing in these publishers’ journals. Those qualified articles will be immediately open access on the publisher’s platform. These publishers will offer a choice of open access licenses to Penn State authors publishing in their journals. Authors retain copyright in their articles.

The agreements run for three years from Jan. 1, 2023, to Dec. 31, 2025. In general, articles will need to be accepted during the agreements’ timeframe. The agreements also cover subscriptions and read access to Wiley, Institute of Physics (IOP) and Microbiology Society journals. Unlimited open access publishing is included with no additional cost to individual Penn State authors.

bit.ly/3Sjx9Xa

2.6 Billion Total Downloads: arXiv Annual Report 2022


Our critical priorities during 2022 were to secure additional funding, hire technical and program directors, and ramp up our efforts to modernize arXiv’s software by moving it to the cloud, which will provide better stability, scalability and maintainability. I’m pleased to report that we were able to make significant progress on all of these fronts. arXiv brought in more funding than expected in the form of grants, memberships, and donations, and we hired Stephanie Orphan as program director and Charles Frankston as technical director. Both bring strong and complementary expertise to the team. Moving the technical operations of arXiv—a service with a 30 year history—off of Cornell’s on-premises servers is a major, complicated task. The move to the cloud is currently in progress and on track

bit.ly/41exRsX

"Data Management Librarians Role in a Large Interdisciplinary Scientific Grant for PFAS Remediation: Considerations and Recommendations"


This article explores the conflicts, disparities, and inequalities experienced by two librarians when collaborating on a federal grant proposal. The authors discuss concerns related to time and salary expectations and the inequities that can occur during faculty and staff collaborations on research grants. The bureaucratic structure and the job classifications of staff at academic institutions in addition to the contract limitations of non-faculty status librarian positions can hinder successful collaborations. The authors also describe data management needs that may occur when working with interdisciplinary research teams and detail the type of work that is included in writing a data management grant. This article concludes with considerations and recommendations for other data librarians who may undertake similar projects with a focus on ways to create parity between faculty and staff collaborators.

https://doi.org/10.7191/jeslib.616

"There’s No “I” in Research Data Management: Reshaping RDM Services Toward a Collaborative Multi-Stakeholder Model"


Objective: This article examines a reshaped service model for research data management (RDM) founded on centralized and cohesive collaboration between multiple stakeholders at a large research university in Canada. This initiative, along with a newly formed team dedicated to RDM service provision, is a joint effort by the institution’s Vice-Principal Research and Innovation (VPRI), Library, IT Services, and Research Ethics units.

Methods: This article presents a single case study methodology. The authors reflect on services such as "query the panel" sessions where researchers across all disciplines bring their questions to representatives from the Library, IT, Research Ethics, and VPRI. This case study also highlights the use of Jira’s service desk software as a user management system. The authors also present descriptive statistics representing engagement with this new unit and our services.

Results: Support for RDM requires expertise from multiple domains. With a collaborative approach as a guiding principle and a focus on establishing a small, but agile team comprised of a librarian along with stakeholders from IT and VPRI, it is possible to leverage resources and support for RDM from a broad range of units across an institution.

Conclusions: At many institutions, RDM services are siloed within the library or an adjacent campus unit. New digital technologies have profoundly transformed academic research across all disciplines, necessitating the evolution of corresponding research data-related services. The authors will conclude by outlining specific lessons learned in reshaping digital research infrastructure-related services at their institution.

https://doi.org/10.7191/jeslib.624

"Are Institutional Research Data Policies in the US Supporting the FAIR Principles? A Content Analysis"


Objective: The FAIR principles were created with the goal of enhancing the reusability of research data and to give guidance on how to make data Findable, Accessible, Interoperable and Reusable. In this article we explore the role of institutional research data policies in enabling and encouraging researchers at their institutions to generate FAIR data.

Methods: We identified the research data policies in place for “very high research activity” institutions (as defined by Carnegie classification) in the United States. We created a list of 31 criteria, based on previous work by Davidson et al. (2019) and Briney et al. (2015), and evaluated the 40 policies using a content analysis methodology.

Results: The guiding principles and the definitions for research data in the policies support the idea that institutional policies are a potential tool for the implementation of the FAIR principles. However, our analysis indicates that they are not generally used for that purpose. Only one policy mentions FAIR. Data sharing is mentioned in half of the policies, but 11 of these only note this concept in the context of funder requirements. Access and retention sections are mostly written without considering publicly available data. Twenty-nine policies do not mention data documentation.

Conclusions: We discuss ways in which these institutional policies represent a missed opportunity to implement the FAIR principles and suggest ways policies could be modified to encourage researchers to follow them. We also discuss future research opportunities to examine how policy implementation may affect what institutional support researchers receive.

https://doi.org/10.7191/jeslib.614

"How Open Access Diamond Journals Comply with Industry Standards Exemplified by Plan S Technical Requirements"


Purpose:

This study investigated how well current open access (OA) diamond journals in the Directory of Open Access Journals (DOAJ) and a survey conform to Plan S requirements, including licenses, peer review, author copyright, unique article identifiers, digital archiving, and machine-readable licenses.

Method:

Data obtained from DOAJ journals and surveyed journals from mid-June to mid-July 2020 were analyzed for a variety of Plan S requirements. The results were presented using descriptive statistics.

Results:

Out of 1,465 journals that answered, 1,137 (77.0%) reported compliance with the Committee on Publication Ethics (COPE) principles. The peer review types used by OA diamond journals were double-blind (6,339), blind (2,070), peer review (not otherwise specified, 1,879), open peer review (42), and editorial review (118) out of 10,449 DOAJ journals. An author copyright retention policy was adopted by 5,090 out of 10,448 OA diamond journals (48.7%) in DOAJ. Of the unique article identifiers, 5,702 (54.6%) were digital object identifiers, 58 (0.6%) were handles, and 14 (0.1%) were uniform resource names, while 4,675 (44.7%) used none. Out of 1,619 surveyed journals, the archiving solutions were national libraries (n=170, 10.5%), Portico (n=67, 4.1%), PubMed Central (n=15, 0.9%), PKP PN (n=91, 5.6%), LOCKSS (n=136, 8.4%), CLOCKSS (n=87, 5.4%), the National Computing Center for Higher Education (n=6, 0.3%), others (n=69, 4.3%), no policy (n=855, 52.8%), and no reply (n=123, 7.6%). Article-level metadata deposition was done by 8,145 out of 10,449 OA diamond journals (78.0%) in DOAJ.

Conclusion:

OA diamond journals’ compliance with industry standards exemplified by the Plan S technical requirements was insufficient, except for the peer review type.

https://doi.org/10.6087/kcse.295
