Advancing Ireland’s Open Repository Landscape: A Strategic Roadmap


This report presents an in-depth analysis and strategic roadmap for advancing the open repository landscape in Ireland. Drawing on comprehensive data from interviews, surveys, self-assessments, and both national and international initiatives, the document outlines the current status, challenges, and future prospects for open repositories in Ireland. Key findings highlight significant advancements in open access adherence. Despite these successes, persistent challenges such as metadata quality, resource limitations, and sustainability issues underscore the need for concerted effort and strategic planning.

The report proposes a forward-looking roadmap spanning from 2025 to 2030 and beyond, prioritising the enhancement of repository infrastructures, metadata quality improvement, open mandates promotion, technological advancements, capacity building, and fostering collaborative partnerships. This strategic vision aims to develop and encourage Ireland’s transition to open research, leveraging innovative practices and collaborative efforts to facilitate a more open, inclusive, and sustainable research environment. By addressing current limitations and embracing future opportunities, the roadmap sets the stage for a transformative shift in Ireland’s scholarly communication landscape, with potential significant impacts on researchers, institutions, and society at large.

https://doi.org/10.5281/zenodo.10810233

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Platformisation of Science: Conceptual Foundations"


The digital platforms we are dealing with in this article are auxiliary tools that do not produce anything themselves but provide an infrastructure for service providers and users to meet. They have potentially unlimited scaling potential and have become the central places of exchange. In academia, we can also observe that research and its communication become more digital and that digital services are aiming to become platforms. In this article we explore the concept of digital platforms and their potential impact on academic research, firstly addressing the question: To what extent can digital platforms be understood as a specific type of research infrastructure? We draw from recent literature on platforms and platformisation from different streams of scholarship and relate them to the science studies concept of research infrastructures, to eventually arrive at a framework for science platforms. Secondly, we aim to assess how science platforms may affect scholarly practice. Thirdly, we aim to assess to what extent science is platformised and how this interferes with scientific understandings of quality and autonomy. At the end of this article, we argue that the potential benefits of platform infrastructure for academic pursuits cannot be ignored, but the commercialization of the infrastructure for scholarly communication is a cause for concern. Ultimately, a nuanced and well-informed perspective on the impact of platformisation on academia is necessary to ensure that the academic community can maximize the benefits of digital infrastructures while mitigating negative consequences.

http://dx.doi.org/10.53377/lq.16693

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Enhancing the FAIRness of Arctic Research Data Through Semantic Annotation"


The National Science Foundation’s Arctic Data Center is the primary data repository for NSF-funded research conducted in the Arctic. There are major challenges in discovering and interpreting resources in a repository containing data as heterogeneous and interdisciplinary as those in the Arctic Data Center. This paper reports on advances in cyberinfrastructure at the Arctic Data Center that help address these issues by leveraging semantic technologies that enhance the repository’s adherence to the FAIR data principles and improve the Findability, Accessibility, Interoperability, and Reusability of digital resources in the repository. We describe the Arctic Data Center’s improvements. We use semantic annotation to bind metadata about Arctic data sets with concepts in web-accessible ontologies. The Arctic Data Center’s implementation of a semantic annotation mechanism is accompanied by the development of an extended search interface that increases the findability of data by allowing users to search for specific, broader, and narrower meanings of measurement descriptions, as well as through their potential synonyms. Based on research carried out by the DataONE project, we evaluated the potential impact of this approach, regarding the accessibility, interoperability, and reusability of measurement data. Arctic research often benefits from having additional data, typically from multiple, heterogeneous sources, that complement and extend the bases – spatially, temporally, or thematically – for understanding Arctic phenomena. These relevant data resources must be ‘found’, and ‘harmonized’ prior to integration and analysis. The findings of a case study indicated that the semantic annotation of measurement data enhances the capabilities of researchers to accomplish these tasks.

https://doi.org/10.5334/dsj-2024-002

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"HERITRACE: Tracing Evolution and Bridging Data for Streamlined Curatorial Work in the GLAM Domain"


HERITRACE is a semantic data management system tailored for the GLAM sector. It is engineered to streamline data curation for non-technical users while also offering an efficient administrative interface for technical staff. The paper compares HERITRACE with other established platforms such as OmekaS, Semantic MediaWiki, Research Space, and CLEF, emphasizing its advantages in user friendliness, provenance management, change tracking, customization capabilities, and data integration. The system leverages SHACL for data modeling and employs the OpenCitations Data Model (OCDM) for provenance and change tracking, ensuring a harmonious blend of advanced technical features and user accessibility. Future developments include the integration of a robust authentication system and the expansion of data compatibility via the RDF Mapping Language (RML), enhancing HERITRACE’s utility in digital heritage management.

https://arxiv.org/abs/2402.00477

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Do Disappearing Data Repositories Pose a Threat to Open Science and the Scholarly Record? "


Only little more than half of the research data repositories in the sample have detailed strategies they use to mitigate data loss. It is important to note that none of the strategies analysed offers a permanent solution; instead, infrastructure maintenance requires continuous efforts. The burden of infrastructure maintenance and data preservation is currently placed on individual repositories alone; preservation systems comparable to those for scholarly texts, such as CLOCKSS, are not widely spread and can be difficult to realise.

http://tinyurl.com/3snrhxpk

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Decades of Transformation: Evolution of the NASA Astrophysics Data System’s Infrastructure"


The NASA Astrophysics Data System (ADS) is the primary Digital Library portal for researchers in astronomy and astrophysics. Over the past 30 years, the ADS has gone from being an astronomy-focused bibliographic database to an open digital library system supporting research in space and (soon) earth sciences. This paper describes the evolution of the ADS system, its capabilities, and the technological infrastructure underpinning it.

https://arxiv.org/abs/2401.09685

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

Current State and Future Directions for Open Repositories in Europe


In January 2023, OpenAIRE, LIBER, SPARC Europe, and COAR launched a joint strategy aimed at strengthening the European repository network. As a first step, a survey of the European repository landscape was undertaken in February-March 2023. The survey found that, collectively, European repositories acquire, preserve and provide open access to tens or possibly hundreds of millions of valuable research outputs and represent critical, not-for-profit infrastructure in the European open science landscape. They are used for sharing articles that may be pay-walled in published journals, but also for providing access to a large variety of other types of research outputs including research data, theses/dissertations, conference papers, preprints, code, and so on.

However, in order to ensure the European repository network is fit for purpose and able to support the evolving needs of the research community, the survey also identified three areas in particular that could be strengthened: maintaining up-to-date, highly functioning software platforms; applying consistent and comprehensive good practices in terms of metadata, preservation, and usage statistics; and gaining appropriate visibility in the scholarly ecosystem.

Despite the challenges, the current climate offers exciting opportunities for repositories. Many funders are actively promoting the repository route for articles because of their role in supporting equitable access to content (i.e. no fees to access or deposit). The value proposition for open science is growing and repositories are increasingly recognised as the main mechanism for collecting and providing access to a wide range of other research outputs. Add to this, the nascent, but growing, interest in the publish-review-curate model in which repositories have a central function, and it seems they are well placed to expand their current role in the ecosystem.

https://doi.org/10.5281/zenodo.10255559

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Jupyter Notebooks and Institutional Repositories: A Landscape Analysis of Realities, Opportunities and Paths Forward"


Jupyter Notebooks are important outputs of modern scholarship, though the longevity of these resources within the broader scholarly record is still unclear. Communities and their creators have yet to holistically understand creation, access, sharing and preservation of computational notebooks, and such notebooks have yet to be designated a proper place among institutional repositories or other preservation environments as first class scholarly digital assets. Before this can happen, repository managers and curators need to have the appropriate tools, schemas and best practices to maximize the benefit of notebooks within their repository landscape and environments.

This paper explores the landscape of Jupyter notebooks today, and focuses on the opportunities and challenges related to bringing Jupyter Notebooks into institutional repositories. We explore the extent to which Jupyter Notebooks are currently accessioned into institutional repositories, and how metadata schemas like CodeMeta might facilitate their adoption. We also discuss characteristics of Jupyter Notebooks created by researchers at the National Center for Atmospheric Research, to provide additional insight into how to assess and accession Jupyter Notebooks and related resources into an institutional repository.

https://journal.code4lib.org/articles/17751

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Islandora for Archival Access and Discovery"


This article is a case study describing the implementation of Islandora 2 to create a public online portal for the discovery, access, and use of archives and special collections materials at the University of Nevada, Las Vegas. The authors will explain how the goal of providing users with a unified point of access across diverse data (including finding aids, digital objects, and agents) led to the selection of Islandora 2 and they will discuss the benefits and challenges of using this open source software. They will describe the various steps of implementation, including custom development, migration from CONTENTdm, integration with ArchivesSpace, and developing new skills and workflows to use Islandora most effectively. As hindsight always provides additional perspective, the case study will also offer reflection on lessons learned since the launch, insights on open-source repository sustainability, and priorities for future development.

https://journal.code4lib.org/articles/17929

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"ResearchGate and American Association for the Advancement of Science (AAAS) Announce New Journal Home Partnership for Science Partner Journals"


AAAS, a leading publisher of cutting-edge research renowned for its Science family of journals, launched its Science Partner Journal (SPJ) program in 2017. Consisting of 14 high-quality, fully open access journals produced in collaboration with international research institutions, foundations, funders, and societies, the SPJ program will now expand its reach through Journal Home on ResearchGate. . . .

ResearchGate will create dedicated journal profiles on the platform that will be prominently featured on all associated articles and touchpoints on ResearchGate, significantly boosting the visibility of these titles with highly relevant authors and readers.

Authors of articles in the SPJs will enjoy the added benefit of having their content automatically added to their profiles on ResearchGate.

https://tinyurl.com/53ehxhzu

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Open Access Movement in the Scholarly World: Pathways for Libraries in Developing Countries"


Open access is a scholarly publishing model that has emerged as an alternative to traditional subscription-based journal publishing. This study explores the adoption of the open access movement worldwide and the role that libraries can play in addressing those factors which are slowing its progress within developing countries. The study has drawn upon both qualitative data from a focused literature review and quantitative data from major open access platforms. The results indicate that while the open access movement is steadily gaining acceptance worldwide, the progress in developing countries within geographical areas such as Africa, Asia and Oceania is quite a bit slower. Two significant factors are the cost of publishing fees and the lack of institutional open access mandates and policies to encourage uptake. The study provides suggested strategies for academic libraries to help overcome current challenges.

https://doi.org/10.1177/01655515231202758

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"US Repository Network Launches Pilot to Enhance Discoverability of Open Access Content in Repositories"


In November, the US Repository Network (USRN) will launch a pilot project aimed at improving the discoverability of articles in repositories. This pilot project involves the use of services from CORE, a not-for-profit aggregator based at Open University in the UK, to evaluate and improve local repository practices. Additional technical support will be provided by Antleaf Ltd.

As part of the project, CORE will aggregate the metadata and full text of articles from a subset of US repositories, allowing them to be findable through a centralized discovery service with prominent links back to the original full text of the repository. At the same time, the project will assess current practices related to metadata quality, the tracking of Open Access deposits, the use of PIDs, technical support for OAI-PMH, and the adoption of more recent protocols, such as FAIR Signposting. At the level of the centralized aggregation, CORE will enrich the existing US metadata with information from its larger international aggregation. A Dashboard service for participating institutions will be provided, enabling them to assess, validate and monitor their practices.

https://tinyurl.com/2utfpvj3

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Repository Staff Perspectives on the Benefits Of Trustworthy Digital Repository Certification"


This paper reports on the results from a qualitative study that asks whether and how staff members from TRAC certified repositories find value in the audit and certification process. While some interviewees found certification valuable, others argued that the costs outweighed the benefits or expressed ambivalence towards certification. Findings indicate that TRAC certification offered both internal and external benefits, such as improved documentation, accountability, transparency, communication, and standards, but there were concerns about high costs, implementation problems, and lack of objective evaluation criteria.

https://tinyurl.com/bddmuwjy

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"FAIR EVA: Bringing Institutional Multidisciplinary Repositories into the FAIR Picture"


The FAIR Principles are a set of good practices to improve the reproducibility and quality of data in an Open Science context. Different sets of indicators have been proposed to evaluate the FAIRness of digital objects, including datasets that are usually stored in repositories or data portals. However, indicators like those proposed by the Research Data Alliance are provided from a high-level perspective that can be interpreted and they are not always realistic to particular environments like multidisciplinary repositories. This paper describes FAIR EVA, a new tool developed within the European Open Science Cloud context that is oriented to particular data management systems like open repositories, which can be customized to a specific case in a scalable and automatic environment. It aims to be adaptive enough to work for different environments, repository software and disciplines, taking into account the flexibility of the FAIR Principles. As an example, we present DIGITAL.CSIC repository as the first target of the tool, gathering the particular needs of a multidisciplinary institution as well as its institutional repository.

https://doi.org/10.1038/s41597-023-02652-8

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Where Is All the Research Software? An Analysis of Software in UK Academic Repositories"


This research examines the prevalence of research software as independent records of output within UK academic institutional repositories (IRs). There has been a steep decline in numbers of research software submissions to the UK’s Research Excellence Framework from 2008 to 2021, but there has been no investigation into whether and how the official academic IRs have affected the low return rates. In what we believe to be the first such census of its kind, we queried the 182 online repositories of 157 UK universities. Our findings show that the prevalence of software within UK Academic IRs is incredibly low. Fewer than 28% contain software as recognised academic output. Of greater concern, we found that over 63% of repositories do not currently record software as a type of research output and that several Universities appeared to have removed software as a defined type from default settings of their repository. We also explored potential correlations, such as being a member of the Russell group, but found no correlation between these metadata and prevalence of records of software. Finally, we discuss the implications of these findings with regards to the lack of recognition of software as a discrete research output in institutions, despite the opposite being mandated by funders, and we make recommendations for changes in policies and operating procedures.

https://doi.org/10.7717/peerj-cs.1546

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

Zenodo Mandates Document Submission to a Community and Peer Review


What’s more, content submitted to Zenodo would be published automatically within the repository before and whether or not it was accepted into a community. Now, when a researcher goes to publish their outputs, they must select their community and submit their work for peer review, before it is made public. Community curators will then review the content to see if it fits within the community even have the capability to improve and correct the metadata to ensure that it meets quality standards. Once the metadata is approved, it will then be published in Zenodo and, consequently, integrated into the OpenAIRE Graph.

https://tinyurl.com/5y79y2nb

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Implementation of a Federated Information System by Means of Reuse of Research Data Archived in Research Data Repositories"


At universities, research data is increasingly stored in research data repositories according to a data management plan (DMP) and thus made available for further use. The challenge of reusing hundreds, thousands, or millions of data sets is to obtain an overview of the data in a short period of time and to search through all the data. The high variability of the formats used to store research data requires a new approach to data reusability that focuses on the visualisation and searchability of archived research data, which can also be combined with each other. In this article, we present a practical DMP that describes how information systems can be created on demand by reusing research data archived in research data repositories and how these systems can be merged into a federated information system. As a result, in our projects, information systems have been created in minutes or a couple of hours with few resources. The initial effort to create a federated system remains; however, this allows federated searches to be performed. Extending a federated system to include other information systems can then be accomplished by making a few configurations and manageable adjustments to the source code.

https://doi.org/10.5334/dsj-2023-039

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Disappearing Repositories — Taking an Infrastructure Perspective on the Long-Term Availability of Research Data"


Currently, there is limited research investigating the phenomenon of research data repositories being shut down, and the impact this has on the long-term availability of data. This paper takes an infrastructure perspective on the preservation of research data by using a registry to identify 191 research data repositories that have been closed and presenting information on the shutdown process. The results show that 6.2 % of research data repositories indexed in the registry were shut down. The risks resulting in repository shutdown are varied. The median age of a repository when shutting down is 12 years. Strategies to prevent data loss at the infrastructure level are pursued to varying extent. 44 % of the repositories in the sample migrated data to another repository, and 12 % maintain limited access to their data collection. However, both strategies are not permanent solutions. Finally, the general lack of information on repository shutdown events as well as the effect on the findability of data and the permanence of the scholarly record are discussed.

https://arxiv.org/abs/2310.06712

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"DSpace 7 Benefits: Is It Worth Upgrading?"


With the release of DSpace version 7, a natural question that arises is whether the new version offers enough new functionalities to motivate system administrators to upgrade. This paper briefly describes the most important changes, including new features and bug fixes, included in DSpace 7.4 and prior minor versions. The next parts of this paper explore our estimate that there are several thousand DSpace-based systems globally that will likely have to be upgraded in the near future. The main reason for this need is that older versions of DSpace (including 5.x) have reached the end of their developer support period or are reaching it in mid-2023. Based on our own upgrade experience, we propose suggestions and recommendations on migrating from the previous DSpace 6.3-based environment to the new one in a case study that concludes this article.

https://tinyurl.com/32t7ac9m

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

Paywall: "Proactive Institutional Repository Collection Development Techniques: Archiving Gold Open Access Articles and Metadata Retrieved with Web Scraping"


This article describes a method for copying open access articles and corresponding descriptive metadata from open repositories for archiving in an institutional repository using Beautiful Soup and Selenium as web scraping tools. This method quickly added hundreds of articles to an IR without relying on faculty participation or consulting publisher policies, increasing repository downloads and usage.

https://doi.org/10.1080/01930826.2023.2240190

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Publishers, Internet Archive Agree to Streamline Digital Book-Lending Case"


The proposed order would require the Archive to pay Lagardere SCA’s (LAGA.PA) Hachette Book Group, News Corp’s (NWSA.O) HarperCollins Publishers, John Wiley & Sons (WLY.N) and Bertelsmann SE & Co’s (BTGGg.F) Penguin Random House an undisclosed amount of money if it loses its appeal.

The order would also permanently block the Archive from lending out copies of the publishers’ books without permission, pending the result of the appeal.

https://tinyurl.com/yc5j2vb8

| Research Data Publication and Citation Bibliography | Research Data Sharing and Reuse Bibliography | Research Data Curation and Management Bibliography | Digital Scholarship |

"Record Labels Hit Internet Archive with New $400m+ Copyright Lawsuit"


Record labels including UMG, Capitol and Sony have filed a copyright infringement lawsuit in the United States targeting Internet Archive and founder Brewster Kale, among others. Filed in Manhattan federal court late Friday, the complaint alleges infringement of 2,749 works, recorded by deceased artists, including Frank Sinatra, Billie Holiday, Louis Armstrong and Bing Crosby.

https://tinyurl.com/43b4c3w6

| Research Data Publication and Citation Bibliography | Research Data Sharing and Reuse Bibliography | Research Data Curation and Management Bibliography | Digital Scholarship |

"eLife and PREreview to Enhance the ‘Publish, Review, Curate’ Ecosystem Through Adoption of COAR Notify"


The project will put in place the basic infrastructure and protocols needed for all-round and standardised connections between preprint repositories, community-led preprint review platforms, journals, and preprint review aggregation and curation platforms. The aim is to lower existing technological and cost barriers so that as many of these services as possible can more easily participate in the ‘publish, review, curate’ future for research.

https://tinyurl.com/36emyk9b

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"The Status of Open Access Repositories in the Field of Technology: Insights from OpenDOAR"


The study found that 125 nations contributed a total of 4,045 repositories in the field of research, with the USA leading the list with the most repositories. Maximum repositories were operated by institutions having multidisciplinary approaches. The DSpace and Eprints were the preferred software types for repositories. The preferred upload content by contributors was "research articles" and "electronic thesis and dissertations."

https://doi.org/10.1108/IDD-11-2022-0119

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |