"Recording Available: ‘Securing Community-Controlled Infrastructure: SPARC’s Plan of Action’"

DuraSpace has released "Recording Available: 'Securing Community-Controlled Infrastructure: SPARC's Plan of Action'."

Here's an excerpt:

In this webinar, Heather shared SPARC's efforts on their community-controlled infrastructure project that further explores this question. Heather highlighted what lead to SPARC conducting a market analysis, which included both a financial analysis and an analysis of strategies of some of the key commercial players in the infrastructure arena, and the implications of those strategies for our community. . . .

The recordings and presentation slides of both webinars are available at https://duraspace.org/webinar/.

Academic Library as Scholarly Publisher Bibliography | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"Supporting FAIR Data Principles with Fedora"

David Wilcox has published "Supporting FAIR Data Principles with Fedora" in LIBER Quarterly.

Here's an excerpt:

Making data findable, accessible, interoperable, and re-usable is an important but challenging goal. From an infrastructure perspective, repository technologies play a key role in supporting FAIR data principles. Fedora is a flexible, extensible, open source repository platform for managing, preserving, and providing access to digital content. Fedora is used in a wide variety of institutions including libraries, museums, archives, and government organizations. Fedora provides native linked data capabilities and a modular architecture based on well-documented APIs and ease of integration with existing applications. As both a project and a community, Fedora has been increasingly focused on research data management, making it well-suited to supporting FAIR data principles as a repository platform. Fedora provides strong support for persistent identifiers, both by minting HTTP URIs for each resource and by allowing any number of additional identifiers to be associated with resources as RDF properties. Fedora also supports rich metadata in any schema that can be indexed and disseminated using a variety of protocols and services. As a linked data server, Fedora allows resources to be semantically linked both within the repository and on the broader web. Along with these and other features supporting research data management, the Fedora community has been actively participating in related initiatives, most notably the Research Data Alliance. Fedora representatives participate in a number of interest and working groups focused on requirements and interoperability for research data repository platforms. This participation allows the Fedora project to both influence and be influenced by an international group of Research Data Alliance stakeholders. This paper will describe how Fedora supports FAIR data principles, both in terms of relevant features and community participation in related initiatives.

Academic Library as Scholarly Publisher Bibliography | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"OpenAIRE and DuraSpace Partner to Support Greater Functionality in the Global Repository Network"

DuraSpace has released "OpenAIRE and DuraSpace Partner to Support Greater Functionality in the Global Repository Network."

Here's an excerpt:

Repositories collectively act as the foundation for Open Science by collecting and providing access to research outputs, and play a key role in the emerging scholarly commons. To that end, OpenAIRE and DuraSpace aim to ensure that repositories are using up-to-date technologies and adopting international standards and protocols. Through this MOU, OpenAIRE and DuraSpace have agreed to work together on a number of aspects to support their common goals. These activities include enabling DSpace systems to comply with OpenAIRE metadata guidelines, gradual adoption of next generation repository functionalities, and working together on standardized methods for measuring and aggregating usage statistics.

Academic Library as Scholarly Publisher Bibliography | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"Facilitating and Improving Environmental Research Data Repository Interoperability"

Corinna Gries et al. have published "Facilitating and Improving Environmental Research Data Repository Interoperability" in Data Science Journal.

Here's an excerpt:

Environmental research data repositories provide much needed services for data preservation and data dissemination to diverse communities with domain specific or programmatic data needs and standards. Due to independent development these repositories serve their communities well, but were developed with different technologies, data models and using different ontologies. Hence, the effectiveness and efficiency of these services can be vastly improved if repositories work together adhering to a shared community platform that focuses on the implementation of agreed upon standards and best practices for curation and dissemination of data. Such a community platform drives forward the convergence of technologies and practices that will advance cross-domain interoperability. It will also facilitate contributions from investigators through standardized and streamlined workflows and provide increased visibility for the role of data managers and the curation services provided by data repositories, beyond preservation infrastructure. Ten specific suggestions for such standardizations are outlined without any suggestions for priority or technical implementation. Although the recommendations are for repositories to implement, they have been chosen specifically with the data provider/data curator and synthesis scientist in mind.

Academic Library as Scholarly Publisher Bibliography | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"Towards Open Access Self Archiving Policies: A Case Study of COAR"

Bijan Kumar Roy et al. have published "Towards Open Access Self Archiving Policies: A Case Study of COAR" in LIBER Quarterly.

Here's an excerpt:

This paper examines Open Access (OA) self archiving policies of different Open Access Repositories (OARs) affiliated to COAR (Confederation of Open Access Repositories) as partner institutes. The process of scrutiny includes three major activities—selection of databases to consult; comparison and evaluation of Open Access policies of repositories listed in the selected databases and attached to COAR group; and critical examination of available self archiving policies of these OA repositories against a set of selected criteria. The above steps lead to reporting the following results: key findings have been identified and highlighted; common practices have been analyzed in relation to the focus of this paper; and a best practice benchmark has been suggested for popularizing and strengthening OARs as national research systems. This paper may help administrators, funding agencies, policy makers and professional librarians in devising institute-specific self archiving policies for their own organizations.

Academic Library as Scholarly Publisher Bibliography | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

Search Results Ranking Using Machine-Learning Algorithms: "Best Match: New Relevance Search for PubMed"

Nicolas Fiorini et al. have published "Best Match: New Relevance Search for PubMed" in PLOS Biology.

Here's an excerpt:

PubMed is a free search engine for biomedical literature accessed by millions of users from around the world each day. With the rapid growth of biomedical literature—about two articles are added every minute on average—finding and retrieving the most relevant papers for a given query is increasingly challenging. We present Best Match, a new relevance search algorithm for PubMed that leverages the intelligence of our users and cutting-edge machine-learning technology as an alternative to the traditional date sort order. The Best Match algorithm is trained with past user searches with dozens of relevance-ranking signals (factors), the most important being the past usage of an article, publication date, relevance score, and type of article. This new algorithm demonstrates state-of-the-art retrieval performance in benchmarking experiments as well as an improved user experience in real-world testing (over 20% increase in user click-through rate). Since its deployment in June 2017, we have observed a significant increase (60%) in PubMed searches with relevance sort order: it now assists millions of PubMed searches each week. In this work, we hope to increase the awareness and transparency of this new relevance sort option for PubMed users, enabling them to retrieve information more effectively.

Academic Library as Scholarly Publisher Bibliography | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"IMLS Funds DuraSpace Fedora Investigation–Designing a Migration Path: Assessing Barriers of Upgrading to the Most Current Version of Fedora–No Collection Left Behind"

DuraSpace has released "IMLS Funds DuraSpace Fedora Investigation–Designing a Migration Path: Assessing Barriers of Upgrading to the Most Current Version of Fedora–No Collection Left Behind."

Here's an excerpt:

The Institute of Museum and Library Services has awarded DuraSpace a National Digital Platform Planning Grant for $49,279 to investigate barriers to upgrading hundreds of U.S.-based libraries and archives running unsupported versions of Fedora. In consultation with stakeholders this project will conduct an environmental scan of relevant community initiatives, and gather primary research data to inform recommendations to reduce barriers to upgrading to the most current version of Fedora.

There are approximately 240 U.S.-based libraries and archives identified as target beneficiaries of the deliverables of this project including universities, liberal arts colleges, and not-for-profit special libraries hosted by historical societies and small research institutes.

Academic Library as Scholarly Publisher Bibliography | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"HydraDAM2: Extending Fedora 4 and Hydra for Media Preservation"

Jon W. Dunn et al. have self-archived "HydraDAM2: Extending Fedora 4 and Hydra for Media Preservation."

Here's an excerpt:

The overarching goal of the HydraDAM2 project, funded by a grant from the National Endowment for the Humanities Preservation and Access Research and Development program, was to extend the existing HydraDAM digital asset management system, developed with prior NEH support, to be able to serve as a digital preservation repository for time-based media collections implementable at a wide range of institutions using multiple digital storage strategies. The new open source digital preservation repository system developed as part of the project by partners Indiana University (IU) and WGBH, known as Phydo, is based on the Fedora 4.x digital repository system and Samvera (formerly Hydra) repository application development framework and is intended to support storage and long-term preservation management of audio and video files and their accompanying metadata. This white paper describes the work of the HydraDAM2 project to develop the Phydo system, along with future plans.

Academic Library as Scholarly Publisher Bibliography | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

Cornell University Library Repository Principles and Strategies Handbook

Erin Faulder et al. have self-archived the "Cornell University Library Repository Principles and Strategies Handbook."

Here's an excerpt:

The handbook provides support for both new and existing repository managers, comprising both recommended practices and specifically identified action steps that will allow them to track their progress and identify gaps. Each section of the handbook covers a different strategic area of repository management, standing largely on its own and linking to other sections when appropriate. Although there is no primary section order, we recommend starting with Defining Repository Scope and Service Planning. The handbook specifically addresses principles and practices pertaining to digital repositories, where a digital repository can be defined as: a system, the purpose of which is to store, present, and preserve a collection of data for which the library provides services. That is, the term refers specifically to the application as opposed to the content (collections, objects and metadata) within.

Research Data Curation Bibliography, Version 9 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"More Than 1 Million Images Now Publicly Available at library.artstor.org!"

Artstor has released "More Than 1 Million Images Now Publicly Available at library.artstor.org!."

Here's an excerpt:

Artstor has made more than 1 million image, video, document, and audio files from public institutional collections freely available to everyone—subscribers and non-subscribers alike—at library.artstor.org. These collections are being shared by institutions who make their content available via JSTOR Forum, a tool that allows them to catalog, manage, and share digital media collections and make them discoverable to the widest possible audience.

Research Data Curation Bibliography, Version 9 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"Data-Level Metrics Now Available through Make Data Count"

DataONE has released "Data-Level Metrics Now Available through Make Data Count."

Here's an excerpt:

One year into our Sloan funded Make Data Count project, the Make Data Count Team comprising DataONE, California Digital Library and Data Cite are proud to release Version 1 of standardized data usage and citation metrics! . . .

Since the development of our COUNTER Code of Practice for Research Data we have implemented comparable, standardized data usage and citation metrics at Dash (CDL) and DataONE, two project team repositories. . . .

The Make Data Count project team works in an agile "minimum viable product" methodology. This first release has focused on developing a standard recommendation, processing our logs against that Code of Practice [COUNTER Code of Practice for Research Data] to develop comparable data usage metrics, and display of both usage and citation metrics at the repository level.

Research Data Curation Bibliography, Version 9 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

"The Types, Frequencies, and Findability of Disciplinary Grey Literature within Prominent Subject Databases and Academic Institutional Repositories"

Wanda R. Marsolek et al. have published "The Types, Frequencies, and Findability of Disciplinary Grey Literature within Prominent Subject Databases and Academic Institutional Repositories" in the Journal of Librarianship and Scholarly Communication.

Here's an excerpt:

INTRODUCTION In many disciplines grey literature, or works that are more ephemeral in nature and are not typically published through traditional scholarly channels, are heavily used alongside traditional materials and sources. We were interested in the type and frequency of grey literature in subject databases and in North American institutional repositories (IRs) as well as what disciplines use grey literature. METHODS Over 100 subject databases utilized by academic researchers and the IRs of over 100 academic institutions were studied. Document type, search capabilities, and level of curation were noted. RESULTS Grey literature was present in the majority (68%) of the literature databases and almost all IRs (95%) contained grey literature. DISCUSSION Grey literature was present in the subject databases across all broad disciplines including arts and humanities. In these resources the most common types of grey literature were conference papers, technical reports, and theses and dissertations. The findability of the grey literature in IRs varied widely as did evidence of active collection development. CONCLUSION Recommendations include the development of consistent metadata standards for grey literature to enhance searching within individual resources as well as supporting future interoperability. An increased level of collection development of grey literature in institutional repositories would facilitate preservation and increase the findability and reach of grey literature.

Research Data Curation Bibliography, Version 9 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap