Metadata – Page 5 – DigitalKoans

"FAIR Data Reuse—The Path through Data Citation"

https://doi.org/10.1162/dint_a_00030

Requires ALA Login: "Metadata Revisited: Updating Metadata Profiles and Practices in a Vendor-Hosted Repository"

https://journals.ala.org/index.php/lrts/article/view/6867

"Proper Attribution for Curation and Maintenance of Research Collections: Metadata Recommendations of the RDA/TDWG Working Group"

http://doi.org/10.5334/dsj-2019-054

"Leaked Document on Elsevier Negotiations Sparks Controversy"

https://www.scienceguide.nl/2019/11/leaked-document-on-elsevier-negotiations-sparks-controversy/

"The NIH Open Citation Collection: A Public Access, Broad Coverage Resource"

B. Ian Hutchins et al. have published "The NIH Open Citation Collection: A Public Access, Broad Coverage Resource" in PLoS Biology.

Here's an excerpt:

Citation data have remained hidden behind proprietary, restrictive licensing agreements, which raises barriers to entry for analysts wishing to use the data, increases the expense of performing large-scale analyses, and reduces the robustness and reproducibility of the conclusions. For the past several years, the National Institutes of Health (NIH) Office of Portfolio Analysis (OPA) has been aggregating and enhancing citation data that can be shared publicly. Here, we describe the NIH Open Citation Collection (NIH-OCC), a public access database for biomedical research that is made freely available to the community. This dataset, which has been carefully generated from unrestricted data sources such as MedLine, PubMed Central (PMC), and CrossRef, now underlies the citation statistics delivered in the NIH iCite analytic platform. We have also included data from a machine learning pipeline that identifies, extracts, resolves, and disambiguates references from full-text articles available on the internet. Open citation links are available to the public in a major update of iCite (https://icite.od.nih.gov).

Research Data Curation Bibliography, Version 10 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

Paywall Article: "Old Metadata in a New World: Standardizing the Getty Provenance Index for Linked Data"

https://doi.org/10.1017/alj.2019.24

Paywall Article: "Text Mining and Subject Analysis for Fiction; or, Using Machine Learning and Information Extraction to Assign Subject Headings to Dime Novels"

https://doi.org/10.1080/01639374.2019.1653413

Paywall Article: "Core Metadata Element Recommendations for Institutional Repositories at Texas A&M University Libraries"

https://doi.org/10.1080/19386389.2019.1651499

"Metadata Documentation Practices at ARL Institutional Repositories"

https://preprint.press.jhu.edu/portal/article/metadata-documentation-practices-arl-institutional-repositories

Paywall Article: "Definitions of ‘Metadata’: A Brief Survey of International Standards"

https://asistdl.onlinelibrary.wiley.com/doi/abs/10.1002/asi.24295?af=R&

"The MASi Repository Service—Comprehensive, Metadata-Driven and Multi-Community Research Data Management"

Richard Grunzke et al. have published "The MASi Repository Service—Comprehensive, Metadata-Driven and Multi-Community Research Data Management" in Future Generation Computer Systems.

Here's an excerpt:

Here, we present the architecture and developments of the Metadata Management for Applied Sciences (MASi) project that is currently building a comprehensive research data management service. MASi extends the existing KIT Data Manager framework by a generic metadata programming interface and a generic graphical web interface. Furthermore, MASi is OAI compliant and supports the OAI-PMH protocol while providing support for provenance information using ProvONE, a well-established and accepted provenance model. To illustrate the practical applicability of the MASi service, we present the adoption of initial use cases within geography, chemistry and digital humanities.

"No Need to Ask: Creating Permissionless Blockchains of Metadata Records"

https://doi.org/10.6017/ital.v38i2.10822

"OpenCitations"

Silvio Peroni and David Shotton have self-archived "OpenCitations."

Here's an excerpt:

OpenCitations is a scholarly infrastructure organization dedicated to open scholarship and the publication of open bibliographic and citation data as Linked Open Data using Semantic Web technologies, to the development of software tools and services that enable convenient access to these open data, and to community advocacy for open citations.

This paper describes OpenCitations and its datasets, tools, services and activities. It introduces the OpenCitations Data Model and the SPAR (Semantic Publishing and Referencing) Ontologies for encoding scholarly bibliographic and citation data in RDF, and OpenCitations' open software of generic applicability for searching, browsing and providing REST APIs over RDF triplestores. It describes Open Citation Identifiers (OCIs), globally unique and persistent identifiers for bibliographic citations, and the OpenCitations OCI Resolution Service that returns bibliographic and citation metadata when queried with an OCI. And it describes the OpenCitations Corpus (OCC), a database of open downloadable bibliographic and citation data harvested from bibliographic references in the scholarly literature and made available in RDF under a Creative Commons public domain dedication. Finally, it outlines the Open Citation Indexes of citation data openly available in third-party bibliographic databases that OpenCitations is currently making available as Linked Open Datasets accessible via its REST API, of which the first and largest is COCI, the OpenCitations Index of Crossref DOI-to-DOI Citations which currently contains over 445 million bibliographic citations.

Paywall Article: "Library Provision of Intellectual Access to Open Access Journal Articles"

https://doi.org/10.1080/0361526X.2019.1628161

"Crowdsourcing Open Citations with CROCI—an Analysis of the Current Status of Open Citations, and a Proposal"

https://arxiv.org/abs/1902.02534

Paywall Article: "Transforming the Quality of Metadata in Institutional Repositories"

https://doi.org/10.1080/0361526X.2019.1540270

Paywall Article: "Bridging Identity Challenges: Why and How One Library Plugged ORCID into Their Enterprise"

https://emeraldinsight.com/doi/abs/10.1108/LHT-04-2018-0046?af=R&

"Accuracy of Citation Data in Web of Science and Scopus"

https://arxiv.org/abs/1906.07011

"Software Citation Implementation Challenges"

Daniel S. Katz et al. have self-archived "Software Citation Implementation Challenges."

Here's an excerpt:

The main output of the FORCE11 Software Citation working group was a paper on software citation principles published in September 2016. This paper laid out a set of six high-level principles for software citation (importance, credit and attribution, unique identification, persistence, accessibility, and specificity) and discussed how they could be used to implement software citation in the scholarly community. In a series of talks and other activities, we have promoted software citation using these increasingly accepted principles. At the time the initial paper was published, we also provided guidance and examples on how to make software citable, though we now realize there are unresolved problems with that guidance. The purpose of this document is to provide an explanation of current issues impacting scholarly attribution of research software, organize updated implementation guidance, and identify where best practices and solutions are still needed.

Research Data Curation Bibliography, Version 9 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap

UK: "Digital Description and Metadata at the National Archives. Digital Strategy"

Jone Garmendia has self-archived "Digital Description and Metadata at the National Archives. Digital Strategy."

Here's an excerpt:

Over the last eighteen years, The National Archives of the United Kingdom has delivered a wide range of online catalogues and digital services and is now transforming to deliver an ambitious digital strategy. Our Digital Strategy addresses both the challenge of digital records as well as our goal to become a digital archive by instinct and design. To achieve this goal, we must acknowledge that digital records disrupt archival practice, archival theory and the whole notion of what a professional archivist should be.

"Behind the Scenes of Web Archiving: Metadata of Harvested Websites"

Emmanuel Di Pretoro and Friedel Geeraert have self-archived "Behind the Scenes of Web Archiving: Metadata of Harvested Websites."

Here's an excerpt:

This paper first provides more information about web archiving from a technical point of view before focusing on descriptive metadata in the context of web archiving and the WARC file format. Lastly, the experiments done within the PROMISE project with regard to integrating metadata into the WARC file format are discussed.

Paywall Article: "The Heart of the Cycle: How Can Metadata 2020 Improve Serials Metadata for Scholarly Communications and Research?"

https://www.tandfonline.com/doi/full/10.1080/0361526X.2019.1585169

Paywall Article: "Planting Cedar: An Open Source Linked Data Vocabulary Manager at the University of Houston Libraries"

https://doi.org/10.1080/19386389.2019.1589696

"Too Many Tags Spoil the Metadata: Investigating the Knowledge Management of Scientific Research with Semantic Web Technologies"

Samantha Kanza, Nicholas Gibbins, and Jeremy G. Frey have published "Too Many Tags Spoil the Metadata: Investigating the Knowledge Management of Scientific Research with Semantic Web Technologies" in the Journal of Cheminformatics.

Here's an excerpt:

Previous studies of Electronic Lab Notebooks (ELNs) in academia and industry have identified semantic web technologies as a means for organising scientific documents to improve current workflows and knowledge management practices. In this paper, we present a qualitative, user-centred study of researcher requirements and practices, based on a series of discipline-specific focus groups. We developed a prototype semantic ELN to serve as a discussion aid for these focus groups, and to help us explore the technical readiness of a range of semantic web technologies. While these technologies showed potential, existing tools for semantic annotation were not well-received by our focus groups, and need to be refined before they can be used to enhance current researcher practices.

Paywall Article: "Metadata Quality at Scale: Metadata Quality Control at the Digital Public Library of America"

https://www.ingentaconnect.com/content/hsp/jdmm/2019/00000007/00000002/art00003