Research Data Curation Bibliography, Version 10 | Digital Curation and Digital Preservation Works | Open Access Works | Digital Scholarship | Digital Scholarship Sitemap
B. Ian Hutchins et al. have published "The NIH Open Citation Collection: A Public Access, Broad Coverage Resource" in PLoS Biology.
Here's an excerpt:
Citation data have remained hidden behind proprietary, restrictive licensing agreements, which raises barriers to entry for analysts wishing to use the data, increases the expense of performing large-scale analyses, and reduces the robustness and reproducibility of the conclusions. For the past several years, the National Institutes of Health (NIH) Office of Portfolio Analysis (OPA) has been aggregating and enhancing citation data that can be shared publicly. Here, we describe the NIH Open Citation Collection (NIH-OCC), a public access database for biomedical research that is made freely available to the community. This dataset, which has been carefully generated from unrestricted data sources such as MedLine, PubMed Central (PMC), and CrossRef, now underlies the citation statistics delivered in the NIH iCite analytic platform. We have also included data from a machine learning pipeline that identifies, extracts, resolves, and disambiguates references from full-text articles available on the internet. Open citation links are available to the public in a major update of iCite (
Richard Grunzke et al. have published "The MASi Repository Service—Comprehensive, Metadata-Driven and Multi-Community Research Data Management" in Future Generation Computer Systems.
Here's an excerpt:
Here, we present the architecture and developments of the Metadata Management for Applied Sciences (MASi) project that is currently building a comprehensive research data management service. MASi extends the existing KIT Data Manager framework by a generic metadata programming interface and a generic graphical web interface. Furthermore, MASi is OAI compliant and supports the OAI-PMH protocol while providing support for provenance information using ProvONE, a well-established and accepted provenance model. To illustrate the practical applicability of the MASi service, we present the adoption of initial use cases within geography, chemistry and digital humanities.
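The excerpt notes that MASi supports the OAI-PMH protocol. As a rough illustration of what an OAI-PMH harvesting request looks like, the sketch below builds a request URL from a verb and its arguments. The endpoint `https://repository.example.org/oai` and the helper name `build_oai_pmh_url` are hypothetical and not taken from the MASi paper; `ListRecords` and the `oai_dc` metadata prefix come from the OAI-PMH specification itself.

```python
from urllib.parse import urlencode


def build_oai_pmh_url(base_url, verb, **arguments):
    """Return an OAI-PMH request URL for the given verb and arguments."""
    # Every OAI-PMH request carries a "verb" parameter; the remaining
    # arguments (metadataPrefix, set, from, until, ...) depend on the verb.
    params = {"verb": verb}
    params.update(arguments)
    return f"{base_url}?{urlencode(params)}"


# Hypothetical endpoint, used only for illustration.
url = build_oai_pmh_url(
    "https://repository.example.org/oai",
    "ListRecords",
    metadataPrefix="oai_dc",
)
print(url)
```

The function only constructs the URL; fetching and parsing the XML response is left out so the sketch makes no assumptions about any particular repository.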
Silvio Peroni and David Shotton have self-archived "OpenCitations."
Here's an excerpt:
OpenCitations is a scholarly infrastructure organization dedicated to open scholarship and the publication of open bibliographic and citation data as Linked Open Data using Semantic Web technologies, to the development of software tools and services that enable convenient access to these open data, and to community advocacy for open citations.
This paper describes OpenCitations and its datasets, tools, services and activities. It introduces the OpenCitations Data Model and the SPAR (Semantic Publishing and Referencing) Ontologies for encoding scholarly bibliographic and citation data in RDF, and OpenCitations' open software of generic applicability for searching, browsing and providing REST APIs over RDF triplestores. It describes Open Citation Identifiers (OCIs), globally unique and persistent identifiers for bibliographic citations, and the OpenCitations OCI Resolution Service that returns bibliographic and citation metadata when queried with an OCI. And it describes the OpenCitations Corpus (OCC), a database of open downloadable bibliographic and citation data harvested from bibliographic references in the scholarly literature and made available in RDF under a Creative Commons public domain dedication. Finally, it outlines the Open Citation Indexes of citation data openly available in third-party bibliographic databases that OpenCitations is currently making available as Linked Open Datasets accessible via its REST API, of which the first and largest is COCI, the OpenCitations Index of Crossref DOI-to-DOI Citations which currently contains over 445 million bibliographic citations.
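To make the Open Citation Identifier concrete: an OCI identifies a citation by pairing an identifier for the citing work with one for the cited work. The sketch below assumes the commonly documented textual form, the prefix `oci:` followed by two sequences of digits separated by a hyphen, and splits an OCI into its two halves. The function name `parse_oci` and the sample value are illustrative only, and the sketch does not call the OCI Resolution Service.

```python
import re

# Assumed OCI shape: "oci:" + digits (citing work) + "-" + digits (cited work).
OCI_PATTERN = re.compile(r"^oci:([0-9]+)-([0-9]+)$")


def parse_oci(oci):
    """Split an OCI into (citing_id, cited_id) strings, or raise ValueError."""
    match = OCI_PATTERN.match(oci)
    if match is None:
        raise ValueError(f"not a well-formed OCI: {oci!r}")
    return match.group(1), match.group(2)


# Illustrative value, not a reference to a specific real citation.
citing, cited = parse_oci("oci:1-18")
print(citing, cited)
```

The identifiers are kept as strings rather than converted to integers, since leading zeros in either half may be significant.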