Table 1 Findable and Accessible principle assessment checks for the CMS H(\(b\bar{b}\)) Open Dataset.
Metric | Evaluation |
---|---|
F1. (Meta)data are assigned globally unique and persistent identifiers. | |
Identifier Uniqueness: this metric measures whether there is a scheme to uniquely identify the digital resource. | Pass. The DOI for the data (which resolves to a URL29) follows a registered identifier scheme. |
Identifier Persistence: this measures whether there is a policy that describes what the provider will do in the event an identifier scheme becomes deprecated. | Pass. The use of a DOI provide a persistent interoperable identifier. |
F2. Data are described with rich metadata. | |
Machine-readability of Metadata: to meet this metric, a URL to a document containing machine-readable metadata for the digital resource must be provided. | Pass. The URL for the metadata57 in JSON Schema with REST API is available. The use of JSON Schema provides clear human and machine readable documentation. Also, running the URL through the Rich Result Test shows the data page contains rich results. |
Richness of Metadata: data are described with rich metadata | Partially pass. Reviewing the DataCite metadata for the DOI shows a fairly sparse record. The metadata can be improved with richer fields. |
F3. Metadata clearly and explicitly include the identifier of the data they describe. | |
Resource Identifier in Metadata: this measures if the metadata document contains the identifier for the digital resource that meets F1 principle. | Pass. The association between the metadata and the dataset is made explicit because the dataset’s globally unique and persistent identifier can be found in the metadata. Specifically, the DOI is a top-level and a mandatory field in the metadata record. |
F4. (Meta)data are registered or indexed in a searchable resource | |
Index in a searchable resource: this measures the degree to which the digital resource can be found using web-based search engines | Pass. The dataset is indexed by Google Dataset Search engine. |
A1. (Meta)data are retrievable by their identifier using a standardized communications protocol | |
A1.1: The protocol is open, free and universally implementable | |
Access Protocol: it measures whether the URL is open access and free. | Pass. HTTP get on the identifier’s URL returns a valid document |
A1.2. The protocol allows for an authentication and authorization where necessary | |
Access Authorization: it requires specification of a protocol to access restricted content. | Pass. This is an open dataset, accessible to everyone on the internet. The data is non-profit and privacy-unrelated, so no access authorization is needed. |
A2. Metadata should be accessible even when the data is no longer available | |
Metadata Longevity: it requires metadata to be present even in the absence of data | Pass. Metadata is stored separately in the CERN Open Data server. As per FAIR Principle F3, this metadata remains discoverable, even in the absence of the data, because it contains an explicit reference to the DOI of the data. Data and metadata will be retained for the lifetime of the repository. The host laboratory CERN, currently plans to support the repository for at least the next 20 years. |