Table 2 Details of the attributes and values contained in the composition dataset.

From: Aluminum alloy compositions and properties extracted from a corpus of scientific manuscripts and US patents

Attribute

Value Datatype

Description

Applicability by ‘source’

source

String (class)

The original source of composition information, one of: (full text, table, named, patent)

—

ft_doi_list

String (list of)

Full text DOI list: List containing all DOIs associated with a given composition

full text

table_doi

String

DOI of table’s journal article

table

name

String

Determined by source: (named: Four-digit identifier code designated by AA; table: Original source table row name; patent: Patent publication number)

named, table, patent

table_extr_AA_des

Integer

Table-extracted AA designation: AA designation code (extracted from original source table row name or table caption via text matching digits of format 'XXXX')

table

comp_rule_based_series

Integer (class)

Composition rule-based series: Aluminum alloy series, assigned by applying a set of rules (based on Table 3) to the alloy’s composition

all

<element>

Decimal

Percent weight of this <element> within the Al alloy

all

  1. The csv file contains 6 descriptive attributes (columns) in addition to the element composition columns indicating the weight percent within the alloy.