Table 4 Details of the attributes and values contained in the property dataset.

From: Aluminum alloy compositions and properties extracted from a corpus of scientific manuscripts and US patents

| Attribute | Value Datatype | Description | Notes |
| --- | --- | --- | --- |
| doi | String | Digital Object Identifier of the journal article | |
| name | String | Original table row name | |
| series | Integer (class) | Aluminum alloy series designation, one of: 1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000 (see Table 3). | The ‘series’ value is first based on the alloy composition associated with the same ‘doi’. It is then manually cleaned following validation processing. |
| caption | String | Original table caption | |
| table_extr_AA_des | Integer | AA designation code | Extracted from the original source row name or table caption via text matching where available; otherwise empty. |
| YS | Decimal | Yield strength (MPa) | When available |
| UTS | Decimal | Ultimate tensile strength (MPa) | When available |
| temper | String | Temper designation | When available |
| elong | Decimal | Percent elongation | When available |
| flag | True/False | Alloy undergoes special processing | |
| flag_note | String | Reason for flag | |
  1. The CSV file contains 11 attributes (columns), which are described here along with the datatype of each column's values.
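
As an illustration of how the datatypes listed above might be applied when reading the property file, the following is a minimal sketch using only the Python standard library. The column names and the allowed series values come from the table; everything else is an assumption: the helper names (`parse_row`, `read_property_rows`), the encoding of True/False as the literal text `True`/`False`, the representation of unavailable values as empty cells, and the two sample rows, which are invented placeholders and not records from the dataset.

```python
import csv
import io


def _to_int(text):
    # Tolerates "6000" as well as "6000.0" (assumed possible in an exported CSV).
    return int(float(text))


def _to_bool(text):
    # Assumption: booleans are stored as the text "True"/"False".
    return text.strip().lower() in {"true", "1"}


# Column name -> converter, following the datatypes listed in Table 4.
SCHEMA = {
    "doi": str,
    "name": str,
    "series": _to_int,
    "caption": str,
    "table_extr_AA_des": _to_int,
    "YS": float,
    "UTS": float,
    "temper": str,
    "elong": float,
    "flag": _to_bool,
    "flag_note": str,
}

# Allowed series designations, as listed in the 'series' description.
VALID_SERIES = {1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000}


def parse_row(raw):
    """Convert one CSV row (dict of strings) to typed values; empty cells become None."""
    parsed = {}
    for column, convert in SCHEMA.items():
        text = (raw.get(column) or "").strip()
        parsed[column] = convert(text) if text else None
    if parsed["series"] is not None and parsed["series"] not in VALID_SERIES:
        raise ValueError(f"unexpected series value: {parsed['series']}")
    return parsed


def read_property_rows(handle):
    """Read every row from an open CSV file handle."""
    return [parse_row(row) for row in csv.DictReader(handle)]


# Hypothetical sample rows for demonstration only (not taken from the dataset).
SAMPLE = (
    "doi,name,series,caption,table_extr_AA_des,YS,UTS,temper,elong,flag,flag_note\n"
    "10.0000/placeholder,Alloy A,6000,Tensile properties,6061,276.0,310.0,T6,12.0,False,\n"
    "10.0000/placeholder,Alloy B,7000,Tensile properties,,,572.0,,,True,hypothetical note\n"
)

rows = read_property_rows(io.StringIO(SAMPLE))
```

To apply the same sketch to the actual file, an open file handle (for example from `open(path, newline="")`) would be passed to `read_property_rows` in place of the in-memory sample. Mapping empty cells to `None` keeps the "when available" attributes distinguishable from genuine zero values.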