Table 1 Description of DeepPatentAI dataset.

From: A Global Dataset Mapping the AI Innovation from Academic Research to Industrial Patents

Features

Data type

Description

ID

Int

Sequential identifier ranked from 1 to 2,356,204

PN

String

The unique patent number

IPC

String

The International Patent Classification (IPC) code

Title

String

The title of the patent

Abstract

String

The abstract summarizing the patent

Year

Int

The year in which the patent application was filed

Keywords

String

A JSON-formatted list of keywords

Novelty

Float

A numerical indicator quantifying the patent Innovation