Table 1 Description of dataset projects.

From: Semantic and traditional feature fusion for software defect prediction using hybrid deep learning model

Project

Versions

Description

Avg files

Defects rate (%)

xalan

2.4, 2.6

A Java library for processing XML files.

804

32.3

poi

1.5, 3.0

Java library for accessing Microsoft files.

340

62.1

ant

1.7

A Java-based build code files.

745

22.2

log4j

1.1

A Java-based logging library

109

33.8

jEdit

4.0, 4.1

A text editor built for programmers.

309

24.9

lucene

2.0, 2.2

An open-source text search library.

221

52.9

synapse

1.1, 1.2

Adapters for transmitting data

239

30.5