Table 2 Summary of the benchmark datasets used in the experimental evaluation. Each dataset is characterized by the total number of instances, number of features, number of classes, and imbalance ratio (IR), defined as the ratio of majority to minority samples. The datasets were selected from various domains to ensure diversity in size, dimensionality, and imbalance severity, providing a comprehensive basis for assessing the effectiveness of the proposed Borderline-Shifting Oversampling (BSO) method compared with existing resampling techniques.

From: An approach for handling imbalanced datasets using borderline shifting

Dataset

Features

#instance

IR

#noise

#border

subcl5-a (m1)

2

600

5

16

0

subcl5-b (m2)

2

800

7

2

0

subcl5-c (m3)

2

800

7

0

0

subcl5-d (m4)

2

800

7

2

0

subcl5-e (m5)

2

800

7

4

0

subcl5-f (m6)

2

800

7

9

0

clover5z-a (m7)

2

600

5

9

2

clover5z-b (m8)

2

600

5

14

0

clover5z-c (m9)

2

600

5

13

0

clover5z-d (m10)

2

600

5

16

1

clover5z-e (m11)

2

600

5

22

1

clover5z-f (m12)

2

800

7

8

0

clover5z-g (m13)

2

800

7

10

0

clover5z-h (m14)

2

800

7

11

0

clover5z-i (m15)

2

800

7

13

1

clover5z-j (m16)

2

800

7

13

0

paw02a-a (m17)

2

600

5

6

0

paw02a-b (m18)

2

600

5

7

0

paw02a-c (m19)

2

600

5

10

0

paw02a-d (m20)

2

600

5

9

1

paw02a-e (m21)

2

600

5

14

1

paw02a-f (m22)

2

800

7

7

0

paw02a-g (m23)

2

800

7

4

1

paw02a-h (m24)

2

800

7

10

1

paw02a-i (m25)

2

800

7

6

0

paw02a-j (m26)

2

800

7

6

1

flare-F (m27)

10

1067

23.25

2

0

winequality-red-4 (m28)

11

1600

28.62

0

0

yeast1 (m29)

8

1484

2.45

90

2

glass2 (m30)

9

214

11.5

0

0