Scientific Reports

Table 17 The category of challenges of each dataset with the removal of the respective non-English Language stopwords.

From: Key insights into recommended SMS spam detection datasets

Dataset	MNB Accuracy	Category of accuracy
Dataset 2 (Turkish)	98.28%	High accuracy
Dataset 4 (Bengali)	90.10%	Moderate accuracy
Dataset 6 (Hindi)	96.75%	High accuracy
Dataset 7 (Persian)	92.95%	Moderate accuracy
Dataset 8 (Indonesian)	96.94%	High accuracy
Dataset 9 (Hindi)	90.48%	Moderate accuracy
Dataset 10 (Hindi)	86.49%	Low accuracy

Back to article page

Search

Advanced search

Quick links