Table 1 Data quality metrics: pre- and post-preprocessing.

From: Forecasting cashew production in India using a hybrid machine learning framework with STL decomposition, ensemble methods, and global trade network analysis

Metric

Raw data

Processed data

Total observations

284

281

Duplicate records

3 (1.07%)

0 (0%)

Missing values

12 (4.23%)

0 (0%)

Outliers detected

4 (1.41%)

0 (winsorized)

Temporal coverage

1999–2020

1999–2020

Countries represented

60

60