Table 7 List of features selected by each of the feature selection methods.

From: Predicting land suitability for wheat and barley crops using machine learning techniques

| Models | Selected features by UFS | Selected features by RFECV | Selected features by SFS |
|---|---|---|---|
| RF | 'Annual Rainfall(mm)', 'Depth (cm)', 'K', 'Texture', 'mean min_temp', 'max_slope', 'min_slope', 'CEC', 'T_N', 'pH_water', 'mean_max temp', 'O_M', 'Av_p' | 'Mean min_temp', 'Mean max_temp', 'Texture', 'Annual_Rainfall (mm)', 'Depth', 'pH_water', 'K', 'O_M', 'EC', 'T_N', 'Av_p', 'Sodicity (ESP)', 'Crop_name' | 'Mean min_temp', 'Texture', 'Annual_Rainfall', 'Depth', 'EC', 'Mean max_temp', 'CEC', 'Sodicity (ESP)', 'Crop_name' |
| GB | 'Annual Rainfall(mm)', 'Depth', 'K', 'Av_p', 'EC', 'Texture', 'mean min_temp', 'max_slope', 'min_slope', 'CEC', 'T_N', 'pH_water', 'Mean max_temp', 'O_M' | 'Annual Rainfall(mm)', 'Depth', 'EC', 'K', 'O_M', 'Crop_name' | 'Texture', 'mean min_temp', 'mean max_temp', 'Annual Rainfall(mm)', 'Depth', 'max_slope', 'K', 'EC', 'Av_p', 'Crop_name' |
| KNN | 'Annual Rainfall(mm)', 'Depth', 'K', 'Av_p', 'EC', 'Texture', 'mean min_temp', 'max_slope', 'min_slope', 'CEC', 'T_N', 'pH_water', 'Mean max_temp', 'O_M' | 'Annual Rainfall(mm)', 'EC', 'K', 'T_N', 'O_M', 'Crop_name' |

  1. As part of preprocessing, we also checked the dataset for outliers. The features 'Annual Rainfall(mm)', 'K', 'Texture', 'mean min_temp', 'max_slope', 'min_slope', 'EC', 'T_N', 'mean_max temp', 'O_M', and 'Av_p' contain outliers, whereas 'Depth (cm)', 'pH_water', 'Crop_name', 'CEC', and sodicity do not. Fig. 7 illustrates, with sample examples, how outliers were detected and removed using the interquartile range.
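
The interquartile-range filtering mentioned in the note can be sketched as follows. This is a minimal illustration, not the authors' code: the fence multiplier of 1.5 is the conventional Tukey rule and is an assumption (the note states only that the interquartile range was used), the function names `iqr_bounds` and `remove_outliers` are hypothetical, and the sample values are invented for demonstration.

```python
from statistics import quantiles


def iqr_bounds(values, k=1.5):
    """Return the (lower, upper) fences Q1 - k*IQR and Q3 + k*IQR.

    k=1.5 is the conventional choice and an assumption here.
    """
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def remove_outliers(rows, feature, k=1.5):
    """Keep only the rows whose value for `feature` lies within the fences."""
    lower, upper = iqr_bounds([row[feature] for row in rows], k)
    return [row for row in rows if lower <= row[feature] <= upper]


# Invented records for illustration only; not data from the study.
samples = [
    {"Annual Rainfall(mm)": 800, "Depth (cm)": 100},
    {"Annual Rainfall(mm)": 820, "Depth (cm)": 110},
    {"Annual Rainfall(mm)": 790, "Depth (cm)": 95},
    {"Annual Rainfall(mm)": 810, "Depth (cm)": 105},
    {"Annual Rainfall(mm)": 805, "Depth (cm)": 100},
    {"Annual Rainfall(mm)": 2500, "Depth (cm)": 100},  # implausibly high value
    {"Annual Rainfall(mm)": 795, "Depth (cm)": 90},
    {"Annual Rainfall(mm)": 815, "Depth (cm)": 115},
]

cleaned = remove_outliers(samples, "Annual Rainfall(mm)")
print(len(samples), "->", len(cleaned), "rows after filtering")
```

Filtering one feature at a time, as here, drops the whole record when any single value falls outside its fences. Whether the study filtered per feature or handled several features jointly is not stated in the note.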