Abstract
The rapid advancement of artificial intelligence and robot technology has spurred the proposal and innovation of a coal gangue sorting robot system (CGSRS) paradigm. The time-varying raw coal flow (TVRCF) with multi-scene and full working conditions affects the gangue queue. Configuring the CGSRS scheme correctly is combative. The field environment puts forward higher requirements for the time complexity of the CGSRS multi-task allocation strategy. Therefore, this paper proposes a scheme evaluation method of the CGSRS with time-varying multi-scenario based on deep learning. Firstly, the gangue queue data set of multi-scene and full-condition TVRCF was obtained according to the belt speed, the maximum coal flow, and the uncorrelated nonlinear changes of coal flow and gangue content. The CGSRS scheme is established based on robot number and rule combination, and the multi-task allocation strategy is adjusted to generate the labels of the gangue queue. Then, the RGB sample set is established based on the labels of the gangue queue. The CGSRS scheme evaluation model is trained based on DenseNet. Finally, the CGSRS scheme evaluation method was designed to realize the prediction of a random gangue queue. In this paper, the CGSRS scheme evaluation model, the stability of the solution, and the comparison of methods are carried out. Experimental results show that the solution of the CGSRS scheme evaluation model is accurate and stable. The time complexity is significantly reduced and very stable. The CGSRS scheme evaluation method is applied to the CGSRS multi-task allocation problem, and the stability of the solution is not affected by the data. It is significantly better than the multi-task allocation strategy. The proposed method is the first attempt to apply deep learning to a multi-task allocation problem in CGSRS.
Similar content being viewed by others
Introduction
The intelligent development of coal mines has injected new vitality into improving coal separation technology1. Robotics have been extensively employed in the coal preparation process. The combination of time-varying raw coal flow (TVRCF), belt conveyor, and multi-robot system describes a coal gangue sorting robot system (CGSRS) paradigm2. The coal gangue sorting robot consists of an identification and positioning module, a control module, and a multi-manipulator module3. The key technologies encompass coal gangue identification technology, machine vision technology4, robot technology5, and artificial intelligence technology6, intending to sort coal or gangue efficiently. In the coal preparation process, the scenes of the coal preparation plant and the working conditions of TVRCF are diverse. Therefore, the research on coal gangue identification and positioning technology, dynamic target tracking technology and multi-task allocation strategy of CGSRS have been the focus of scholars in recent years.
Identification and positioning technology and dynamic target tracking technology of coal gangue have developed rapidly. However, the multi-task allocation strategy can not efficiently pick and place the identified and positioned gangue queue in multi-scene and all working conditions. This also makes dynamic target-tracking technology challenging to perform. Regarding coal gangue identification and positioning, scholars are committed to improving the accuracy and efficiency of coal gangue identification, reducing positioning errors, and obtaining the volume information of gangue in multiple scenarios. Zhou et al. proposed the identification of coal and gangue based on the R-value method for dual-energy X-ray of Geant4 simulation7. The identification accuracy of this method for coal and gangue with different shapes and thicknesses reaches 95%, but the mechanism of influence of R-value on thickness, shape and composition needs to be studied. Zhang et al. studied the effect of liquid intervention on coal and gangue identification8. The experimental results show that the temperature difference between coal gangue can reach 1.5℃ after only 3 s of liquid intervention, which is conducive to the infrared detection efficiency of coal and gangue. Yan et al. proposed a coal-gangue identification method based on multi-spectral imaging technology and target detection9. Huang et al. proposed a coal gangue identification method based on volume detection and dynamic weighing, considering gangue queuing and binocular vision10. The YOLO series algorithms have been extensively applied to address coal gangue detection challenges. Notably, enhancing tiny YOLO v3 and incorporating a resnet18 backbone feature network have significantly bolstered detection efficiency and recognition accuracy11,12. In terms of dynamic target tracking technology, combined with global planning and local vision, the coal gangue sorting robot can quickly pursue and follow up the dynamic gangue target and reach the purpose of quasi-grasping13. The impact load arises from the velocity differential between the manipulator’s end and the target during grasping, potentially resulting in grasp failure and manipulator damage. Wang et al. propose a dynamic gangue trajectory planning method for the manipulator synchronous tracking under multi-constraint conditions14. Finally, the collaborative control of multiple manipulators will be used to obtain identification and positioning information of coal and gangue, generating trajectory planning information for the manipulators.
In summary, coal gangue identification and positioning technology and dynamic target tracking technology have been realized as applications in the coal preparation process, whether multi-scene or full working conditions. In terms of the multi-task allocation strategy, Cao et al. Proposed a multi-manipulator sorting strategy with a feedback mechanism to ensure the sustainability of gangue picking and placing by the manipulator when the position coordinate of manipulators in the fixed workspace is random15. When the position coordinate of manipulators in the fixed workspace is determined, Ma et al. proposed a multi-arm global cooperative coal gangue sorting method based on an improved Hungarian algorithm, which considers the influence of different belt speeds but does not discuss the relationship between robot number and sorting efficiency16. In addition, Wang et al. proposed a cooperative strategy of multi-arm coal gangue sorting robot based on immune dynamic workspace, which improved sorting efficiency and solved the deadlock problem in the collaborative process of multi-manipulator17. Wu et al. proposed a multi-task allocation framework of a coal gangue sorting robot system for the time-varying raw coal flow and proposed task decomposition, task distribution, and executable collaborative strategy2. The coal gangue sorting robot system is applied to the coal preparation process. Among them, the belt speed of multiple scenes is different. The maximum coal flow and TVRCF of full-working conditions are varied. In any scene and working condition, if CGSRS is configured with the best robot number and cooperative strategy, the time complexity of the existing multi-task allocation strategy is enormous. Even in a scene with high belt speed, the time complexity of an online multi-task allocation strategy to select the best cooperative strategy is challenging in meeting real-time requirements. However, there is still a lack of research on the coal gangue sorting robot system for multi-scene and all-condition time-varying raw coal flow.
With the development of artificial intelligence, neural networks are widely used in various fields18,19. In a multi-robot system. Bhadra et al. developed an end-to-end 3D CNN model to predict soybean yield using multi-temporal RGB images acquired by drones20. This is the earliest study of neural networks for soybean yield prediction. Experimental results show that DenseNet optimizes the accuracy of yield prediction and model complexity. Cao et al. discussed an end-to-end detection model based on DenseNet for real-time and high-precision robot grasping detection of fragile fruits21. This method considers the diversity of fruit shapes and sizes and designs four end-to-end detection models based on three convolutional neural network architectures to obtain the fruit grasp position. In the experimental results, the accuracy of the four models ranged from 83.86 to 93.64%. All the models were capable of rapid real-time detection. In the CGSRS paradigm, a fundamental challenge faced by the multi-manipulator module is optimizing the robot number and cooperation strategy for the gangue queue with multiple scenes and full working conditions22. On this basis, the combination of scheduling rules is selected online for the gangue queue of determined scenes and conditions, and this multi-task allocation strategy is robust to TVRCF23. Therefore, neural networks are applied to the CGSRS paradigm. DenseNet establishes the CGSRS scheme evaluation model of multi-task allocation strategy to reduce the time complexity. It is beneficial for optimizing robot number and cooperative strategy and selecting online scheduling rule combinations.
Aiming at the gangue queue with multi-scene and full working condition, this study constructs the RGB image sample set and uses DenseNet to establish the CGSRS scheme evaluation model based on a multi-task allocation strategy. The CGSRS scheme describes robot number and cooperative strategy. The task completion success rate, sorting rate and robot utilization rate are used as evaluation indexes. This method calculates the approximate solution of each CGSRS scheme evaluation model, selects the CGSRS scheme with the optimal solution, and calculates the deterministic solution of the gangue queue through the multi-task allocation strategy. The neural network realizes the end-to-end solution of the multi-task allocation strategy, which effectively reduces the selection time of the CGSRS scheme and the optimization time of the online cooperative strategy.
Aiming at the gangue queue of multi-scene and full working condition in the coal preparation process, this paper proposed a scheme evaluation method for CGSRS with time-varying multi-scenario based on deep learning. Specific contributions are as follows.
According to the time, space and feature of the gangue queue, the RGB image sample set is constructed. DenseNet is used to establish the CGSRS scheme evaluation model, and the optimal CGSRS scheme is efficiently selected for gangue queues in multi-scene and full working condition. This method has good accuracy, and the required time is much lower than the time complexity of using multi-task allocation strategy.
During the operation of CGSRS, the CGSRS scheme evaluation model is used to calculate and select the online cooperative strategy of multi-manipulator. The time of the optimization process is much lower than the time complexity of using a multi-task allocation strategy.
The subsequent sections of this article are organized as follows: Section “Problem description” delineates the problem, while Section “The CGSRS scheme evaluation method for all-condition TVRCF is constructed” expounds upon the proposed methodology. Section “Simulation experiments” presents simulations, results, and discussions. Section “Conclusion” provides conclusions.
Problem description
TVRCF in the coal preparation process
The dimensional information such as time, space and feature of gangue queue is affected by the coal preparation process of multi-scene and full working condition. Among them, multi-scene describes the belt conveyor speed and the maximum value of coal flow in the coal preparation plant, which affects the arrival rate of coal and gangue. The full working condition describes the coal flow and gangue content of fully mechanized mining and caving coal, the coal and gangue granularity of the crusher24,25, and the coal and gangue position of the vibrating screen. Following the objective reality, the arrival rate of coal and gangue must not lead to the accumulation and blockage of belt conveyor. The granularity of coal and gangue can not be greater than the width of the belt conveyor.
Coal flow and gangue content of raw coal flow are uncorrelated and change nonlinearly with time, defined as TVRCF. In the coal preparation process, the coal flow of TVRCF varies within a specific range and is always smaller than the maximum value of coal flow in this scene. On this basis, the gangue content of TVRCF affects the quantity and granularity of gangue. According to the conservation of mass, the larger of gangue quantity, the smaller of gangue granularity. And vice versa. Wu et al. described the mapping between TVRCF and gangue queue by task decomposition2. This contributes to the research work in this paper.
In summary, the CGSRS paradigm is deployed in a multi-scene, full-condition coal preparation process. The gangue queue of TVRCF processed by CGSRS is quite complex. This leads to the problem that the multi-task allocation result of CGSRS is not ideal, and the CGSRS scheme is not universal. The cost of replacing hand-picked gangue with the CGSRS paradigm is that the CGSRS scheme must conform to human adaptability.
CGSRS in the coal preparation process
In order to enhance the ability of CGSRS to pick and place gangue queue, it can be achieved by adjusting the gangue queue, increasing the manipulator number, and optimizing the cooperative strategy. Consider increasing the manipulator number. It can be expected that the scene of the hand-picking gangue does not have infinite space. Moreover, the layout of other devices also limits the manipulator number in CGSRS. Although the space of the transportation scene is abundant, the capacity of a coal mine requires that the belt conveyor speed is generally above 2m/s. The robot specification makes it difficult to pick and place gangue queue at this speed. Consider adjusting the gangue queue. As seen from “TVRCF in the coal preparation process”, in the coal preparation process with multi-scene and full-condition, the gangue queue is directly or indirectly affected by fully mechanized mining, top-caving, crusher, vibrating screen and belt conveyor. It is not easy. Consider the optimization of cooperative strategy. According to the scheduling rules executed by the manipulator, the combination of the scheduling rule with the best task allocation result is selected. In summary, the decision variables of the CGSRS scheme are manipulator number and scheduling rule combination.
According to the problem description, selecting the CGSRS scheme suitable for the gangue queue in the coal preparation process with multi-scene and full working condition is challenging. This study considers dimensional information such as time, space and feature of gangue queue. Firstly, the RGB image dataset of the gangue queue is constructed according to the arrival time interval, position and granularity. Then, the multi-task allocation model is used to generate the label of the RGB image. Finally, DenseNet established the CGSRS scheme evaluation model. It is an end-to-end evaluation of a multi-task allocation problem. For the CGSRS scheme selection of gangue queue and the cooperative strategy optimization of CGSRS, the CGSRS scheme evaluation model can be used to obtain the near-optimal solution, and then the multi-task allocation strategy can be used to calculate the deterministic solution.
Mathematical modeling of the CGSRS scheme
Assumptions
Without loss of generality, the following assumptions are given in this paper:
-
The variation of gangue content is known and will not be discussed in this paper.
-
The granularity distribution of coal and gangue is known, and the range of granularity distribution is constrained in this paper.
-
The conveyor uses a flat belt. The speed is adjustable. There is no slack, skid, or deviation.
-
In the CGSRS scheme, the density value of gangue is known.
-
In the CGSRS scheme, the robot’s initial position and placing position are determined, and the gangue coordinate affects the picking position.
-
The gangue arrival rate follows a Poisson distribution, and the position coordinates are expressed as (x, y). The ratio of x interval value to belt speed defines the gangue arrival time interval. The coordinate value y in the vertical belt speed direction is uniformly distributed.
Evaluation indexes
Generally, the evaluation metrics for the multi-task allocation problem of CGSRS are sorting rate (SR), task completion success rate (TCSR), and robot utilization rate (RUR)2. Among them, The SR is the ratio of the total mass of gangue picked up and placed by the manipulator to the total mass of gangue entering the CGSRS. The expression is as follows.
where εQ is the symbol of the sorting rate. mGk is the mass of the k-th gangue picked up and placed by the j-th manipulator, in kg. mGi is the mass of gangue i-th in the gangue queue, in kg.
The ratio of the total amount of gangue picked up and placed by the manipulator to the total amount of gangue entering CGSRS is defined as the TCSR, which is calculated as follows:
where εN is the symbol of the task completion success rate. nj is the statistical value of the amount of gangue picked up and placed by the j-th manipulator. N is the total amount of gangue in TVRCF.
The ratio of the working time of the manipulator to the operating time of the system is defined as the RUR. The expression is as follows:
where εMj is the symbol of the robot utilization rate. tjk is the sorting time of the j-th manipulator picking up and placing the k-th gangue, unit s. tGN is the arrival time of the last gangue, unit s.
This paper uses the accuracy (Acc) to describe the prediction accuracy of the CGSRS scheme evaluation model, including SR, TCSR, and RUR. The specific calculation process is as follows.
Valuetrue is the evaluation index predicted by the multi-task allocation strategy. Valuepred is the evaluation index predicted by the CGSRS scheme evaluation model.
The root mean square error (RMSE) describes the prediction stability of the CGSRS scheme evaluation model, including SR, TCSR, and RUR. The specific calculation process is as follows.
Constraints
In the coal preparation process, the crusher processes the granularity of coal and gangue. This paper discusses coal and gangue ranging from 50 to 300 mm. The specific constraints are as follows:
where, sGi is the granularity of the i-th gangue in the gangue queue, in mm.
In this paper, the coal flow in the time window is discussed, and the symbol is Q(τ), and the unit is kg/s. The length of the time window is set to one second. The sequence number of the time window is described as τ. The belt speed and maximum coal flow are known in a scene. The change of the coal flow in continuous time windows describes a working condition. The constraint of coal flow is as follows.
The gangue content changed with time. The symbol is μ(τ), and the constraint is as follows.
The CGSRS scheme evaluation method for all-condition TVRCF is constructed
The CGSRS evaluation method with time-varying multiple scenarios, based on deep learning, integrates the strengths of deep learning and a multi-task allocation model. As shown in Fig. 1, it is the construction flow chart of the CGSRS scheme evaluation method for multi-scenario and full-condition TVRCF. The RGB image generation process, CGSRS scheme evaluation model and CGSRS scheme evaluation method are described in turn.
The detailed implementation of RGB image generation, CGSRS scheme evaluation model, and CGSRS scheme evaluation method is as follows.
-
Step 1
An RGB image generation process to determine the scene and working condition. First, the TVRCF is mapped to the gangue queue according to a scene and working condition. Then, the granularity of the feature dimension, the time dimension’s arrival time interval, and the space dimension’s position coordinate were normalized and mapped to the Red, Green, and Blue channels, respectively. Finally, the three channels fuse an RGB image, which contains information on the gangue queue’s feature, time and space dimensions. The TVRCF of each working condition can obtain many gangue queues. Each gangue queue generates an RGB image. Therefore, the gangue queue of full-condition TVRCF generates the RGB image data set in this scene.
-
Step 2
A CGSRS scheme evaluation model based on DenseNet in the determined scene. In general, the belt speed of the scene is stable. The essence of the CGSRS scheme is the difference in robot number and rule combination, which is the key to adjusting the multi-task allocation strategy. The evaluation indexes such as SR, TCSR, and RUR of the gangue queue are calculated through a multi-task allocation strategy. The evaluation index of encoding is the label of an RGB image, generating an RGB image sample. Finally, the multi-task allocation strategy of this CGSRS scheme needs to calculate the gangue queue data set of TVRCF in the full working condition to establish an RGB image sample set. The CGSRS scheme evaluation model is obtained by training the DenseNet network with the RGB image sample set.
-
Step 3
CGSRS scheme evaluation method with time-varying multi-scenario based on deep learning. In the determined scene, the RGB image data set is generated according to the gangue queue data set of full-condition TVRCF. The existing CGSRS schemes are exhaustively enumerated according to the belt speed in this scene. The multi-task allocation strategy of step 2 calculates the evaluation index of the gangue queue, which is the label of RGB image, and will change with the CGSRS scheme. To summarize, a scene’s RGB image data sets are the same. However, different CGSRS schemes build different RGB image sample sets. The CGSRS scheme evaluation model obtained by training the DenseNet network is not unique and is not independent of each other. It is the key to implementing the CGSRS scheme evaluation method.
-
Step 4
CGSRS scheme evaluation process. A scene is randomly selected, and all CGSRS scheme evaluation models conforming to this scene are selected. The gangue queue of full-condition TVRCF was collected to generate an RGB image data set. The selected CGSRS scheme evaluation model evaluated the SR, TCSR, and RUR of the RGB image data set. The CGSRS scheme of near-optimal solution is selected, and the multi-task allocation strategy calculates the determined solution. Because the CGSRS scheme evaluation model is end-to-end, the time complexity is much less than that of the multi-task allocation strategy. It allows us to do many evaluations quickly using the CGSRS scheme evaluation model. When selecting the near-optimal solution, the multi-task allocation strategy must only be calculated once to obtain the determined solution for the gangue queue. It is the key to achieving online optimization of the multi-task allocation strategy.
Improved multi-task allocation strategy
Mathematical description of multi-task allocation
This paper aims to discuss the application of deep learning to the problem of multi-task allocation in CGSRS. Some contents are improved to ensure the stability of the multi-task allocation process2.
The Gangue queue is expressed as follows.
where, i is the sequence number of gangue in gangue queue. xGi is the X-axis coordinated of the i-th gangue, unit is mm. yGi is the Y-axis coordinate of the i-th gangue, unit is mm. zGi is the Z-axis coordinate of the i-th gangue, unit is mm. zGi is set as 0 mm. mGi is the mass of i-th gangue, unit is kg. The gangue density is determined, so the mass is related to the granularity.
The robot is expressed as follows.
According to the state, moving ability and position coordinates of the robot, and the moving ability and position coordinates of the gangue, the possibility of picking and placing gangue by the robot is judged. The robot state is set to Pj. When the value is 1, the robot is idle. Then, the judgment result can be expressed as follows.
where, ti is the waiting time for i-th gangue to leave the picking and placing area, according to the moving ability and position coordinates of gangue. According to the moving ability and position coordinates of the robot, tij is the time for the robot to pick and place gangue.
The waiting time is calculated as follows.
where, ljmax is the boundary value of the picking and placing area, unit is mm. The vb is the belt speed, unit is m/s. This variable also describes the moving ability of gangue.
The picking and placing time is calculated as follows.
The iteration of the gangue queue in the picking and placing area can be regarded as a Markov process. Taking the picking and placing time of the robot or the arrival time interval of the gangue as the iteration time will cause instability in the solution process. Therefore, a fixed step size is chosen as the iteration time in this paper, described as Ts.
where s is the number of iterations. Es is the environment state, which describes the picking and placing time, gangue mass, serial number, and the judgment result. It is described as follows.
Scheduling rules
This paper discusses the application of three scheduling rules in the multi-task allocation strategy of CGSRS, namely first-in-first-out (FIFO)26, shortest processing time (SPT)22, and mass first (MF)2. Among them, FIFO prioritizes the gangue with the smallest sequence number. MF gives priority to the gangue with the largest mass. SPT gives priority to the gangue with the smallest picking and placing time. The details are described as follows.
CGSRS scheme evaluation method based on DenseNet
Sample set construction based on decomposition method
This paper uses the decomposition method of TVRCF to simulate the gangue queue data set with multi-scene and full working condition. It contains the position coordinates of the X-axis, Y-axis, and Z-axis of the gangue queue and the gangue mass. It is the basis for obtaining an RGB image data set.
The gangue mass is related to the granularity. Because the gangue density is determined, the symbol is ρG, set as 2100 kg/m3. The gangue shape is similar to a cube, and the compensation factor similar to the cube is set as α, a value of 0.85. The calculation process for gangue mass is as follows.
The specifications and parameters of the crusher affect the granularity distribution of gangue, and the specific constraints are as follows:
where n is set to 1. v is set to 130. Г is the gamma function.
The arrival time interval of gangue along the X-axis follows an exponential distribution with mean μ-1. The calculation of xGk is as follows:
In a time window, according to the law of mass conservation, gangue content, coal flow, gangue quantity, and gangue mass have the following relationship.
The belt width is 800 mm, and the distance between the two sides is reserved to avoid the fall of large-grain gangue. The coordinates of gangue perpendicular to the direction of belt movement are uniformly distributed.
The sample set construction is discussed based on the decomposition method of TVRCF. It includes TVRCF of multi-scene and full-working condition, Decomposition method of TVRCF, CGSRS scheme set, Scene consistency, and RGB sample set of full-condition TVRCF.
TVRCF of multi-scene and full working condition. The scene affects the belt speed. The working condition affects the maximum coal flow of TVRCF. To this end, a working condition is taken as an example. The function of coal flow is unchanged. The function of gangue content undergoes eight phase shifts in one cycle. The sign of phase shift is δ, and this fixed value is set to π/4.
Decomposition method of TVRCF. Select a scene where the belt speed is 0.5m/s. Select a working condition where the maximum coal flow is 100kg/s. The 7th phase shift is chosen, and the value of δ is 7π/4. At this point, the TVRCF is determined. The decomposition method generates the gangue queue, which contains the arrival time interval, position coordinates and mass of 800 gangues. Each TVRCF cycles the decomposition method 200 times, such that the generated 200 gangue queues are a gangue queue data subset. Finally, the gangue queue data subset obtains the RGB image data subset.
CGSRS scheme set, scene consistency, and RGB sample set of full-condition TVRCF. The robot scheme contains the robot number and the rule combination. We did simple coding. When the robot number is 1, the rule combines three kinds, which are (1, 1), (1, 2), (1, 3). Taking (1, 2) as an example, 1 describes the robot number, and 2 describes the scheduling rule of the robot. Among them, 1 describes the FIFO. 2 describes the MF. 3 describes the SPT. The multi-task allocation strategy is modified once every CGSRS scheme is determined. Note that, according to the scene consistency, the belt speed of the CGSRS scheme must be the same as that of the TVRCF. The full-condition TVRCF describes the seven maximum coal flow. There are eight phase shifts for each maximum coal flow. There are 200 gangue queues for each phase shift. So, the multi-task allocation strategy needs to calculate 56 gangue queue data subsets or the evaluation indexes of 11,200 gangue queues. Finally, the label of each RGB image sample is generated according to these evaluation indexes.
TVRCF with multi-scene and full working condition. According to the gangue content and maximum value of coal flow, the specific calculation process of phase translation is as follows.
The decomposition method of TVRCF. The tGi, yGi, and mGi of gangue need to be normalized. It is equivalent to the dimensional data normalization in Fig. 1. The normalized values will map to the red, green, and blue channels. The range is all [0, 255]. The normalization process is as follows.
ValueGi describes the dimensional data of the i-th gangue, such as tGi, yGi, and mGi. ValueGmax describes the maximum value of dimensional data. ValueGmin describes the minimum value of dimensional data. ValueGinorm describes the normalized dimensional data of the i-th gangue.
To sum up, according to TVRCF of multi-scene and full working condition, the RGB image sample set that the CGSRS scheme set can construct is shown in the following table (Table 1).
CGSRS scheme evaluation method based on DenseNet
The information on the gangue queue’s time, space and feature dimensions is more complex. Shallow networks struggle to capture key features in the data. Although deep networks can learn richer features, improving complexity reduces generalization ability. Therefore, the DenseNet network, which is based on a dense block structure, is used in this paper. Among them, the direct connection between each layer and all previous layers is realized through the dense connection mechanism, and its output is passed to all subsequent layers, which improves the reuse and propagation of features27. SR, TCSR, and RUR are the network’s outputs. Figure 2 shows that the RGB sample set of the full-condition TVRCF in one scene contains 11,200 gangue queues. Firstly, the samples are randomly sorted, and then the training, test, and validation sets are constructed according to the ratio of 0.9:0.05:0.05. Figure 3 shows the network structure diagram of the CGSRS scheme evaluation model based on DenseNet.
It can be seen from Fig. 3. The DenseNet network in this paper consists of an initial convolution block, six densely connected Dense blocks, five transition blocks, a global average pooling layer (AdaptiveAvgPool2d), a flat layer (Flatten) and a fully connected layer (Linear).
The initial convolution block is used to extract the low-level features of the input image. It takes 3-channel input (e.g., an RGB image) and outputs 32-channel feature maps. This block contains a convolution layer with a convolution kernel size of 3 × 3, a batch normalization layer, a ReLU activation function, and a max pooling layer with a pooling kernel size 3 × 3.
Each Dense block internally contains 4 convolution layers, each of which sequentially performs batch normalization (BatchNorm2d), ReLU activation function, and 3 × 3 convolution operation (Conv2d), where the padding of the convolution operation is 1 to keep the feature map size constant. In each Dense block, the number of input channels is computed dynamically and is equal to the number of initial input channels plus the sum of the number of output channels of all previous Dense blocks. This design allows the output of each layer to be concatenated with the output of all previous layers in the channel dimension, enabling the accumulation and reuse of features.
The transition block is located between two Dense blocks and usually contains a 1 × 1 convolution layer and a 2 × 2 average pooling layer for progressively reducing the number of feature maps. The global average pooling layer reduces the spatial dimension of the feature maps to 1 × 1, resulting in one global description for each feature map. The flat layer flattens the 2D feature map into a 1D vector, which a fully connected layer maps to the final output space. The output layer uses a modified Sigmoid activation function for regression prediction.
Adam optimizer is used during training, and the learning rate is set at 0.0001. The model performance was evaluated by calculating the RMSE between SR, TCSR, and RUR. The mapping relationship between RUR and RGB image samples is the most complex among the sample labels. To ensure the accuracy of the overall prediction results, we weighted the sum of each loss in the proportion of 0.45, 0.45, and 0.10 to optimize the model parameters and minimize the prediction error.
Simulation experiments
The experiment employs the Windows 10 operating system, Pytorch2.1.0 + cu118 deep learning development framework, and Python as the programming language for development. The CPU utilized is Intel Core i5-10400F, while the GPU employed belongs to NVIDIA 4060.
Parameter settings
According to the CGSRS scheme, the multi-task allocation strategy simulation is carried out in MATLAB R2021b. The specifications of robots are the same. The end-effector is 400 mm from the belt plane to avoid collisions between the robot and the gangue. The other parameters are shown in Table 2.
CGSRS scheme evaluation model
Figure 2 shows that the RGB sample set of each scene contains 11,200 gangue queues. The RGB sample set is divided into training, test, and validation sets according to the proportion. In the CGSRS scheme evaluation model of each scene, 500 gangue queues of the validation set were randomly selected for the experiment.
RMSE of CGSRS scheme evaluation model
RMSE describes the stability of the CGSRS scheme evaluation model. The CGSRS scheme evaluation model experiment of multi-scene and the specific results are as follows.
Figure 4 shows the RMSE of the CGSRS scheme evaluation model in multi-scene. Figure 4a–c show the results of SR-RMSE, TCSR-RMSE, and RUR-RMSE, respectively. Among them, SR-RMSE and TCSR-RMSE are less than 0.15. RUR-RMSE is less than 0.1. It indicates that the CGSRS scheme evaluation model is stable. Figure 4d shows a scene where the belt speed is 0.1 m/s. The deviation value of the CGSRS scheme evaluation model and multi-task allocation strategy is distributed between − 0.4 and 0.6. and there is no skewness. It shows good symmetry. Among them, the deviation values of CGSRS schemes such as (2, 2), (3, 2), and (3, 5) are long-tailed. It shows that the rule combination affects the multi-task allocation results of the CGSRS scheme. It conforms to the objective law of multi-task allocation strategy. In summary, the CGSRS scheme evaluation model has good stability and symmetry.
Acc of CGSRS scheme evaluation model
The Acc describes the accuracy of the CGSRS scheme evaluation model. The CGSRS scheme evaluation model experiment of multi-scene and the specific results are as follows.
Figure 5 shows the Acc of the CGSRS scheme evaluation model in multi-scene. Figure 5a–c show the results of SR-Acc, TCSR-Acc, and RUR-Acc, respectively. Among them, SR-Acc and TCSR-Acc are greater than 0.85. RUR-Acc is greater than 0.9. This indicates that the CGSRS scheme evaluation model is accurate. The maximum Acc and minimum Acc of the CGSRS scheme (3, 2) are obvious. Table 1. Description of RGB image sample set based on CGSRS schemes with multi-scene and all-condition TVRCF shows that this is affected by the FIFO rule. Figure 5d shows the Acc of CGSRS scheme (3, 2) in multi-scene. It can be seen that SR-Acc and TCSR-Acc are smaller than RUR-Acc, but both are about 0.87. Such accuracy meets the application requirement of the CGSRS scheme evaluation model in multi-scene and full-condition TVRCF. Considering that the belt speed in Fig. 5d is 0.4 m/s and 0.5 m/s, the SR-Acc, TCSR-Acc, and RUR-Acc are poor. We show the predicted data of 500 gangues for SR, TCSR and RUR in Fig. 5e,f, respectively. It can be seen that the fitting curve of the predicted data is good, but the very few data are poor. It is because of the instability of FIFO.
Time complexity of CGSRS scheme evaluation model
The time complexity describes the computation time of the CGSRS scheme evaluation model and multi-task allocation strategy. The computation time is relatively stable. This paper selects 50 RGB samples for a single set of tests. For example, in a scene with a belt speed of 0.1 m/s, the CGSRS scheme (1, 1) has 11,200 RGB samples, and the mean value of 50 RGB samples is randomly sampled for the time complexity test. As shown in Fig. 6, it is the time complexity of the CGSRS scheme evaluation model and multi-task allocation strategy.
Obviously, in the multi-scene and full-condition TVRCF, the time complexity of the CGSRS scheme evaluation model is far less than that of the multi-task allocation strategy. Moreover, CGSRS schemes and multi-scene only slightly affect this time complexity.
Experiment and result analysis
Experiment 1: Reliability of solutions, RGB image data sampled from gangue queue
The CGSRS scheme evaluation model is applied to the multi-task allocation of gangue queues. The primary problem is inconsistent quantity in the gangue queue and the RGB image. Therefore, in the TVRCF of the multi-scene and full working condition, 800 gangues of the RGB image are randomly sampled from the gangue queue to experiment with the stability of the solution. We discuss the scene where the belt speed is 0.5 m/s. The maximum coal flow is 60 kg/s and 120 kg/s, respectively. As shown in Table 3 describes the gangue queue and RGB image data. Among them, the case of RGB image data is coded according to the scene number and the working condition number .
In this experiment, solution 2 of the multi-task allocation strategy calculates the gangue queue. Solution 1 of the CGSRS scheme evaluation model predicts RGB image data. Whether there is a difference between the two solutions depends on the stability of the solution. According to Table 1, the experimental results of all CGSRS schemes are as follows.
Figure 7 illustrates the solution of the multi-task allocation strategy and the CGSRS scheme evaluation model. Figure 7a shows the working condition of low coal flow, and solution 1 is the multi-task allocation strategy to calculate 1096 gangues. Solution 2 is the CGSRS scheme evaluation model predicting RGB image data. The larger difference is the CGSRS scheme (3, 2). According to Table 1, FIFO greatly influences the prediction results. Figure 7b shows the working condition of low coal flow, and solution 1 is the multi-task allocation strategy to calculate 1836 gangues. Solution 2 is the CGSRS scheme evaluation model predicting RGB image data. The larger difference is the CGSRS scheme (3, 5). According to Table 1, the rule combination of FIFO and MF greatly influences the prediction results.
In summary, with the RGB image data randomly sampled from the gangue queue, the solutions predicted by the CGSRS scheme evaluation model are in line with the solutions calculated by the multi-task allocation strategy. Combined with Fig. 2, we can see that the data of the RGB sample set comes from the TVRCF of one cycle, which contains all the information of the gangue queue. Therefore, the stability of the solution is not affected by random sampling.
Experiment 2: Methods comparison, Time complexity and solution
The RMSE, Acc and time complexity of the CGSRS scheme evaluation model have been discussed in “CGSRS scheme evaluation model”, and the results are good. Experiment 1 shows that the RGB image data are inconsistent with gangue quantity in the gangue queue, which does not affect the solution of the CGSRS scheme evaluation model. On this basis, the online multi-task allocation problem of CGSRS is discussed. It is assumed that the belt speed of the scene is 0.6 m/s, and the working condition is unknown. Ten gangue queues were randomly generated. The CGSRS scheme evaluation model and the multi-task allocation strategy were solved, respectively, and the two methods’ time complexity and solution results were compared.
Figure 8 is the flowchart of the method. The multi-task allocation strategy for CGSRS has been proposed in recent years2. The process is to traverse the robot number and rule combination to calculate the optimal solution of the gangue queue, resulting from the multi-task allocation problem. The CGSRS scheme evaluation model in this paper cannot directly calculate the multi-task allocation results of the gangue queue. The specific process is that RGB images are obtained by randomly sampling the gangue queue. The predicted near-optimal solution is obtained by traversing all CGSRS scheme evaluation models. According to the CGSRS scheme of the near-optimal solution, the multi-task allocation strategy is adjusted according to the robot number and the rule combination, and the multi-task allocation results of the gangue queue are calculated. Different scenes have different preferences for solutions. In this experiment, the preference setting is SR.
According to Table 4, the solutions of example 1, example 8, example 9 and example 10 are the same, indicating that the CGSRS scheme evaluation method can predict the optimal robot number and rule combination. The solution is different from the other examples. The average difference in SR among the 10 examples is 0.0139. Only in case 7 is the SR of the multi-task allocation strategy 0.0839 higher than that of the CGSRS scheme evaluation model. Figure 9a,b show the multi-task allocation results of the multi-task allocation strategy and the CGSRS scheme evaluation method, respectively. The comparison of solutions shows that the CGSRS scheme evaluation method is stable. In 10 examples, the time complexity of the multi-task allocation strategy is 19 times that of the CGSRS scheme evaluation method. All in all, the solution of the CGSRS scheme evaluation method does not get worse, but the time complexity is significantly reduced.
Conclusion
This paper proposes a Scheme evaluation method for CGSRS with time-varying multi-scenario based on deep learning. It aims to reduce the time complexity of solving multi-task allocation without changing the accuracy of the solution.
-
(1)
The basis of the CGSRS scheme evaluation method is the stability, accuracy and time complexity of the CGSRS scheme evaluation model. Regarding stability and accuracy, SR-RMSE and TCSR-RMSE are lower than 0.15, and RUR-RMSE is lower than 0.1. SR-Acc and TCSR-Acc are greater than 0.87, and RUR-ACC is greater than 0.9. This indicates that the solution predicted by the CGSRS scheme evaluation model is close to the solution calculated by the multi-task allocation strategy. In addition, the time complexity of the CGSRS scheme evaluation model is much less than that of the multi-task allocation strategy, and it is very stable. This paper applies the DenseNet network to evaluate the CGSRS scheme, and the experimental results show it is feasible. It provides more possibilities for the research of multi-task allocation problems.
-
(2)
The results of Experiment 1 show that the data inconsistency has no impact on the CGSRS scheme evaluation model, which is crucial for the method application. The results of Experiment 2 show that in the online multi-task allocation problem of CGSRS, the solution results of the proposed method are close to the multi-task allocation strategy. The time complexity is 19 times less than that of the multi-task allocation strategy. It allows CGSRS to be applied to scenes with higher belt speeds. In addition, the CGSRS scheme evaluation method is the first attempt to apply deep learning to the CGSRS multi-task allocation problem. It provides a methodological underpinning to the field and is an important contribution to this paper.
-
(3)
The CGSRS scheme evaluation method is applied to the TVRCF of multi-scene and full working conditions, and the time complexity is greatly reduced. It should be noted that the regression prediction of the DenseNet network is for a limited number of scenes and working conditions. It is suitable for regular coal preparation scenes, and its application in other fields still needs further research. In addition, the effect of rule combination on the CGSRS scheme evaluation method is objective, which needs follow-up research to improve. Finally, this paper proposes the CGSRS scheme evaluation method, but a better application requires in-depth research.
Data availability
The datasets used and analyzed during the current study available from the corresponding author on reasonable request. Correspondence and requests for materials should be addressed to X. C.
References
Zhang, L., Sui, Y., Wang, H., Hao, S. & Zhang, N. Image feature extraction and recognition model construction of coal and gangue based on image processing technology. Sci. Rep. 12(1), 20983. https://doi.org/10.1038/s41598-022-25496-5 (2022).
Wu, X., Cao, X., Wang, P., Ma, H. & Zhang, Y. Multi-task allocation framework of coal gangue sorting robot system for the time-varying raw coal flow. Int. J. Coal Prep. Util. 44(6), 715–739. https://doi.org/10.1080/19392699.2023.2217657 (2023).
Cao, X. Fei, J. Wang, P. Li, N. & Su, L. (2019) Study on coal-gangue sorting method based on multi-manipulator collaboration. Coal Sci. Technol. 47(4), 7–12. https://doi.org/10.13199/j.cnki.cst.2019.04.002.
Li, M., Duan, Y., He, X. & Yang, M. Image positioning and identification method and system for coal and gangue sorting robot. Int. J. Coal Prep. Util. 42(6), 1759–1777. https://doi.org/10.1080/19392699.2020.1760855 (2022).
Tzampazaki, M., Zografos, C., Vrochidou, E. & Papakostas, G. A. (2024) Machine vision—Moving from Industry 4.0 to Industry 5.0. Appl. Sci. 14(4), 1471. https://doi.org/10.3390/app14041471.
Xiang, J. Wang, L. Li, L. Lai, K. H. & Cai, W. Classification-design-optimization integrated picking robots: A review. J. Intelligent Manufact. https://doi.org/10.1007/s10845-023-02201-5 (2023).
Zhou, J. Guo, Y. Wang, S. Chen, W. & Cheng, G. Identification of coal and gangue based on R value method for dual-energy X-ray of Geant4 simulation. Environ. Res. https://doi.org/10.1016/j.envres.2023.115650 (2023).
Zhang, J. Wang, J. He, G. Cheng, D. Han, X. & Fan, T. An experimental study on the improvement of and gangue identification efficiency by liquid intervention. J. China Coal Society. 46(S2), 681–691. https://doi.org/10.13225/j.cnki.jccs.yt20.1949 (2021).
Yan, P. Sun, Q. Yin, N. Hua, L. Shang, S. & Zhang, C. Detection of coal and gangue based on improved YOLOv5.1 which embedded scSE module. Measurement. https://doi.org/10.1016/j.measurement.2021.110530 (2021).
Huang, H., Dou, D. & Zhang, C. Detecting coal-carrying rate in gangue based on binocular machine vision and particle queuing method. Int. J. Coal Prep. Util. 43(7), 1209–1221. https://doi.org/10.1080/19392699.2022.2104265 (2022).
Pan, H., Shi, Y., Lei, X., Wang, Z. & Xin, F. Fast identification model for coal and gangue based on the improved tiny YOLO v3. J. Real-Time Image Proc. 19(3), 687–701. https://doi.org/10.1007/s11554-022-01215-1 (2022).
Xue, G. Li, S. Hou, P. Gao, S. & Tan, R. Research on lightweight Yolo coal gangue detection algorithm based on resnet18 backbone feature network. Internet Things. https://doi.org/10.1016/j.iot.2023.100762 (2023).
Zhang, Y. et al. Research on dynamic target steady and quasi-grasping method of coal gangue sorting robot based on global planning and local vision. Int. J. Coal Prep. Util. https://doi.org/10.1080/19392699.2024.2363867 (2024).
Wang, P. Ma, H. Zhang, Y. Cao, X. Wu, X. Wei, X. & Zhou, W. Tarjectory planning for coal gangue sorting robot tracking fast-mass target under multiple constraints. Sensors. https://doi.org/10.3390/s23094412 (2023).
Cao, X. Wu, X. Wang, P. Li, Y. Liu, S. Zhang, G. & Xia, H. Collaborative strategy of multi-manipulator for coal-gangue sorting robot. J. China Coal Society. 44(S2), 763–774. https://doi.org/10.13225/j.cnki.jccs.2019.0734 (2019).
Ma, H. Wei, X. Wang, P. Zhang, Y. Cao, X. & Zhou, W. Multi-arm global cooperative coal gangue sorting method based on improved Hungarian algorithm. Sensors. https://doi.org/10.3390/s22207987 (2022).
Wang, P. et al. A cooperative strategy of multi-arm coal gangue sorting robot based on immune dynamic workspace. Int. J. Coal Prep. Util. 43(5), 794–814. https://doi.org/10.1080/19392699.2022.2078808 (2022).
Lu, T, Han, B, Chen, L, Yu, F. & Xue, C. A generic intelligent tomato classification system for practical applications using DenseNet-201 with transfer learning. Sci. Rep. 11(1), 15824, https://doi.org/10.1038/s41598-021-95218-w (2021).
Wang, Z. Li, X. Yao, M. Li, J. Jiang, Q. & Yan, B. (2022). A new detection model of microaneurysms based on improved FC-DenseNet. Sci. Rep. 12(1), 950. https://doi.org/10.1038/s41598-021-04750-2 (2022).
Bhadra, S. et al. End-to-end 3D CNN for plot-scale soybean yield prediction using multitemporal UAV-based RGB images. Precision Agric. 25(2), 1014–1037. https://doi.org/10.1007/s11119-023-10096-8 (2023).
Cao, B. Zhang, B. Zhang, W. Zhou, J. Lin, Y. & Chen, Y. Real-time, highly accurate robotic grasp detection utilizing transfer learning for robots manipulating fragile fruits with widely variable sizes and shapes. Computers Electron. Agric. https://doi.org/10.1016/j.compag.2022.107254 (2022).
Nilakantan, J. Ponnambalam. S. Jawahar. N. & Kanagaraj, G. Bio-inspired search algorithms to solve robotic assembly line balancing problems. Neural Comput. Appl. 26(6), 1379–1393. https://doi.org/10.1007/s00521-014-1811-x (2015).
Huang, Y., Chiba, R., Arai, T., Ueyama, T. & Ota, J. Robust multi-robot coordination in pick-and place tasks based on part-dispatching rules. Robot. Auton. Syst. 64, 70–83. https://doi.org/10.1016/j.robot.2014.10.018 (2015).
Liang, M. Hu, C. Yu, R. Wang, L. Zhao, B. & Xu, Z. Optimization of the process parameters of fully mechanized top-coal caving in thick-seam coal using BP Neural Networks. Sustainability. https://doi.org/10.3390/su14031340 (2022).
Zheng, K., Du, C. & Yang, D. Orthogonal test and support vector machine applied to influence factors of coal and gangue separation. Int. J. Coal Prep. Util. 34(2), 75–84. https://doi.org/10.1080/19392699.2013.847831 (2014).
Kim, H. & Lee. J. Scheduling of dual-gripper robotic cells with reinforcement learning. IEEE Trans. Autom. Sci. Eng. 19(2), 1120–1136. https://doi.org/10.1109/TASE.2020.3047924 (2022).
Huang, G. Liu, Z. Maaten, L. V. D. & Weinberger, K. Q. Densely connected convolutional networks. in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPP), 4700–4708, https://doi.org/10.1109/CVPR.2017.243 (2017).
Acknowledgments
The authors gratefully thank anonymous reviewers for their valuable comments to improve the paper quality. X. G. C. discloses support for the research of this work from Funder [grant number 2018GY-160], Funder [grant number 2021JLM-04] and Funder [grant number 2024GX-YBXM-524]. P. W. discloses support for publication of this work from Funder [grant number 2023-JC-YB-362]. Y. Z. discloses support for publication of this work from Funder [grant number 23JK0548].
Funding
This study was funded by Key R & D Program of Shaanxi Province (2018GY-160), Joint fund project—Enterprise—Shaanxi Coal joint fund project (2021JLM-04), Xiangnan Key Laboratory of Rare and Precious Metal Complexes and Their Applications (2024GX-YBXM-524) and Natural science research project of Education Department of Shaanxi Province (23JK0548).
Author information
Authors and Affiliations
Contributions
X. D. W. conceived the study; X. G. C., X. D. W., and W. T. D. were the principal investigators; X. Y., and Y. Z. directed the overall study design; X. D. W., P. W., W. T. D., and Y. Z. performed the experiments; X. G. C., X. D. W., and W. T. D. analysed the data; X. D. W. wrote the manuscript. All authors discussed and interpreted the results.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Wu, X., Cao, X., Ding, W. et al. Scheme evaluation method of coal gangue sorting robot system with time-varying multi-scenario based on deep learning. Sci Rep 14, 28063 (2024). https://doi.org/10.1038/s41598-024-78635-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-78635-5
Keywords
This article is cited by
-
Multitask allocation model of the coal foreign object sorting robot based on optimal capacity and benefit
Scientific Reports (2025)
-
A novel deep learning model based on YOLOv5 optimal method for coal gangue image recognition
Scientific Reports (2025)