Introduction

Robots are mechanical devices designed to perform repetitive tasks, reducing the need for human intervention1. With the advancement of technology, a growing range of robotic systems has been developed to operate in scenarios ranging from industrial to residential spaces2. For instance, industrial robots are deployed on production lines, while others serve as autonomous home assistants. Moreover, the range of environments and the specific demands of each application have expanded. A recent example is the exploration of Mars using planetary rovers such as Curiosity, Opportunity, and Perseverance3. These mobile robots can navigate within an environment to perform desired tasks4.

Although robots can replace humans in many tasks, there are cases where computational capabilities are not yet sufficient or feasible for full autonomy2. Semi-autonomous and teleoperated robots have become increasingly prevalent in situations where human presence is not always possible or safe. Semi-autonomous robots make some decisions on their own, but humans must intervene directly at certain points in the process5,6. In contrast, teleoperated robots depend entirely on direct intervention and are remotely controlled by a human operator over Wi-Fi, Bluetooth, or more complex connections such as the Internet1,7.

Teleoperated robots can operate in complex, poorly known, and even high-risk environments because they do not rely on a complete map of the surroundings: the operator makes the decisions8,9. Because commands can be sent over the Internet, robots can be operated from a well-prepared workspace, making them convenient for dangerous and inaccessible tasks1,4, such as underwater and intra-volcanic exploration or bomb disposal10,11.

Implementing teleoperated robots requires an operating tool7. Commonly used tools include joysticks and other physical devices that the operator must carry1. These devices are also expensive and specific to a single type of robot, making them difficult to maintain and replace12. In recent applications, gesture recognition using cameras and video sensors has been employed to reduce equipment complexity. However, some limitations remain in these tools: some require constant interaction, others can control only one robot at a time, and some cannot recognize the robot's orientation1.

This work proposes a novel teleoperation method for multi-robot systems that utilizes gesture recognition, enabling angle control and the simultaneous operation of multiple robots. This approach leverages the MediaPipe framework for intuitive control over robot navigation and orientation. Additionally, the implemented autonomous navigation allows the operator to issue high-level commands, while the system autonomously plans and executes optimal paths, thereby enhancing multitasking in dynamic and hazardous environments. This method improves operational flexibility and safety in industrial automation, exploration, and emergency response scenarios. A noteworthy feature is its scalability, allowing the simultaneous control of multiple robots, which can enhance efficiency in large-scale operations.

The robot used in this study was the TurtleBot 313, chosen for its navigation capabilities. Gesture recognition relies on the MediaPipe computer vision framework14, which extracts keypoints on the human body from video input. Development and testing were conducted in a simulated environment using the Gazebo15 and RViz16 applications.

The remainder of this paper is structured as follows: the "Related work" section reviews existing studies; "The AutoNav" section presents the development of the algorithm in detail; the "Experiments and evaluation" section introduces the experiments and validations conducted, along with the obtained results; finally, the "Conclusion" section presents concluding remarks and discusses potential future work.

Related work

Teleoperation systems for mobile robots have evolved significantly, providing valuable insights into the field. Galarza et al. (2023) discuss a virtual reality teleoperation system that enhances robot manipulation; however, it requires specialized equipment and demands constant commands from the operator, which can be a limitation in accessibility and user fatigue17. Martinelli et al. (2020) focus on a human–robot interface for remote control, demonstrating effective motion recognition through deep learning techniques. Nonetheless, this approach operates a single robot at a time and requires constant operator involvement, which can impact efficiency in scenarios involving multiple robots1. Shamshiri et al. (2024) introduce a teleoperation method that utilizes a digital shadow for path creation, which ensures a structured approach to robotic control but necessitates prior route planning, limiting adaptability in dynamic environments18.

Chen et al. (2024) propose GestureMoRo, an algorithm for autonomous mobile robot teleoperation based on gesture recognition. This system, while limited to controlling a single robot and requiring constant operator input, stands out for being tested on a real-world device19. Similarly, Pantusin et al. (2024) introduce a virtual teleoperation system focused on mobile manipulator robots for object transport and manipulation, which also necessitates specific equipment in the form of a haptic device20. Zaman and Wu (2023) explore hand gesture-based control of a Mecanum-wheeled mobile robot, but like other approaches, it controls only one robot at a time and demands a high level of operator interaction, which can reduce efficiency in prolonged operations21. These systems highlight the trade-offs between specialized hardware and operator involvement, emphasizing that while they achieve high precision and control, they may hinder scalability and accessibility in more generalized environments.

The proposed approach aims to address these limitations by introducing a teleoperation method that utilizes gesture recognition for more intuitive control. By minimizing reliance on specific equipment and enabling the simultaneous operation of multiple robots, this approach seeks to enhance efficiency and adaptability in various operational contexts.

The AutoNav

This section presents the proposed AutoNav, a multi-robot teleoperation algorithm based on autonomous navigation and gesture commands captured through a USB camera. The robots communicate over wireless connections coordinated by the Robot Operating System (ROS) through topics. The interface was implemented in the RViz application, a 3D tool for displaying robot models, sensor data, and spatial transformations in three-dimensional environments16. Two independent markers signal the position and state of the operator's hand and the desired positions of the robots. Data on the operator's hand position were extracted using MediaPipe, which received video input from a USB camera connected to a computer. Based on this input, users can send new destinations to the robots, specifying the final position and angle, with autonomous navigation executed by the ROS Navigation Stack22. Figure 1 summarizes the components and data flow of the proposed strategy.

Fig. 1 Overview of the AutoNav approach.

Fig. 2 Agents, processes, and key decision points of the proposed approach.

Figure 2 illustrates the system's interactions, with each agent represented by a dedicated lane. Information flows unidirectionally between components, showing a clear sequence of operations, and the cyclical model allows continuous information updates. The algorithm receives data on the operator's hands via MediaPipe, fed by the camera. It then communicates with the robots and RViz through ROS, enabling the operator to visualize the runtime information provided by the system and initiate a new cycle after each interaction.

The development of AutoNav prioritized ethical considerations, with a focus on ensuring the safety of human operators and bystanders throughout the teleoperation process.

Recognition and capture of hand points

Hand gesture recognition is fundamental in creating more intuitive and natural human–machine interactions23. Integrating gesture recognition methods with other techniques significantly enhances user experience, making it more dynamic and seamless. In this context, this research aims to operate the robot based on hand gesture recognition, eliminating the need for specific equipment and relying solely on a USB camera as the data input source.

The performance of hand pose recognition plays a crucial role in the overall functionality of the system. Accurate detection ensures that the robot can respond to user commands with precision1. However, inaccuracies or delays in recognizing hand gestures can lead to misinterpretations, causing unintended robot movements. Therefore, the system’s stability and reliability heavily depend on a robust and consistent recognition framework that minimizes errors and maintains smooth interaction throughout the process.

MediaPipe is an open-source library developed by Google and designed to perform precise and efficient hand tracking14, as illustrated in Fig. 3. Hand-position reading using MediaPipe is based on deep learning and computer vision techniques. The library implements a trained neural network to detect hands in an image or video frame. If detected, MediaPipe defines 21 key points for each hand, referred to as landmarks, representing various parts of the hand anatomy, such as the fingertips, joints, and palms1,24. These landmarks are then tracked during execution, allowing the system to accurately follow the position and orientation of the moving hand relative to the camera.
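
As an illustration, the following minimal Python sketch reads these landmarks from a USB camera using MediaPipe's Python solution API. The camera index and confidence thresholds are illustrative and not necessarily the configuration used in AutoNav.

```python
# Minimal sketch of hand-landmark extraction with MediaPipe Hands.
import cv2
import mediapipe as mp

mp_hands = mp.solutions.hands

cap = cv2.VideoCapture(0)  # USB camera as the only input source
with mp_hands.Hands(max_num_hands=1,
                    min_detection_confidence=0.7,
                    min_tracking_confidence=0.5) as hands:
    while cap.isOpened():
        ok, frame = cap.read()
        if not ok:
            break
        # MediaPipe expects RGB images; OpenCV captures BGR.
        results = hands.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
        if results.multi_hand_landmarks:
            # 21 landmarks per hand, with x and y normalized to [0, 1]
            # relative to the frame (z is a relative depth estimate).
            landmarks = results.multi_hand_landmarks[0].landmark
            wrist = landmarks[mp_hands.HandLandmark.WRIST]
            print(f"wrist at ({wrist.x:.2f}, {wrist.y:.2f})")
cap.release()
```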

Fig. 3 Hand point recognition with MediaPipe.

Fig. 4 Marker representation.

In the proposed method, the state of the hand, whether open or closed, is determined by evaluating the average distance of the operator's five fingers. The 'closed_hand_threshold' variable accounts for the user's distance from the camera when distinguishing open from closed hand gestures. When the average distance exceeds the threshold, the hand is classified as open; otherwise, it is classified as closed. This approach lets the system capture the moments when the hand state changes and provide visual feedback in the interface.
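
A minimal sketch of this classification rule follows. The text does not specify the reference point for the finger-distance measurement, so the wrist is assumed here; the threshold value is likewise illustrative.

```python
import math

def hand_state(landmarks, closed_hand_threshold=0.25):
    """Classify the hand as 'open' or 'closed' from MediaPipe landmarks.

    Sketch of the rule described in the text: the average distance of
    the five fingertips from an assumed reference point (the wrist) is
    compared against 'closed_hand_threshold'. Landmark indices follow
    MediaPipe's convention; the threshold value is an assumption.
    """
    wrist = landmarks[0]
    fingertips = [landmarks[i] for i in (4, 8, 12, 16, 20)]  # thumb..pinky tips
    avg = sum(math.dist((t.x, t.y), (wrist.x, wrist.y)) for t in fingertips) / 5
    return "open" if avg > closed_hand_threshold else "closed"
```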

A grid method was employed to accurately determine the hand’s location relative to the map by dividing the camera-captured image into a mesh of points. This method allows images of different sizes to be translated to any map size25. The algorithm identifies the hand’s position in the camera image, maps it to the corresponding coordinates on the map, and transmits this data for runtime visualization in the RViz interface.
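
The sketch below shows one way such a translation can be written, assuming a linear mapping and a `map_info` object mirroring the `info` field of a ROS `nav_msgs/OccupancyGrid` message; the exact mapping used by AutoNav may differ.

```python
def image_to_map(px, py, img_w, img_h, map_info):
    """Translate a pixel (px, py) in the camera image to map coordinates.

    Illustrative version of the grid method: the image is treated as a
    normalized mesh so images of any size project onto maps of any size.
    """
    u = px / img_w  # normalized position across the image, in [0, 1]
    v = py / img_h
    map_w = map_info.width * map_info.resolution   # map extent in meters
    map_h = map_info.height * map_info.resolution
    x = map_info.origin.position.x + u * map_w
    y = map_info.origin.position.y + (1.0 - v) * map_h  # image y grows downward
    return x, y
```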

Markers guide the user visually and work as follows. The hand-shaped marker serves as a cursor: it is positioned on the map at a point proportional to the operator's hand position in the camera video. The markers for the robot positions are initially set by the algorithm from the robots' current positions, but the user can move them to other points using the hand states. An amplified representation of these markers is shown in Fig. 4.
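
For illustration, a hand cursor of this kind could be published to RViz as follows; the topic name, frame, and marker shape are assumptions rather than AutoNav's actual definitions.

```python
# Sketch of publishing the hand cursor as an RViz marker (ROS 1).
import rospy
from visualization_msgs.msg import Marker

rospy.init_node("hand_cursor")
pub = rospy.Publisher("hand_marker", Marker, queue_size=1)

marker = Marker()
marker.header.frame_id = "map"
marker.type = Marker.SPHERE          # shape chosen for illustration
marker.action = Marker.ADD
marker.scale.x = marker.scale.y = marker.scale.z = 0.2
marker.color.r, marker.color.a = 1.0, 1.0

def publish_cursor(x, y):
    """Move the cursor marker to the map coordinates of the hand."""
    marker.header.stamp = rospy.Time.now()
    marker.pose.position.x = x
    marker.pose.position.y = y
    marker.pose.orientation.w = 1.0
    pub.publish(marker)
```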

When teleoperated, the markers for robot positions function as goal indicators. Once ‘released’ (when the hand carrying a marker returns to the ‘open hand’ state), the algorithm takes the marker point as the desired position and begins autonomous navigation until the desired position is reached or a new position is specified.
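
A sketch of this goal dispatch through the standard `move_base` action interface follows; the per-robot namespaces are an assumption for the multi-robot setup.

```python
# Sketch of sending a released marker pose as a navigation goal (ROS 1).
import actionlib
import rospy
from move_base_msgs.msg import MoveBaseAction, MoveBaseGoal
from tf.transformations import quaternion_from_euler

def send_goal(robot_ns, x, y, yaw):
    """Dispatch (x, y, yaw) to one robot's move_base action server.

    In practice one client per robot would be created once and cached;
    the namespace scheme (e.g. '/tb3_0') is illustrative.
    """
    client = actionlib.SimpleActionClient(robot_ns + "/move_base", MoveBaseAction)
    client.wait_for_server()
    goal = MoveBaseGoal()
    goal.target_pose.header.frame_id = "map"
    goal.target_pose.header.stamp = rospy.Time.now()
    goal.target_pose.pose.position.x = x
    goal.target_pose.pose.position.y = y
    q = quaternion_from_euler(0.0, 0.0, yaw)  # desired final angle
    goal.target_pose.pose.orientation.x = q[0]
    goal.target_pose.pose.orientation.y = q[1]
    goal.target_pose.pose.orientation.z = q[2]
    goal.target_pose.pose.orientation.w = q[3]
    client.send_goal(goal)  # navigation proceeds autonomously from here
```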

The following technique enhanced the accuracy and consistency of picking up and releasing markers. When the operator's hand is open, the action is directed exclusively toward navigation, and the robot marker remains static. Likewise, when the hand is closed, priority is given to keeping the hand as a cursor. Therefore, to simulate picking up an object, a robot marker starts following the hand only when the hand transitions from the open to the closed state; conversely, the marker is released only when the hand transitions from closed to open. This approach avoids inconsistencies in operation, such as a closed hand overlaying multiple markers and improperly triggering the pickup action, or the action being interrupted by instability in reading the hand points. This strategy provides more consistent and accurate operation, ensuring process integrity.
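
The edge-triggered behavior described above can be sketched as a small state machine; the `nearest_marker` helper and the marker object's `position` attribute are hypothetical.

```python
class MarkerGrabber:
    """Edge-triggered pick/release logic: a marker is picked up only on
    an open-to-closed transition and released only on a closed-to-open
    transition, so an already-closed hand cannot grab markers it merely
    passes over."""

    def __init__(self):
        self.prev_state = "open"
        self.held_marker = None

    def update(self, state, cursor_xy, markers, on_release):
        if self.prev_state == "open" and state == "closed":
            self.held_marker = nearest_marker(markers, cursor_xy)  # hypothetical helper
        elif self.prev_state == "closed" and state == "open":
            if self.held_marker is not None:
                on_release(self.held_marker)  # e.g. dispatch the navigation goal
                self.held_marker = None
        if self.held_marker is not None:
            self.held_marker.position = cursor_xy  # marker follows the hand
        self.prev_state = state
```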

Virtual experimentation

The TurtleBot 3 Burger was selected for this project due to its reputation as a classic and accessible mobile robot. Its affordability and compatibility with current simulators, such as Gazebo and RViz, make it a practical choice for development26. The TurtleBot 3, manufactured by Robotis, features a cylindrical shape, weighs around 1 kg, and is equipped with components such as the LDS-01 laser distance sensor, a 3-axis IMU, and a Raspberry Pi as its main SBC13. Figure 5 shows the model in both simulated and real environments.

Fig. 5 TurtleBot 3 Burger model in both simulated and real environments13.

Gazebo and RViz were chosen as the simulation tools for their strong integration with ROS and their open-source availability. Gazebo simplifies TurtleBot 3 integration26, while RViz efficiently handles sensor data visualization. Launchers were created to ensure consistent and flexible initialization of both the environment and algorithm. A detailed practical explanation of the virtual experimentation is available on AutoNav’s GitHub repository (address in code availability statement section). The setup allowed efficient simulation of maps and testing through the ROS topic-based communication system, ensuring seamless data transmission between the components27.

Navigation strategy for multi-robots

The navigation strategy plays an essential role in this study, efficiently replacing human decisions to achieve autonomous navigation for multiple robots28. This development employs the ROS Navigation Stack, a modular tool integrated into ROS, as the primary solution for attaining autonomous navigation of a fleet of robots.

Effective route planning within the ROS Navigation Stack involves intricate interactions between local and global planners29,30. Each local planner uses techniques such as the Dynamic Window Approach (DWA) to make short-term decisions based on the dynamically changing local environment and the capabilities of each robot31. Concurrently, the global planner employs algorithms such as A*32 or Dijkstra33, focusing on long-term strategies and planning trajectories that connect starting points to final destinations, disregarding future environmental changes34,35. This hybrid approach, blending short- and long-term strategies, enables the ROS Navigation Stack to generate efficient routes, adapt to unforeseen circumstances, and maximize the effectiveness of autonomous navigation for a fleet of robots. Examples of generated routes are shown in Fig. 6.
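
For intuition, the sketch below implements an illustrative A* search on a 4-connected occupancy grid. It conveys the kind of computation a global planner performs but is not the Navigation Stack's actual implementation.

```python
import heapq
from itertools import count

def a_star(grid, start, goal):
    """Illustrative A* on a 4-connected grid (0 = free, 1 = obstacle).

    Example: a_star([[0, 0], [1, 0]], (0, 0), (1, 1))
             -> [(0, 0), (0, 1), (1, 1)]
    """
    def h(p):  # Manhattan-distance heuristic
        return abs(p[0] - goal[0]) + abs(p[1] - goal[1])

    tie = count()  # tie-breaker so heap entries never compare nodes
    open_set = [(h(start), 0, next(tie), start, None)]
    came_from, g_cost = {}, {start: 0}
    while open_set:
        _, g, _, node, parent = heapq.heappop(open_set)
        if node in came_from:  # already expanded with a better cost
            continue
        came_from[node] = parent
        if node == goal:  # walk parents back to the start
            path = []
            while node is not None:
                path.append(node)
                node = came_from[node]
            return path[::-1]
        r, c = node
        for nb in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if (0 <= nb[0] < len(grid) and 0 <= nb[1] < len(grid[0])
                    and grid[nb[0]][nb[1]] == 0
                    and g + 1 < g_cost.get(nb, float("inf"))):
                g_cost[nb] = g + 1
                heapq.heappush(open_set, (g + 1 + h(nb), g + 1, next(tie), nb, node))
    return None  # no path exists
```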

Fig. 6 Interface with navigation routes autonomously generated by the algorithm.

The cost map, responsible for creating a representation of the collective environment surrounding the fleet of robots, distinguishes accessible areas from obstacles27. The dynamic cost map adapts to the evolving surroundings and is employed as a shared memory for all robots, facilitating collaborative route planning36.

Integrating hand gestures and multi-robot teleoperation

Initially, computer vision using MediaPipe takes center stage in the application by identifying and classifying hand key points. Detecting these key points forms the foundation of teleoperation, as gesture recognition makes the interface between the user and robotic models fluid and intuitive25. The algorithm collects, analyzes, and transforms these points into coordinates to represent the markers in RViz.

Fig. 7 System interface with visualization of hand point capture from the operator's camera.

Markers are created in the RViz interface to provide a dynamic graphical representation of the multi-robot cursors. These markers display the position and state of the hand reading as well as the current and desired positions of each robot. In addition, they dictate the situations in which interaction between the hand and the robots can occur. The goal marker model frees the operator's time while allowing destinations to be redefined for each robot at any moment. Furthermore, the markers enable integrated visualization of the robots and their interactions. Although each robot is tracked in Gazebo and the hand through the MediaPipe video, RViz serves as the system's primary interface, unifying this information. The operator interface used for multiple robots is shown in Fig. 7.

Considering complexity and cost, the multiple TurtleBot 3 robots were deployed and tested in a simulated environment on the Gazebo platform. This platform provides the robots with physical conditions similar to a real environment and allows detailed models to be visualized.

Experiments and evaluation

This section describes the experiments and validation conducted in this study. The validations focused on measuring operation time and surveying user preferences. In addition, data on the number of collisions during the trajectories were collected.

The developed system was compared with a conventional teleoperation model, where the operator controls the robot with a joystick, to provide a comprehensive and unbiased analysis of different robotic teleoperation contexts. The proposed and conventional models were named AutoNav and ManualNav, respectively. Various participants representing a range of profiles were included in the tests without specific selection. The operators received brief instructions on the methods used.

Two limitations should be acknowledged. Firstly, the study did not assess the operators’ prior knowledge in the studied contexts. Secondly, only a single map was utilized, potentially limiting the generalizability of findings to different environments.

Operation experiments

The first test evaluated differences in the operation time required by each method. For each operation, the algorithm recorded the total execution time and the operation time, the latter comprising the entire duration during which the user interacted with the system. From these data, the free time for each operation was calculated, which, in real-world scenarios, can be used productively for other tasks. A deliberate sample of 11 operators was selected as the study population to ensure validity. Although the choice was not random, no specific criteria regarding age or sex were predefined during selection, ensuring a diverse sample.

Two sets of tests were conducted for each operator, one for each teleoperation method, with three robots and three objectives. This allowed the investigation of the particularities of each method and the assessment of the impact of environmental differences, considering that the routes had unique characteristics.

Table 1 Results regarding the operation time tests.

The total execution time was obtained as the difference between the timestamp at which the operator signals completion and the timestamp at which the algorithm starts, thus encompassing a small time margin before and after execution1. The operation time was obtained by summing the intervals during which the operator interacted with the system. The free time for each execution is calculated by subtracting the operation time from the total execution time, as sketched below. Measured values for all operations are listed in Table 1.
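
A minimal sketch of this bookkeeping follows; the class and method names are illustrative.

```python
import time

class OperationTimer:
    """Free time = total execution time - summed interaction intervals."""

    def __init__(self):
        self.start = time.monotonic()       # algorithm start
        self.operation_time = 0.0
        self._interaction_start = None

    def interaction_began(self):
        self._interaction_start = time.monotonic()

    def interaction_ended(self):
        self.operation_time += time.monotonic() - self._interaction_start
        self._interaction_start = None

    def finish(self):
        """Operator signals completion; returns (total, operation, free)."""
        total = time.monotonic() - self.start
        return total, self.operation_time, total - self.operation_time
```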

Based on the tests, a noticeable difference in the operators' free time was observed. While operations in ManualNav consumed practically the entire execution time, operation time with AutoNav remained stable at around 23 seconds, yielding up to 94% free time for experienced users. It is important to note that free time is calculated from the user's interaction with the command model, implying a possibly greater difference in real settings due to the constant commands required by ManualNav, as illustrated in Fig. 8.

Fig. 8 Graphical analysis of operation experiments.

Moreover, the findings reveal an approximately 50% reduction in the total execution time of AutoNav compared to ManualNav. This improvement is expected because the operator moves the robot markers to the desired destinations during a pre-navigation interval, eliminating the need for decision-making during operation. Additional efficiency gains stem from the simultaneous execution of multiple operations, which cannot be achieved with conventional remote-control methods. The moderate driving maneuvers performed by the robots also contributed to this time reduction, since they followed a predefined path and maintained a safe distance from the walls during turns.

Collision experiments

For the collision count test, the system tallied the number of collisions during execution. The collision indication is obtained from RViz, which logs the sensor data and provides the distance to the nearest wall. This test ran concurrently with the operation time test to obtain realistic and practical data. A graph of the number of collisions during operation is shown in Fig. 9.
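
The sketch below illustrates one way such a count can be derived from laser scans in ROS; the topic name, distance threshold, and the rule that counts each contact episode once are assumptions.

```python
# Sketch of collision counting from laser data (ROS 1).
import rospy
from sensor_msgs.msg import LaserScan

COLLISION_DIST = 0.12  # meters; illustrative threshold near the robot body
collisions = 0
in_contact = False

def on_scan(scan):
    global collisions, in_contact
    d = min((r for r in scan.ranges if r > scan.range_min),
            default=float("inf"))
    if d < COLLISION_DIST and not in_contact:
        collisions += 1   # count each contact episode once
        in_contact = True
    elif d >= COLLISION_DIST:
        in_contact = False

rospy.init_node("collision_counter")
rospy.Subscriber("scan", LaserScan, on_scan)
rospy.spin()
```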

Fig. 9 Results regarding the quantity of collisions during operations.

A noticeable difference in the number of collisions is observed when using the two methods. With ManualNav, 21 collisions were recorded in 11 operations. In contrast, no collisions occurred when using AutoNav. This outcome was predictable because of the collision-prevention methods provided by the ROS Navigation Stack library.

Usage preference

This study verified user preference through a questionnaire administered after the operations were completed. Each operator provided only the data necessary for the experiments and indicated which method best met their expectations in four categories: comfort, practicality, precision, and efficiency.

In addition to direct and generalized responses, operators were allowed to include an analysis of the main differences perceived during usage. This approach focuses on the intrinsic details that are not perceptible through visual analysis, allowing the operator to describe their perception after use. The survey results are shown in Fig. 10.

Fig. 10 Results of the survey on operators' usage preference.

The results reveal that AutoNav outperformed ManualNav in all aspects, as illustrated in the radar chart. In this chart, each axis represents a specific criterion, and the closer a data point is to the outer edge, the better the performance in that category. Notably, 100% of the operators preferred the proposed algorithm in terms of practicality and efficiency, and 90% of participants chose AutoNav for comfort during handling. For precision, the number drops to approximately 63% because, according to some operators, being able to define the path themselves makes the conventional method more precise.

The submitted textual evaluations contain considerations related to both methods, with the advantage of the proposed algorithm over the conventional one mentioned repeatedly. The operators appreciated that the proposed algorithm allowed them to detach from the equipment and perform other tasks once the robots were sent to their destinations. A noted disadvantage, however, is the lack of constant control, which in some cases required relocating the marker to make slight adjustments to the planned path. Furthermore, an operator affected by repetitive strain injury (RSI) preferred the proposed algorithm, citing the difficulty of prolonged manual control caused by the condition.

For further insights, the evaluations can be accessed in both English and their original language in the data repository used for this study.

Overall evaluation

The evaluation of the proposed approach compares the developed system, AutoNav, with a conventional teleoperation model referred to as ManualNav, in which the operator controls the robot using a joystick. This comparison aims to provide a comprehensive and unbiased analysis across various robotic teleoperation contexts, considering the intuitive aspects of multi-robot teleoperation.

The analysis is conducted from several perspectives. The operational experiments evaluate the differences in task execution time and available free time. In collision experiments, we analyze the performance of untrained operators regarding the number of collisions during task execution. Finally, we examine usage preferences by having operators share their experiences with both approaches.

These comprehensive analyses allow us to compare the proposed AutoNav approach with the standard Manual approach, as illustrated in Table 2. The AutoNav system shows significant improvements across all metrics compared to the standard manual method.

Table 2 Overall comparison of AutoNav and ManualNav.

Conclusion

This work contributes to the research on mobile robotics with the application of autonomous navigation, focusing on the human–robot interface used for multi-robot teleoperation. An architecture that integrates autonomous navigation with point capture was selected for practical and affordable teleoperation.

With this approach, acquiring specific equipment for robot control becomes optional: similar results can be achieved at lower cost using a USB camera for image recognition.

Tests and validations were conducted to demonstrate the effectiveness of the proposed algorithm. The proposed model reduces total execution time by approximately 50% and increases the operator's free time during execution compared to conventional teleoperation models. This increase can exceed 94% of the total time on long routes, providing convenience and comfort to the operator. Moreover, the collision and time tests demonstrated the efficiency of applying autonomous navigation in the algorithm, with a 100% reduction in collisions compared to the conventional model.

The final validation was conducted with the operators from the previous procedures, each of whom received a form indicating their preferences in the four aspects mentioned. In addition, operators could attach a brief analysis of their experience with each model. The results reveal a 100% preference among operators for the proposed algorithm in terms of efficiency and practicality, and preferences of 90% and 63% for comfort and precision, respectively.

The analyses identified both the reduced effort required by the proposed algorithm to perform tasks and the precision limitations of gesture-based control. Furthermore, the application proved effective for operators with RSI because of its quick method of selecting the destination.

Consequently, the benefits of combining computer vision and autonomous navigation for a more intuitive operator experience are evident. The approach ensures the safe completion of routes and offers objective advantages for logistics processes in outdoor environments, along with considerable cost reduction by avoiding the unnecessary acquisition of equipment specific to robot operation. There are also safety benefits for the operator, who can work from a prepared environment free from external risks. Moreover, reallocating the operator's time allows them to control multiple robots simultaneously or perform other tasks while a robot completes its trajectory. Finally, reducing the collision rate per operation lowers the frequency of maintenance and replacement of the robotic equipment.

Future work

A natural next step is to transition this algorithm from simulation to the real world by implementing it on physical robots. Practical validation can provide unique insights into the effectiveness and adaptability of the algorithm under real-world conditions. An additional innovation would be to transform the control vector into a manipulable object, allowing operators to set new destinations by manipulating the physical objects carried by the robots. This approach decouples the teleoperation action from the robot itself and focuses on object control, which is relevant to the practical demands of mobile robotics in real-world situations.