Hand-like autonomous flying robot for airborne grasping and interaction

Wu, Yuze; Yang, Fan; Jin, Rui; Zhong, Yuhang; Wang, Junjie; Wu, Xuankang; Gao, Fei

doi:10.1038/s41467-026-68967-3


Article
Open access
Published: 30 January 2026


Yuze Wu^1,2,3,na1 (ORCID: 0000-0002-8894-5118),
Fan Yang^2,na1,
Rui Jin^1,2,
Yuhang Zhong^1,2,
Junjie Wang^1,2,
Xuankang Wu^2 &
Fei Gao^1,2,3 (ORCID: 0000-0002-6513-374X)

Nature Communications volume 17, Article number: 2200 (2026)


Abstract

Birds’ extraordinary aerial agility and environmental interaction enable complex tasks such as mid-air hunting, perching, and nest-building, inspiring the development of advanced aerial robots with similar manipulation capabilities. However, existing platforms often face challenges such as large size, heavy payloads, end-effector torque interference, and limited functionality, severely restricting their practical deployment. Drawing inspiration from the biological, structural, and actuation characteristics of human hands, we propose a hand-like robot that integrates flight and grasping, demonstrating the synergistic advantages of compact structure, agile flight, and versatile manipulation. We propose an autonomous framework including efficient mission planning and multi-level adaptive control, enabling the robot to precisely and smoothly perform human-like grasping, opening doors, forest perching, object transport, and interactive tasks. Additionally, the framework supports human-robot collaboration, empowering individuals with mobility impairment to conduct remote transportation and airborne operations. Outdoor tests, which include perching in various scenarios, navigating confined spaces, and transporting payloads across challenging terrain, validate the proposed vehicle’s potential in aerial delivery and manipulation tasks. These results demonstrate emerging possibilities for aerial operation, assistance, and delivery with integrated flight and manipulation abilities.


Introduction

Through evolutionary processes, birds have developed a notable ability to use their forelimbs (wings) for aerial locomotion while employing their hindlimbs (talons) for interaction, allowing complex activities such as aerial hunting, grasping, perching, and nest building^1,2. This biological characteristic has sparked considerable research interest in flying machines with manipulation capabilities^{3,4,5,6,7,8,9,10,11,12,13,14,15} that can interact with objects and humans in midair. Flying robots, as the most maneuverable robots, are widely expected to participate deeply in our social activities, especially in safety-critical scenarios such as earthquake rescue and high-risk inspections. Over the past decade, flying robots have been widely used in applications related to information acquisition, such as geographic surveying, aerial photography/videography, inspection, and monitoring. Many practical tasks, however, require physical interaction rather than observation alone. For instance, in hazardous environments such as nuclear power plants or chemical facilities, close-range interactions including valve turning and button pushing are quite common. In search-and-rescue missions, quick catch and release are vital for supply delivery or collaborative transportation. In daily life, airborne item distribution, retrieval of goods from areas beyond human reach, and even extending the reach of people with disabilities are all plausible in the homes and factories of the future. These cross-domain applications highlight the vast potential of aerial manipulation, motivating the transition of aerial robots from flying eyes to flying hands.

Existing research on aerial manipulation has made significant progress, but some fundamental limitations restrict its applicability and extensibility in real-world scenarios. Early research in this area primarily focused on directly mounting robotic arms on drones^{16,17,18,19,20,21,22}, but their large size, heavy weight, and high energy consumption severely impair maneuverability and endurance, making them unsuitable for delicate or long-duration operations, especially in confined settings such as human-involved activities. Subsequent efforts sought to address these issues by optimizing end-effector designs, including simplifying actuators^{23,24,25,26,27,28,29,30,31,32,33,34,35,36}, developing novel drive mechanisms^37,38,39,40, and introducing soft grasping components^41,42,43. Although these works excel in structural innovation, they introduce unavoidable control coupling problems, compromising the agility and stability of the robot. In response, researchers have opted to leverage the robots’ own structures for aerial manipulation: these works^{44,45,46,47,48,49,50,51} attempt to reduce system complexity by minimizing external attachments. However, these robots struggle with either low accuracy caused by extra actuators or mechanical complexity caused by movable structures, limiting their operating range, precision, and speed. These challenges underscore the need for a novel flying manipulation robot that simultaneously offers a compact design, a simplified mechanism, wide adaptability, superior agility, stability, and passability, as well as high autonomy.

In search of such an ideal flying manipulation robot, we turn to nature. Human hands exhibit dexterous interaction movements (Fig. 1a), efficiently adapting to complex environments and performing a wide range of tasks. For instance, humans grasp large objects like cups or doorknobs using the palm (Fig. 1c), and delicately pinch smaller objects such as paper or pills with fingertips (Fig. 1d). Research^52,53 shows that the bones, joints, muscles, and tendons of hands constitute a highly efficient biological structure, precisely adapting to the shape and size of objects through multi-degree-of-freedom (DOF) movements and tendon drive mechanisms. This notable architecture inspires our integrated design, which combines the dexterous grasping abilities of human hands with the swift maneuverability of aerial flight, leading to a Hand-lIke compact Aerial Robot for Manipulation, abbreviated as HI-ARM in this article (Fig. 1b). The proposed flying robot achieves delicate, multifunctional, maneuverable, and continuous aerial manipulation while being only about the size of an adult hand (see Supplementary Movie 1). HI-ARM’s design incorporates hand-like features for functional grasping, including an open C-shaped grasping contour to extend its range, a multi-DOF deformable joint structure to accommodate objects of various shapes, and a concise tendon-driven mechanism to reduce its total size and weight. As illustrated in Fig. 1b, the C-shaped grasping contour provides a hand-like enveloping geometry, significantly enhancing grasping stability and adaptability. The composite 5-DOF finger-like structure, comprising a 2-DOF torsion part and a 3-DOF extension part (Fig. 2a), enables efficient manipulation. Additionally, HI-ARM is equipped with four rotor-propellers for locomotion (Fig. 2a), inheriting the flight agility and control simplicity of a conventional quadrotor.
Thanks to the hand-like structure, HI-ARM adapts well to versatile tasks. It not only performs hand-like grasping such as palm gripping and fingertip pinching (Fig. 1e and 1f), but also executes sophisticated operations like tree perching, door opening, object transportation, and human interaction (Fig. 1g-1j).

**Fig. 1: Overview of the proposed HI-ARM.**

**Fig. 2: Hardware and software architecture.**

To accomplish complicated tasks with precision and smoothness, autonomy is also crucial for HI-ARM. To this end, we propose a framework consisting of a task planner, trajectory generation, state feedback, parameter estimation, and adaptive control (Fig. 2b). HI-ARM supports two working modes: (1) autonomous operation, where the task planner selects proper operating sequences from an action library according to input task types and the robot executes them under closed-loop perception, planning, and control (e.g., grasping, perching, door opening, and human interaction), and (2) human-robot collaboration, where the task planner generates reference trajectories and grasping commands from human intentions, and the controller subsequently tracks these commands (e.g., human-robot collaborative teleoperation). Thanks to its integrated hardware configuration, HI-ARM naturally divides its mission planning into flight trajectory planning (millisecond-level) and end-effector deformation planning (microsecond-level). This decoupled scheme significantly reduces planning complexity and can run at high frequency to meet the real-time demands of airborne reactive operation. State feedback is necessary for closed-loop autonomy; HI-ARM achieves this by using a 6-DoF state estimator to update the position and orientation of its center of gravity (COG) online, and a motor observer to monitor the angle and displacement of its end-effectors. In the real world, HI-ARM encounters notable parameter changes and non-negligible external disturbances caused by close-proximity operation, load variation, and deformation dynamics (Fig. 2c). The resulting model mismatch occurs frequently and can drastically degrade the accuracy of flight and manipulation. To address these issues, the proposed controller integrates a lightweight, high-frequency disturbance estimator along with a torque and thrust compensator. The entire algorithm architecture is shown in Fig. 2b.
With all these components integrated, the proposed HI-ARM achieves autonomous planning, accurate control, task decoupling, and human interaction, and can therefore be applied to multiple situations, as shown in Fig. 1g–j.

Contributions. Firstly, we propose a biomimetic design that integrates grasping with flight in a compact robot platform. To the best of our knowledge, this represents a pioneering flying robotic hand in the robotics community. Secondly, we propose an efficient planner for empowering the flying robotic hand with precise autonomous operations, along with an adaptive controller for stable flight with varying loads and dynamic interactions, and state feedback for closed-loop autonomy. Finally, we demonstrate the huge potential of applying HI-ARM to versatile tasks, including object grasping, door opening, pole perching, cross-terrain transportation, and continuous human-robot interactions, pushing the boundaries of aerial robots from passive observation to active manipulation. In what follows, HI-ARM successfully completes multiple continuous aerial interactive tasks quickly and smoothly, highlighting its great potential as an intelligent assistant (see Section Interaction with humans). Moreover, the robot demonstrates rapid object grasping and cross-terrain transportation, presenting an innovative solution for aerial delivery, in Section Applications in the wild. These experimental results not only validate HI-ARM’s multi-task capabilities but also lay the foundation for its application in unmanned autonomous operations, robotic household service, wilderness rescue, remote assistance, and more.

Results

Hand-like mechanism design

The design of HI-ARM draws inspiration from the grasping configuration, biological structure, and tendon drive mechanism of human hands. The employed configuration adopts a hand-like open grasping contour, providing a relatively broad grabbing range. In order to replicate dexterous grabbing capabilities while remaining mechanically efficient, the robot incorporates finger and palm modules as its core operational units (Fig. 3b(i)). With this design, HI-ARM includes a palm region for powerful gripping and fingertip areas for precise pinching (Fig. 3a(ii)). This configuration supports a wide grasping range (0–10.0 cm), allowing HI-ARM to securely hold larger objects (e.g., water bottles) with its palm while delicately picking up smaller items (e.g., tissues) using its fingertips, showcasing multi-modal grasping capabilities.

**Fig. 3: Hand-like mechanical design.**

Human grasping relies on joint angle variations to achieve contraction (Fig. 3a(iii)). Following this functional principle, HI-ARM’s finger modules incorporate a torsion structure comprising torsion springs and circular bearings (Fig. 3b(i)) to enable flexible finger bending. Given their ability to displace significantly in a short time, telescopic structures are incorporated into the deformable mechanism to increase the speed of grasping movements. To ensure that the telescopic modules are activated first, their spring stiffness is deliberately designed to be lower than that of the rotational modules, thereby enabling rapid contraction and closer contact with the target object. As illustrated in Fig. 3e, the springs in the mechanism absorb energy during compression and torsion, and then release their stored energy to restore the shape of the robot, thereby reducing overall energy consumption. The integration of telescopic and torsional mechanisms forms a hybrid 5-DoF structure, allowing a variety of adaptive and flexible grasping, as shown in Fig. 3f.

Building on the principles of the finger tendon sheath pulley system⁵³, HI-ARM employs a tendon drive mechanism for shape adaptation (Fig. 3e). Mimicking a finger’s flexor digitorum profundus tendon (FDP tendon, shown in Fig. 3a(i)), a lightweight nylon rope is employed to transmit driving forces. Several V-shaped pulleys are integrated into the inner sides of finger and palm modules to redirect the force (Fig. 3b(i)). Unlike conventional morphing drones that rely on multiple actuators, this tendon-driven design utilizes a single actuator to drive the 5-DoF composite structure, minimizing the robot’s size, weight, and energy consumption while simplifying its control complexity. Without knowing an object’s shape, this underactuated structure can conform to the object’s contour, and passively and collaboratively adjust the deformation of each torsion and extension component under single rope actuation to achieve stable grasping, demonstrating its adaptive grasping ability for various objects, as shown in Fig. 4c.

With the proposed integrated design, HI-ARM features a compact size with a hand-like profile and a total weight of just 556 g. This leaves it more room for flight and grasping operations in narrow indoor environments, which can be challenging for larger aerial robots. Additionally, the robot can deform to reduce its dimensions (Fig. 3b(ii)), improving passability in narrow spaces (as validated in Section Applications in the wild). HI-ARM defaults to the open configuration, enabling rapid execution of grasping tasks. By contrast, the closed configuration imposes continuous torque demands on the servo motor and partially obstructs propeller airflow, elevating power consumption and compromising flight endurance.

Flight design and electronic components

This section provides a detailed overview of the HI-ARM flight system, which is powered by four rotor-propellers. As shown in Fig. 2a, HI-ARM is equipped with 3.5-inch propellers, each driven by a T-motor F1404 2900KV motor, capable of generating a combined thrust of up to 1000 g. The motors are mounted beneath the finger and palm modules, and the bottom-mounted layout avoids direct propeller airflow and enhances human-robot interaction safety by minimizing hand contact. The propellers are designed with a 6 mm height offset to prevent interference during module deformation and folding.

With a 70C battery discharge rate, the system delivers sufficient instantaneous power output to handle payloads of over 450 g for grasping operations. The primary structure of the flight platform is composed of 2 mm thick carbon fiber plates, selected for their high strength and low weight. For flight control, HI-ARM uses a Kakute H7 mini flight control board running the ArduPilot flight firmware, which collects high-frequency (>300 Hz) real-time data on motor speed from the electronic speed controller (ESC) and attitude from the inertial measurement unit (IMU). High-frequency feedback helps the robot estimate its motion states quickly, reducing system delay and improving control accuracy. The onboard computer, a Radxa ZERO board, runs the mission planner and adaptive controller (introduced in the following sections) for autonomous aerial manipulation. The controller processes system state data from the IMU, the servo motor, and the localization module, then sends control inputs to the brushless motors, whose speeds are adjusted for precise flight, and to the servo motor. More details about the components can be found in Supplementary Information.

For localization and estimation, we aim to balance accuracy, weight, and onboard computational load. In indoor experiments, an external optical motion-capture system provides sub-millimeter state estimation of both the aerial robot and the manipulated objects, enabling closed-loop integration with mission planning and adaptive control for autonomous operation. In outdoor experiments, we employ the Intel RealSense T261 tracking module, which weighs 26 g and offers a lightweight alternative to LiDAR-based localization systems while maintaining stable short-range state estimation (for example, within 20 m) suitable for experimental validation.

Mission planning

To perform diverse aerial tasks, HI-ARM introduces an efficient autonomous framework that supports both autonomous operation and human-robot collaboration. As shown in Fig. 2b, we establish an action library for common tasks such as grasping, perching, door opening, and human interaction. For autonomous operation, HI-ARM dynamically assembles basic motions from the library based on the task type to generate desired operation sequences (as validated in Section Interaction with humans). For human-robot collaboration, HI-ARM generates appropriate commands for flight and grasping based on the operator’s intent. These commands are subsequently sent to the mission planner.
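The idea of assembling an operation sequence from an action library can be illustrated with a minimal sketch. The task names, the basic motions, and the `assemble_sequence` helper below are hypothetical placeholders chosen for illustration; the article does not list the actual contents of the library.

```python
# Hypothetical action library: each task type maps to an ordered list of
# basic motions. The entries are illustrative and not taken from the article.
ACTION_LIBRARY = {
    "grasp": ["approach", "close_gripper", "verify_grasp"],
    "transport": ["fly_to_goal", "open_gripper"],
    "perch": ["approach", "close_gripper", "stop_rotors"],
    "release_perch": ["start_rotors", "open_gripper", "retreat"],
}


def assemble_sequence(task_types):
    """Concatenate the basic motions of each requested task, in order."""
    sequence = []
    for task in task_types:
        if task not in ACTION_LIBRARY:
            raise KeyError(f"unknown task type: {task}")
        sequence.extend(ACTION_LIBRARY[task])
    return sequence


# Example: pick up an object, deliver it, then perch in standby.
plan = assemble_sequence(["grasp", "transport", "perch"])
print(plan)
```

Keeping the library as plain data means a new task type only requires a new entry, which is one plausible reading of how basic motions are "dynamically assembled" by task type.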

Unlike traditional aerial manipulation robots with multi-DOF robotic arms, HI-ARM’s integrated structure reduces the number of actuators, significantly decreasing the number of planning variables. Thanks to this integrated design, we decompose the mission planner into two independent modules: flight trajectory planning and end-effector deformation planning. In terms of trajectory planning, the quadrotor configuration employed by HI-ARM exhibits differential flatness⁵⁴, which allows the constraints to be simplified and the problem to be solved faster⁵⁵. To obtain an efficient, trackable trajectory for the robot, the optimization problem incorporates constraints on trajectory smoothness, total time, and dynamic feasibility. This process completes within milliseconds on onboard computing devices, meeting the real-time requirements of aerial operations. The trajectory planner outputs the desired state sequence as a function of time, as detailed in Methods.
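To make the trade-off between smoothness, total time, and dynamic feasibility concrete, the sketch below uses a single rest-to-rest quintic (minimum-jerk) segment on one axis. This is a deliberately simplified stand-in, not the optimizer described in Methods: for a differentially flat quadrotor, a sufficiently smooth position trajectory is enough to recover the remaining states, so a polynomial in position is a reasonable toy example. The 2 m travel distance is hypothetical; the 1.1 m/s speed limit borrows the maximum velocity reported in the palm-grasping experiment.

```python
def min_jerk(p0, p1, duration, t):
    """Position and velocity at time t on a rest-to-rest quintic segment."""
    tau = min(max(t / duration, 0.0), 1.0)          # normalized time in [0, 1]
    s = 10 * tau**3 - 15 * tau**4 + 6 * tau**5      # smooth 0 -> 1 blend
    ds = (30 * tau**2 - 60 * tau**3 + 30 * tau**4) / duration
    return p0 + (p1 - p0) * s, (p1 - p0) * ds


def shortest_duration(distance, v_max):
    """Smallest duration that keeps the peak speed within v_max.

    The peak speed of this profile occurs at mid-segment and equals
    1.875 * distance / duration.
    """
    return 1.875 * abs(distance) / v_max


p0, p1, v_max = 0.0, 2.0, 1.1        # 2.0 m is a hypothetical travel distance
T = shortest_duration(p1 - p0, v_max)
samples = [min_jerk(p0, p1, T, k * T / 20) for k in range(21)]
peak_speed = max(abs(v) for _, v in samples)
print(f"duration = {T:.3f} s, peak speed = {peak_speed:.3f} m/s")
```

Choosing the duration from the velocity limit is the simplest form of a time and feasibility trade-off; the planner in the article additionally optimizes smoothness and handles multiple constraints.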

For end-effector deformation planning, HI-ARM utilizes a single-motor actuation strategy and a tendon drive mechanism to achieve structural deformation. We establish a univariate quadratic mapping model between motor angles and morphing states, with additional details available in Methods. Given the desired deformation state, HI-ARM rapidly generates a time-dependent deformation sequence in microseconds based on the above mapping model. Then we synchronize this sequence with the time series of the flight trajectory, and send them to the adaptive controller, as shown in Fig. 2b.
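A univariate quadratic mapping of this kind can be sketched as follows. The coefficients `A`, `B`, `C`, the servo travel `THETA_MAX`, and the choice of grasp opening width as the morphing state are all assumptions made for illustration; the identified model is given in Methods. The sketch inverts the quadratic to obtain a motor angle for a desired opening, then samples a deformation sequence on the time stamps of a flight trajectory so that both can be sent to the controller together.

```python
import math

# Hypothetical model: opening width (cm) as a quadratic in motor angle (deg).
# Chosen so that width(0) = 10.0 cm (fully open) and width(100) = 0 cm.
A, B, C = -0.0005, -0.05, 10.0
THETA_MAX = 100.0  # hypothetical servo travel, degrees


def width_from_angle(theta):
    """Forward map: motor angle -> opening width."""
    return A * theta**2 + B * theta + C


def angle_from_width(width):
    """Inverse map: pick the quadratic root that lies within servo travel."""
    disc = B**2 - 4 * A * (C - width)
    if disc < 0:
        raise ValueError("width outside the model's range")
    roots = [(-B + sign * math.sqrt(disc)) / (2 * A) for sign in (1, -1)]
    for theta in roots:
        if -1e-9 <= theta <= THETA_MAX + 1e-9:
            return min(max(theta, 0.0), THETA_MAX)
    raise ValueError("no admissible motor angle")


def deformation_sequence(w_start, w_goal, timestamps):
    """Linearly blend the width over the given time stamps, map to angles."""
    t0, t1 = timestamps[0], timestamps[-1]
    sequence = []
    for t in timestamps:
        alpha = (t - t0) / (t1 - t0)
        width = w_start + alpha * (w_goal - w_start)
        sequence.append((t, angle_from_width(width)))
    return sequence


# Close from fully open to a 6.2 cm opening (the bottle diameter in the text).
timestamps = [0.0, 0.1, 0.2, 0.3, 0.4, 0.5]
seq = deformation_sequence(10.0, 6.2, timestamps)
```

Because the inverse is a closed-form expression, generating the whole sequence involves only a handful of arithmetic operations per sample, which is consistent with the very short planning times quoted for this module.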

Multi-level adaptive control

An accurate, adaptive, and efficient controller is essential for smooth aerial manipulation. Compared to mature controllers for traditional quadrotors, the flight control system of HI-ARM faces greater challenges. These challenges mainly arise from two aspects (Fig. 2c(iv)): first, the model parameters (such as size, shape, COG, and inertia) dynamically change due to body deformation, which significantly disturbs flight control; second, during airborne operations, external factors such as load variations, close-range interaction forces, and air disturbances severely affect the stability and accuracy of the robot’s control. In particular, to increase the speed of aerial operations, HI-ARM needs to deform while flying to complete tasks, imposing higher requirements on both flight control and deformation control.

To address these challenges, we propose a multi-level adaptive controller (Fig. 2b), endowing HI-ARM with precise aerial operation capabilities. To mitigate the negative effects of model variations, the robot incorporates an online model parameter identification approach to estimate key physical parameters, such as COG and inertia, as illustrated in Fig. 2b and detailed further in Supplementary Information. For external disturbances, the system introduces an estimation and compensation method for external forces and torque disturbances to reduce their impact on control. Specifically, the $\mathcal{L}_1$ adaptive control algorithm⁵⁶ is employed in the position loop to mitigate external forces, while the incremental nonlinear dynamic inversion (INDI) control algorithm⁵⁷ is used in the attitude loop to handle external torques. These adaptive algorithms also reduce the negative impact of factors that affect trajectory tracking performance, such as propeller airflow interference and model mismatch⁵⁸.
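The estimate-then-compensate principle can be illustrated with a one-dimensional toy model. The sketch below is not the $\mathcal{L}_1$ or INDI algorithm; it swaps in a plain first-order low-pass disturbance observer on the vertical axis, which shares only the basic idea of inferring the unmodeled force from measured acceleration and subtracting it from the thrust command. The loop rate, filter gain, and PD gains are hypothetical; the robot mass (556 g) and payload mass (153 g) come from the text.

```python
G = 9.81            # gravitational acceleration, m/s^2
MASS = 0.556        # robot mass from the text, kg
PAYLOAD = 0.153     # water bottle mass from the palm-grasping test, kg
DT = 0.002          # 500 Hz loop (hypothetical rate)
ALPHA = 0.1         # low-pass gain of the observer (hypothetical)
KP, KD = 20.0, 8.0  # PD gains on height (hypothetical)


def simulate(duration=3.0, grasp_time=1.0, z_ref=1.0):
    """Hover at z_ref; a payload is attached at grasp_time."""
    z, v, d_hat = z_ref, 0.0, 0.0
    thrust = MASS * G
    for k in range(int(duration / DT)):
        t = k * DT
        # True external force: the payload weight appears after the grasp.
        d_true = -PAYLOAD * G if t >= grasp_time else 0.0
        accel = (thrust - MASS * G + d_true) / MASS
        v += accel * DT
        z += v * DT
        # Observer: the force not explained by the nominal model, filtered.
        d_raw = MASS * accel - thrust + MASS * G
        d_hat += ALPHA * (d_raw - d_hat)
        # PD height control plus compensation of the estimated disturbance.
        a_cmd = KP * (z_ref - z) - KD * v
        thrust = MASS * (G + a_cmd) - d_hat
    return z, d_hat


z_final, d_hat_final = simulate()
print(f"estimated disturbance = {d_hat_final:.3f} N, final height = {z_final:.4f} m")
```

In this toy setting the estimate settles at the payload weight of about 1.5 N and the height error decays, mirroring the qualitative behavior reported for the grasping experiments.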

In terms of deformation control, we implement a feedback control method based on angular errors to regulate the servo motor’s motion. The servo motor provides torque state feedback, which helps determine the success of the gripping action. Through merging the above-mentioned grasping and flight control, HI-ARM can quickly and efficiently perform operations such as grasping, perching, and interaction, which are verified in the following experiments. More details of the control and modeling can be found in Methods.
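The combination of angular-error feedback and torque-based grasp detection can be sketched with a simulated servo. Everything numeric here is hypothetical (gain, contact stiffness, torque limit, angles), and the linear contact model is an assumption made so the example is self-contained; only the logic, closing under error feedback and stopping once a set torque is reached, follows the description in the text.

```python
def close_until_torque(theta_target, theta_contact, torque_limit,
                       kp=5.0, stiffness=0.02, dt=0.01, max_steps=2000):
    """Drive a simulated servo toward theta_target (degrees).

    Contact torque is modeled as stiffness * penetration beyond
    theta_contact (hypothetical linear model). Returns
    (final angle, final torque, grasp_detected).
    """
    theta = 0.0
    torque = 0.0
    for _ in range(max_steps):
        torque = stiffness * max(0.0, theta - theta_contact)
        if torque >= torque_limit:
            return theta, torque, True       # set torque reached: grasp done
        error = theta_target - theta
        if abs(error) < 1e-3:
            return theta, torque, False      # fully closed without contact
        theta += kp * error * dt             # proportional angular feedback
    return theta, torque, False


# An object is contacted at 50 degrees; the servo stops shortly afterwards.
angle_hit, torque_hit, grasped = close_until_torque(90.0, 50.0, 0.1)
# No object in the way: the servo reaches its target and reports no grasp.
angle_free, torque_free, grasped_free = close_until_torque(90.0, 200.0, 0.1)
```

Using the torque reading as the success signal means the same routine works for objects of different sizes, since the stopping angle is determined by where contact occurs rather than by a preset value.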

Hand-like grasping performance

The proposed biomimetic design endows HI-ARM with versatile gripping mechanisms that facilitate palm, fingertip, and adaptive grasping (see Supplementary Movie 2). These capabilities are demonstrated in experiments, as described below.

Palm grasping

Humans typically grasp larger objects (e.g., water bottles or oranges) using their palms, and HI-ARM exhibits a similar ability. We conduct an autonomous grasping experiment in which HI-ARM grips a water bottle (diameter: 62 mm, mass: 153 g) and transports it to a destination (Fig. 4a). The position tracking curves shown in Fig. 4b indicate that HI-ARM tracks the reference trajectory closely at a maximum velocity of 1.1 m s⁻¹. Our proposed controller can accurately estimate external force and torque disturbances in real time (Fig. 4b) and effectively compensate for them. As shown in Fig. 4b, after correcting for thrust loss, the robot’s actual total thrust increases by about 1.5 N upon grasping, closely matching the object’s weight. During gripping, once the set torque has been reached, the grasping action is considered complete and the servo motor stops rotating.
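The reported thrust increase can be checked against the bottle's weight with a short calculation. The only value not taken from the text is the standard gravitational acceleration of 9.81 m/s², which is an assumption about the value used.

```python
G = 9.81                          # m/s^2, assumed standard gravity
bottle_mass_kg = 0.153            # bottle mass reported in the text
reported_thrust_increase_n = 1.5  # approximate value read from the text

weight_n = bottle_mass_kg * G
relative_gap = abs(weight_n - reported_thrust_increase_n) / weight_n
print(f"bottle weight = {weight_n:.3f} N, relative gap = {relative_gap:.2%}")
```

The bottle weighs about 1.50 N, so the estimated thrust increase agrees with the added load to within a fraction of a percent at the precision quoted.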

Fingertip grasping

Besides palm grasping, fingertip grasping is also a common operation for human hands, especially for picking up small objects. In this experiment, HI-ARM is commanded to grasp napkins. As shown in Fig. 4d, napkins are thin (less than 1 mm), soft, and light (less than 1 g), and thus pose unique challenges for accurate automated grasping. Fig. 4d demonstrates that HI-ARM grasps a single napkin with fingertip precision. During tissue extraction, HI-ARM encounters disturbances caused by friction, which are effectively compensated by the controller, resulting in a control error of less than 3 cm.

Adaptive grasping

HI-ARM’s 5-DoF grasping structure can deform flexibly, and its under-actuated design allows it to conform to the shape of objects upon contact. This feature enables the system to adaptively grasp objects without prior knowledge of their precise contours and shapes. We test HI-ARM on common objects with various shapes and sizes, proving that it can conform to the surface of objects ranging from small to big, and grip and carry them effectively (Figs. 4c and e). To increase the difficulty of grasping, we also introduce irregular letter-shaped blocks. As shown in Fig. 4c, HI-ARM is capable of deforming asymmetrically to fit the shape of these objects, holding them accurately and stably.

Human-like applications

HI-ARM, with the above biomimetic features, can perform more human-like applications (see Supplementary Movie 3).

Perching

Similar to the way humans use handrails on trains or grasp tree trunks (Fig. 5a), HI-ARM can utilize its specialized structure to perch on fixed objects. To evaluate the perching capability, we set up a tree trunk as the test object. In this experiment, the flying robot autonomously flies toward the tree and accurately grasps and perches on it (Fig. 5b). As shown in Fig. 5c, all motors cease rotation, the system’s weight is counterbalanced by the friction generated by HI-ARM’s grip on the tree trunk, and the energy consumption of the system is significantly reduced (the servo motor consumes two orders of magnitude less energy than the rotor-propellers, as shown in Fig. 5e). By contrast, conventional hovering consumes over 160 W. After a while, the drone receives a release command and the propellers begin to rotate again. The servo motor then drives the vehicle to expand, gradually disengaging from the tree as HI-ARM re-enters flight mode (Fig. 5d). This perch-and-relaunch mechanism allows for a break between consecutive missions and supports long-stay missions with low energy consumption.
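A rough energy comparison illustrates why perching suits long-stay missions. The sketch below reads "two orders of magnitude less" as a factor of 100 applied to the 160 W hovering figure, and uses a hypothetical 10-minute standby; both are assumptions, and the measured values are those in Fig. 5e.

```python
hover_power_w = 160.0                   # lower bound for hovering from the text
servo_power_w = hover_power_w / 100.0   # assumed: factor of 100 below hovering
standby_s = 10 * 60                     # hypothetical standby duration, seconds

hover_energy_wh = hover_power_w * standby_s / 3600.0
perch_energy_wh = servo_power_w * standby_s / 3600.0
print(f"hovering: {hover_energy_wh:.2f} Wh, perching: {perch_energy_wh:.2f} Wh")
```

Under these assumptions, ten minutes of hovering costs roughly 27 Wh while perching costs well under 1 Wh.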

Door opening

Thanks to its operational capability, HI-ARM can even be used to open a door. In this experiment, pushing the door open introduces significant disturbances to the control of the drone. To ensure smooth door opening (Fig. 5h), we employ the mission planner to generate an appropriate reference trajectory that accounts for dynamic constraints (more details can be found in Methods), enabling the drone to accurately grasp the door handle and open the door, as shown in Fig. 5g. External force estimations suggest that the door mainly exerts disturbing forces on the robot along the X and Y axes (Fig. 5j), which are compensated by the proposed controller. As illustrated in Fig. 5h, when the robot pushes the door while grasping the handle, the door-opening angle $\theta_{\mathrm{door}}$ is ~30°, and the maximum pushing force reaches about $T_{\max}\sin 30^{\circ} \approx 5\,\mathrm{N}$. Without the constraint of the handle, the maximum achievable door-opening angle $\theta_{\max}$ increases to 55°, with a corresponding maximum pushing force of 8.2 N.
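The two pushing-force figures are consistent with the same relation, force equals maximum thrust times the sine of the angle. The sketch below infers the maximum thrust from the first figure and applies it to the second angle; the value of about 10 N for the maximum thrust is an inference from the rounded 5 N figure, not a number stated in the text, although it is close to the 1000 g combined thrust quoted for the motors.

```python
import math

# Infer T_max from the reported ~5 N at 30 degrees (inferred, not stated).
t_max_n = 5.0 / math.sin(math.radians(30.0))

# Apply the same relation at the 55-degree case reported in the text.
force_55_n = t_max_n * math.sin(math.radians(55.0))
print(f"T_max = {t_max_n:.2f} N, force at 55 deg = {force_55_n:.2f} N")
```

The result is about 8.2 N, matching the maximum pushing force reported for the unconstrained case.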

Interaction with humans

The compact size of HI-ARM enables it to adapt to spatially constrained environments in typical households, offering autonomous robotic services with great potential as an intelligent home assistant. We design a series of experiments that simulate daily scenarios to evaluate the feasibility of deploying HI-ARM in domestic environments (Fig. 6a). In this experiment, the positions and orientations of objects are acquired using a motion capture system, while the flying robot is required to sequentially perform multiple tasks involving human-robot interaction. Figure 6b illustrates the continuous and smooth flight reference trajectory for multi-task operations generated by the aforementioned mission planner. Upon the arrival of the person, HI-ARM approaches the door, performs a fingertip grasp on an express box held by the person, and delivers it to a storage bin (Fig. 6a ⓪①). The drone then flies to a table, uses a palm grasp to pick up a bottle of water, and hands it to the person (Fig. 6a ②③). While the person is drinking, the flying robot retrieves a boxed snack from the table (Fig. 6a ④⑤). Once the person finishes drinking, the empty bottle is handed to HI-ARM, which then deposits it into a trash can (Fig. 6a ⑥⑦). After completing these services, HI-ARM flies to the coat rack and perches on it, entering standby mode (Fig. 6a ⑧). As shown in Fig. 6d, HI-ARM can adaptively grasp different objects by adjusting the actuator’s angle accordingly. During this experiment, HI-ARM encounters perturbation errors at each operation point (Fig. 6c), primarily caused by load variations introduced by the objects. Thanks to the multi-level adaptive controller, HI-ARM is able to accurately estimate external force interference (Fig. 6d) and effectively compensate for the disturbances, gradually reducing errors. As shown in Fig. 6c, the mean absolute tracking error (ATE) of the single-axis trajectory during the experiment is less than 2 cm.
This coherent process demonstrates HI-ARM’s notable ability to adaptively grasp different objects and move flexibly in domestic scenarios.

**Fig. 6: Continuous multi-task interactions with human.**

Applications in the wild

We also present several outdoor experiments to confirm that HI-ARM can be applied in natural environments. First, we test the perching capability of HI-ARM on a variety of objects in outdoor environments, such as bamboo, different trees, and electric poles. As shown in Fig. 7a, HI-ARM successfully grasps and remains perched in various scenes without external structures, showing great potential for applications in the wild where long stays are necessary. In the second experiment, we test the passability of HI-ARM by using its shrinking ability in a narrow cave. As shown in Fig. 7b, HI-ARM deforms to reduce its width and successfully passes through a narrow space. Moreover, we demonstrate that the proposed flying robot may offer a new solution for aerial delivery, addressing the growing demand for flexible and rapid daily logistics. Owing to its compact and simple structure and high mobility, HI-ARM can easily grasp various objects and perform swift aerial transportation. In this experiment, we give HI-ARM a bottle of water that needs to be delivered across a river. As shown in Fig. 7c, HI-ARM successfully grips a drink cup using its deformable mechanism and carries it to the other side of the river.

Teleoperation for human-robot collaboration

As a bio-inspired aerial interaction device, HI-ARM can function as a third flying hand for humans, responding to human intentions. As shown in Fig. 8a, HI-ARM is equipped with a remote video transmission system that provides first-person-view (FPV) visual feedback. Similar to DJI’s Avata FPV drone (specifications: https://www.dji.com/cn/avata-2), HI-ARM can be operated with a simplified, single-handed motion-based 3D controller, which maps hand movements to velocity commands in different directions. The controller also integrates a grasp button, allowing both flight and grasping actions to be performed with one hand. As shown in Fig. 8a, the communication between the controller and the robot is established via the ROS framework. As described in Section Mission planning, these commands are then processed by the mission planner to generate flight and deformation inputs for the controller, enabling the robot to execute aerial tasks. The following two experiments demonstrate HI-ARM’s potential in human-assisted remote operations (see Supplementary Movie 4).

**Fig. 8: Human-robot collaborative remote aerial operation.**

For individuals with limited mobility, retrieving items from different locations can be difficult or impossible, particularly in rugged terrain, areas with stairs or steps, or regions with significant height variations. In this experiment, we invite a participant with mobility impairment to operate HI-ARM while wearing video glasses that display the onboard perspective. After a brief tutorial, the user is able to control HI-ARM to complete the object retrieval task from a distance according to his/her intention. As shown in Fig. 8b, HI-ARM takes off from the second floor, navigates through trees and shrubs, and reaches the target position near the ground. It precisely grasps the target object, a cup of coffee, and successfully returns to the participant’s side, demonstrating HI-ARM’s potential in assisting individuals with disabilities. In this task, the 46.2 m total retrieval trajectory is completed at an average velocity of 0.33 m/s, with an end-effector ATE of 0.08 m and a control latency of 256 ms, highlighting the stability and effectiveness of HI-ARM’s teleoperation over distances exceeding 40 m.

Even for individuals without mobility impairments, reaching objects in high places can be difficult. This experiment simulates a badminton shuttlecock stuck in a tree, where the operator remotely assesses its position via onboard imagery. Subsequently, the operator maneuvers HI-ARM to fly to the tree, grasp the shuttlecock with its fingertips, and smoothly return to the ground (Fig. 8c), demonstrating its potential for remote airborne operations. In this scenario, the trajectory measures 15.4 m, with an average velocity of 0.10 m/s and an end-effector ATE of 0.04 m. The lower speed compared to the coffee task reflects the higher precision required for grasping lightweight objects.

Discussion

In this study, we develop an innovative hand-like compact flying robot for manipulation, which integrates biomimetic grasping and aerial flying capabilities. We propose an efficient autonomous framework that enables the robot to perform precise and smooth aerial manipulation in real-world environments. The proposed robot demonstrates notable performance across indoor and outdoor tasks, showing considerable potential as future smart home assistants, moving flying cameras, aerial teleoperation tools and flying delivery robots. However, current state estimation depends on external localization, and the onboard visual localization module exhibits cumulative drift during long-range outdoor tasks. Achieving full autonomy requires not only accurate localization but also the ability to interpret the environment from visual inputs. End-to-end visual reinforcement learning approaches^59,60, which directly map perception to control while adaptively correcting state estimation errors, offer a promising route to enhance system autonomy in the future.

We also provide a theoretical analysis of the system control stability, with detailed information provided in Supplementary Information. The experimental results align with and validate this analysis. In addition, we conduct a robustness analysis of the proposed controller, followed by experiments in different scenarios to assess its performance under external disturbances. The results demonstrate that HI-ARM effectively estimates external interference, with the adaptive controller’s error gradually converging, ultimately restoring the system to a stable hovering state.

In future work, we plan to integrate force feedback into HI-ARM to enhance its ability to grasp fragile objects, such as eggs. Additionally, we aim to develop a new type of inner gripping surface to increase friction on smooth objects. We also plan to incorporate a multi-modal foundation model to enhance the cognitive ability of HI-ARM, enabling it to perform more sophisticated tasks, such as autonomous valve operation in industrial scenarios. Notably, human-robot collaboration demonstrates HI-ARM’s teleoperation capabilities, enabling the acquisition of high-quality real-world data to train models with minimized sim-to-real gaps. Furthermore, building on our previous research on aerial swarms⁶¹, we intend to develop a team of HI-ARM robots that can execute much more complicated tasks, such as collaborative transportation. We will continue our in-depth research into the miniaturization and autonomy of the proposed flying robot under limited onboard resources, and push it toward industrialization.

Methods

Dynamics

In this study, we employ bold lowercase letters to denote vectors (e.g., v) and bold uppercase letters for matrices (e.g., J). Scalars are set in regular (non-bold) type. As shown in Fig. 2c(iii), we utilize a world frame W with an orthonormal basis {x_W, y_W, z_W}, where z_W is oriented upward, counter to the direction of gravity. The body frame B, situated at the geometric center of the proposed vehicle, is defined with an orthonormal basis {x_B, y_B, z_B}. Let the position of the vehicle in the world frame be represented by ${{{\bf{p}}}}_{W}={({p}_{x},{p}_{y},{p}_{z})}^{\top }$, its orientation by the quaternion ${{{\bf{q}}}}_{W}={({q}_{w},{q}_{x},{q}_{y},{q}_{z})}^{\top }$, and its linear velocity by ${{{\bf{v}}}}_{W}={({v}_{x},{v}_{y},{v}_{z})}^{\top }$. Additionally, the angular velocity of the vehicle, expressed in the body frame, is given by ${\omega }_{B}={({\omega }_{x},{\omega }_{y},{\omega }_{z})}^{\top }$.

The HI-ARM flight system is composed of four parallel rotor-propellers, whose layout approximates the ’X’ configuration of a conventional quadrotor. Consequently, its flight dynamics model is similar to that of traditional quadrotors, and the device possesses the same differential flatness property⁵⁴ as general quadrotors. We utilize 6 DoFs to describe the ideal rigid-body kinematics and dynamics model of the robot. For the translational dynamics component, the following equations are utilized:

$${\dot{{\bf{p}}}}_{W}=\, {{\bf{v}}}_{W},\\ {\dot{{\bf{v}}}}_{W}=\, T{{\bf{z}}}_{B}/m+{\bf{g}},$$

(1)

where T and m are the collective thrust and total mass, respectively; z_B is the Z axis of the body frame expressed in the world frame; g = [0, 0, −g]^⊤ is the gravitational vector.

The rotational kinematic and dynamic equations are expressed as

$${\dot{{\bf{q}}}}_{W}=\, \frac{1}{2}{\left[\begin{array}{c}0\\ {\omega }_{B}\end{array}\right]}_{\times }\cdot {{\bf{q}}}_{W},\\ {\dot{\omega }}_{B}=\, {{\bf{J}}}^{-1}(\tau -{\omega }_{B}\times {\bf{J}}{\omega }_{B}),$$

(2)

where [⋅]_× is the skew-symmetric matrix; τ and J are the total torque and inertia tensor matrix, respectively, both of which are dynamically updated during morphing and grasping, with more details provided in Supplementary Information.
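As an illustration of Eqs. (1) and (2), the following minimal sketch evaluates the linear and angular accelerations for a given state. It assumes a scalar-first unit quaternion (w, x, y, z), a diagonal inertia tensor, and g = 9.81 m/s²; the function names and numerical values are hypothetical and are not taken from the released code.

```python
# Sketch of the right-hand sides of Eqs. (1) and (2).
# Assumptions (not from the paper): scalar-first unit quaternion,
# diagonal inertia tensor, g = 9.81 m/s^2.

GRAVITY = 9.81  # m/s^2, assumed magnitude of g


def body_z_axis(q):
    """Return z_B expressed in the world frame for q = (w, x, y, z)."""
    w, x, y, z = q
    # Third column of the rotation matrix built from a unit quaternion.
    return (2.0 * (x * z + w * y),
            2.0 * (y * z - w * x),
            1.0 - 2.0 * (x * x + y * y))


def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def linear_acceleration(thrust, mass, q):
    """Eq. (1): v_dot = T * z_B / m + g, with g = [0, 0, -g]."""
    zb = body_z_axis(q)
    return (thrust * zb[0] / mass,
            thrust * zb[1] / mass,
            thrust * zb[2] / mass - GRAVITY)


def angular_acceleration(inertia_diag, torque, omega):
    """Eq. (2): omega_dot = J^{-1} (tau - omega x J omega), J diagonal."""
    j_omega = tuple(j * w for j, w in zip(inertia_diag, omega))
    gyro = cross(omega, j_omega)
    return tuple((t - g) / j for t, g, j in zip(torque, gyro, inertia_diag))
```

With the identity quaternion and T = mg, `linear_acceleration` returns (approximately) zero, which corresponds to hover. During morphing, the inertia values passed to `angular_acceleration` would have to be refreshed at each step, as the text notes for J.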

Let k_t and k_c denote the thrust coefficient and torque coefficient, respectively, associated with the j-th motor. The rotational speed of the j-th motor is represented by Ω_j, and its position within the body frame is given by ${{{\bf{l}}}}_{j}={[{l}_{{x}_{j}},{l}_{{y}_{j}},{l}_{{z}_{j}}]}^{\top }$. The collective thrust T and the torque τ generated by the actuators are expressed as follows:

$$\left[\begin{array}{c}T\\ \tau \\ \end{array}\right]={{{\bf{H}}}}_{k}{{\bf{t}}},$$

(3)

where ${{\bf{t}}}={[{k}_{t}{\Omega }_{1}^{2},{k}_{t}{\Omega }_{2}^{2},{k}_{t}{\Omega }_{3}^{2},{k}_{t}{\Omega }_{4}^{2}]}^{\top }$ represents the thrust generated by each rotor, and H_k is the mixed control matrix (MCM), which is time-variant while the vehicle is morphing. Assuming that the center of gravity is located at ${{{\bf{r}}}}_{COG}={({r}_{x},{r}_{y},{r}_{z})}^{\top }$ at this time, H_k is given by:

$${{{\bf{H}}}}_{k}=\left[\begin{array}{cccc}1 & 1 & 1 & 1\\ {r}_{y}+{l}_{{y}_{1}} & {r}_{y}+{l}_{{y}_{2}} & {r}_{y}+{l}_{{y}_{3}} & {r}_{y}+{l}_{{y}_{4}}\\ {r}_{x}-{l}_{{x}_{1}} & {r}_{x}-{l}_{{x}_{2}} & {r}_{x}-{l}_{{x}_{3}} & {r}_{x}-{l}_{{x}_{4}}\\ -{k}_{c}/{k}_{t} & {k}_{c}/{k}_{t} & -{k}_{c}/{k}_{t} & {k}_{c}/{k}_{t}\\ \end{array}\right].$$

(4)

The module positions ${{{\bf{r}}}}_{mo{d}_{i}}$ and propeller coordinates $({l}_{{x}_{i}},{l}_{{y}_{i}})$ are not fixed; instead, they are dynamically updated during morphing and flight. Specifically, these parameters are derived from the tendon-driven morphing kinematics: the servo motor angle θ_a and rope displacement L_t determine the deformation of both the telescopic and torsional mechanisms, which in turn define the instantaneous propeller positions. As illustrated in Fig. 2c, ${L}_{i}^{{\prime} }$ and L_i denote the lengths of the telescopic mechanism during motion and under nominal conditions, respectively, with ${L}_{i}^{{\prime} }-{L}_{i}$ representing the compression displacement and reflecting variations in the distances between adjacent modules. As shown in Fig. 2c(ii), θ_j denotes the rotation angle of the j-th torsional mechanism (comprising a torsion spring and a circumferential bearing). Accordingly, the propeller coordinates (${l}_{{x}_{i}},{l}_{{y}_{i}}$) are updated to (${l}_{{x}_{i}}^{{\prime} },{l}_{{y}_{i}}^{{\prime} }$) as follows:

$${l}_{{x}_{1}}^{{\prime} }=\, {l}_{{x}_{1}}-({L}_{i}^{{\prime} }-{L}_{i})/2-{l}_{0}(1-\cos {\theta }_{1})\\ {l}_{{x}_{2}}^{{\prime} }=\, {l}_{{x}_{2}}+({L}_{i}^{{\prime} }-{L}_{i})/2\\ {l}_{{x}_{3}}^{{\prime} }=\, {l}_{{x}_{3}}+({L}_{i}^{{\prime} }-{L}_{i})/2\\ {l}_{{x}_{4}}^{{\prime} }=\, {l}_{{x}_{4}}-({L}_{i}^{{\prime} }-{L}_{i})/2-{l}_{0}(1-\cos {\theta }_{2})\\ {l}_{{y}_{1}}^{{\prime} }=\, {l}_{{y}_{1}}+({L}_{i}^{{\prime} }-{L}_{i})/2+{l}_{0}(1-\sin {\theta }_{1})\\ {l}_{{y}_{2}}^{{\prime} }=\, {l}_{{y}_{2}}+({L}_{i}^{{\prime} }-{L}_{i})/2\\ {l}_{{y}_{3}}^{{\prime} }=\, {l}_{{y}_{3}}-({L}_{i}^{{\prime} }-{L}_{i})/2\\ {l}_{{y}_{4}}^{{\prime} }=\, {l}_{{y}_{4}}-({L}_{i}^{{\prime} }-{L}_{i})/2-{l}_{0}(1-\sin {\theta }_{2})$$

(5)

As a consequence, the control matrix H_k is updated to ${{{\bf{H}}}}_{k}^{{\prime} }$ online according to the current morphing state. This formulation explicitly incorporates geometry changes induced by morphing into the control layer, rather than treating them as static.
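To make the role of the time-variant control matrix concrete, the sketch below builds H_k row by row exactly as printed in Eq. (4) and solves Eq. (3) for the per-rotor thrusts. The rotor layout, the ratio k_c/k_t, and all function names are hypothetical illustration values; the small linear solver is included only to keep the example self-contained.

```python
# Sketch: rebuild H_k from the current rotor coordinates (Eq. 4) and
# invert Eq. (3) to obtain per-rotor thrusts. Layout values are assumed.


def control_matrix(rotor_xy, r_cog, kc_over_kt):
    """Build H_k as printed in Eq. (4) for four rotors."""
    rx, ry = r_cog
    yaw_signs = (-1.0, 1.0, -1.0, 1.0)  # sign pattern of the last row
    return [
        [1.0 for _ in rotor_xy],
        [ry + ly for _, ly in rotor_xy],
        [rx - lx for lx, _ in rotor_xy],
        [s * kc_over_kt for s in yaw_signs],
    ]


def solve_linear(a, b):
    """Gauss-Jordan elimination with partial pivoting (small systems)."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("control matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def allocate(rotor_xy, r_cog, kc_over_kt, thrust, torque):
    """Solve [T, tau]^T = H_k t for the rotor thrust vector t."""
    h = control_matrix(rotor_xy, r_cog, kc_over_kt)
    return solve_linear(h, [thrust, *torque])


# Hypothetical nominal and contracted layouts (metres), rotors 1..4.
nominal = [(-0.10, 0.10), (0.10, 0.10), (0.10, -0.10), (-0.10, -0.10)]
contracted = [(-0.05, 0.05), (0.05, 0.05), (0.05, -0.05), (-0.05, -0.05)]
t_nominal = allocate(nominal, (0.0, 0.0), 0.01, 4.0, (0.04, 0.0, 0.0))
t_contracted = allocate(contracted, (0.0, 0.0), 0.01, 4.0, (0.04, 0.0, 0.0))
```

In this toy layout the same roll torque requires twice the thrust differential once the arms are halved, which is why a static H_k would mis-allocate thrust during morphing.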

HI-ARM utilizes the tendon drive mechanism powered by a single servo motor to actuate deformation. This system enables dynamic and precise structural deformation by transmitting force through a lightweight rope. To facilitate accurate control of morphing movements, we model the deformation structure. As the servo motor’s rotary disk rotates, the rope is guided through pulleys, driving coordinated deformations of both passive telescopic and rotational mechanisms, thus enabling body contraction. This study further establishes deformation dynamics equations, providing an effective motion control strategy for the morphing process.

Assume the radius of the servo motor’s rotary disk is R, its torque is τ_a, the rope tension is F_t, the servo motor’s angular displacement is θ_a, and the rope displacement is L_t. The dynamic equations are expressed as follows:

$${F}_{t}= \, \frac{{\tau }_{a}}{R},\\ {L}_{t}=\, {\theta }_{a}\cdot R.$$

(6)

As illustrated in Fig. 2c(i), the rope displacement L_t is the sum of the deformations of three telescopic mechanisms and two torsional mechanisms, given by:

$${L}_{t}=\mathop{\sum }\limits_{i=1}^{3}({L}_{i}^{{\prime} }-{L}_{i})+\mathop{\sum }\limits_{j=1}^{2}\gamma ({\theta }_{j}),$$

(7)

where ${L}_{i}^{{\prime} }$ and L_i represent the lengths of the telescopic mechanisms during motion and under normal conditions. As shown in Fig. 2c(ii), θ_j denotes the rotation angle of the j-th torsional mechanism (comprising a torsion spring and a circumferential bearing), which is 0° under normal conditions (Fig. 2c(i)). γ(θ_j) represents the rope displacement corresponding to the rotation angle θ_j.

Assuming the spring stiffness of the telescopic mechanisms is k_cs and the torsional stiffness is k_ts, the forces F_c,i applied by the rope on the telescopic mechanisms and the torques M_t,j on the torsional mechanisms are:

$${F}_{c,i}=\, {k}_{cs}({L}_{i}^{{\prime} }-{L}_{i}),\,i=1,2,3,\\ {M}_{t,j}=\, {k}_{ts}{\theta }_{j},\,j=1,2.$$

(8)

If the moment arm corresponding to the rope tension for the j-th torsional mechanism is l_t,j, the total rope force exerted by the servo motor can be expressed as:

$${F}_{t}=\mathop{\sum }\limits_{i=1}^{3}{F}_{c,i}+\mathop{\sum }\limits_{j=1}^{2}\frac{{M}_{t,j}}{{l}_{t,j}}.$$

(9)

Given that the spring constants of all telescopic mechanisms are identical, as are the torsional stiffness values of the torsional mechanisms, and that the body exhibits symmetrical deformation under no-load conditions, the deformation variables of each mechanism satisfy:

$${F}_{c,1}={F}_{c,2}={F}_{c,3},\,\frac{{M}_{t,1}}{{l}_{t,1}}=\frac{{M}_{t,2}}{{l}_{t,2}}.$$

(10)

Through the above modeling, we can accurately calculate the motion of each module and the angular variation of the servo motor during no-load deformation. These results are combined with the following time-varying profile of the servo motor angle θ_a(t):

$${\theta }_{a}(t)=\left\{\begin{array}{ll}100{t}^{2},\hfill & 0\le t\le 0.1\hfill\\ 12.5t-0.25,\hfill & 0.1 < t < {t}_{0}-0.1\\ -100{(t-{t}_{0})}^{2}+12.5{t}_{0}-0.5,& {t}_{0}-0.1\le t\le {t}_{0}\end{array}\right.$$

(11)

where t₀ is the total deformation time. Together, these calculations yield reference angle commands for the servo motor, thus providing a theoretical foundation for deformation control.
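The reference profile of Eq. (11) and the rope displacement of Eq. (6) can be evaluated as in the following minimal sketch. The clamping of t to [0, t₀] and the requirement t₀ ≥ 0.2 are assumptions added here so that the three segments do not overlap; units follow whatever convention Eq. (11) uses, which the text does not state.

```python
# Sketch of the servo reference angle (Eq. 11) and rope displacement (Eq. 6).
# Clamping outside [0, t0] and the t0 >= 0.2 check are assumptions.


def servo_angle(t, t0):
    """Reference servo angle theta_a(t) as printed in Eq. (11)."""
    if t0 < 0.2:
        raise ValueError("t0 must be at least 0.2 for the three segments")
    t = min(max(t, 0.0), t0)  # hold the end values outside [0, t0]
    if t <= 0.1:
        return 100.0 * t * t                      # parabolic start
    if t < t0 - 0.1:
        return 12.5 * t - 0.25                    # linear middle segment
    return -100.0 * (t - t0) ** 2 + 12.5 * t0 - 0.5  # parabolic end


def rope_displacement(theta_a, disk_radius):
    """Eq. (6): L_t = theta_a * R."""
    return theta_a * disk_radius
```

As printed, the angle is continuous at both breakpoints (values 1 and 12.5 t₀ − 1.5), while the slope of the parabolic segments there (20) differs from the slope of the linear segment (12.5).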

Flight and deformation control

As previously discussed, the flight dynamics model of HI-ARM closely resembles that of traditional quadrotors. Inspired by research on quadrotor control^54,62, we adopt a simple yet effective geometric tracking control strategy for reference trajectories in Normal Size. To mitigate the effects of perturbations, the ${{{\mathcal{L}}}}_{1}$ adaptive control algorithm is implemented to compensate for external forces, while the INDI control algorithm is employed to handle external torques. For grasping control, a feedback controller based on angular error is used to facilitate morphing, which is quasi-decoupled from flight control. As illustrated in Fig. 2b, the flight and deformation control form an integrated multi-level adaptive control framework to execute autonomous aerial manipulation tasks. Further details of each component in the control architecture are provided below.

Geometric tracking control baseline

The purpose of the geometric tracking controller is to enable the vehicle to track the predetermined trajectory ${{{\bf{p}}}}_{d}(t)\in {{\mathbb{R}}}^{3}$ and yaw angle ψ(t) within the specified time interval [0, t_f]. Ignoring disturbances such as the dynamics of the motors and the aerodynamic effects of the propellers, the translational and rotational motions are controlled by the desired thrust and torque as follows:

$$T=\, \parallel -{{\bf{K}}}_{p}{{\bf{e}}}_{p}-{{\bf{K}}}_{v}{{\bf{e}}}_{v}-m{\bf{g}}+m{\ddot{{\bf{p}}}}_{W}\parallel,\\ \tau=\, -{{\bf{K}}}_{R}{{\bf{e}}}_{R}-{{\bf{K}}}_{\omega }{{\bf{e}}}_{\omega }+{\omega }_{B}\times {\bf{J}}{\omega }_{B}-{\bf{J}}({\widehat{\omega }}_{B}{{\bf{R}}}^{\top }{{\bf{R}}}_{d}{\omega }_{B,d}-{{\bf{R}}}^{\top }{{\bf{R}}}_{d}{\mathop{\omega }\limits^{\cdot }}_{B,d}),$$

(12)

where K_R, ${{{\bf{K}}}}_{\omega }\in {{\mathbb{R}}}^{3\times 3}$ are positive definite gain matrices selected by the user; R_d, ω_B,d, and ${\dot{\omega }}_{B,d}$ are the desired rotation matrix, desired angular velocity, and desired angular velocity derivative, respectively. ${{{\bf{e}}}}_{R}={({{{\bf{R}}}}_{d}^{\top }{{\bf{R}}}-{{{\bf{R}}}}^{\top }{{{\bf{R}}}}_{d})}^{\vee }/2$ and ${{{\bf{e}}}}_{\omega }={\omega }_{B}-{{{\bf{R}}}}^{\top }{{{\bf{R}}}}_{d}{\omega }_{B,d}$ are the rotation error and angular velocity error, respectively. The geometric tracking controller provides a flight control baseline framework for aerial robots, enabling precise flight control in Normal Size.
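The rotation error and the thrust magnitude of Eq. (12) can be computed as in the sketch below. It assumes rotation matrices stored as nested lists, per-axis (diagonal) position and velocity gains, and g = 9.81 m/s²; the names are illustrative and e_p, e_v are taken to be position and velocity tracking errors, which the text does not define explicitly.

```python
# Sketch of e_R and the thrust magnitude in Eq. (12).
# Assumptions: 3x3 nested-list rotation matrices, diagonal gains, g = 9.81.
import math

GRAVITY_VEC = (0.0, 0.0, -9.81)  # g = [0, 0, -g]^T, assumed magnitude


def transpose(m):
    return [list(row) for row in zip(*m)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def vee(s):
    """Map a skew-symmetric matrix to its 3-vector."""
    return (s[2][1], s[0][2], s[1][0])


def rotation_error(r, r_d):
    """e_R = (R_d^T R - R^T R_d)^vee / 2."""
    a = matmul(transpose(r_d), r)
    b = matmul(transpose(r), r_d)
    skew = [[a[i][j] - b[i][j] for j in range(3)] for i in range(3)]
    return tuple(0.5 * c for c in vee(skew))


def thrust_magnitude(kp, kv, e_p, e_v, mass, acc_ref):
    """T = || -K_p e_p - K_v e_v - m g + m p_ddot || with diagonal gains."""
    force = [-kp[i] * e_p[i] - kv[i] * e_v[i]
             - mass * GRAVITY_VEC[i] + mass * acc_ref[i] for i in range(3)]
    return math.sqrt(sum(f * f for f in force))
```

For a pure yaw offset of angle θ relative to the desired attitude, `rotation_error` returns (0, 0, sin θ), so the error grows with the misalignment up to 90°.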

${{{\mathcal{L}}}}_{1}$ adaptive position control

The dynamics equation (1) provides a theoretical description of the robot’s motion under ideal conditions. However, in real-world scenarios, HI-ARM is also subject to deformation motion disturbances, external forces, and propeller aerodynamic effects. Therefore, in order to improve the control performance of tasks such as aerial grasping, we consider incorporating these uncertainties into the state space representation of the system.

Since these uncertainties appear purely in the robot dynamics, we ignore the kinematic part of the motion model and consider only the dynamics. Disturbances experienced by the proposed robot during translational motion are represented by $\sigma \in {{\mathbb{R}}}^{3}$. Due to the under-actuated nature of the robot, its rotors lie in a single plane, so the thrust can only provide linear acceleration along the body’s Z axis. Therefore, in the actual modeling, we further divide the disturbances into a matched component ${\sigma }_{m}={f}_{z}\in {{\mathbb{R}}}^{1}$ and an unmatched component ${\sigma }_{um}={[{f}_{x}{f}_{y}]}^{\top }\in {{\mathbb{R}}}^{2}$.

The ${{{\mathcal{L}}}}_{1}$ adaptive controller consists of a state predictor, an adaptation law, and a low-pass filter (LPF). Inspired by previous studies^56,63, we select ${{\bf{z}}}={{{\bf{v}}}}_{W}\in {{\mathbb{R}}}^{3}$ as the state variables of the dynamics, and the dynamics equation considering external disturbances is:

$$\dot{{\bf{z}}}={{\bf{g}}}+\frac{{{{\bf{z}}}}_{B}}{m}(T+{f}_{{{{\mathcal{L}}}}_{1}}+{\sigma }_{m})+\frac{[{{{\bf{x}}}}_{B}\,{{{\bf{y}}}}_{B}]}{m}{\sigma }_{um}.$$

(13)

Further, it is written in a more general form as follows:

$$\dot{{\bf{z}}}={{\bf{f}}}({{{\bf{R}}}}_{B}^{W})+{{\bf{g}}}({{{\bf{R}}}}_{B}^{W})({f}_{{{{\mathcal{L}}}}_{1}}+{\sigma }_{m})+{{{\bf{g}}}}^{\perp }({{{\bf{R}}}}_{B}^{W}){\sigma }_{um},$$

(14)

where

$${{\bf{f}}}({{{\bf{R}}}}_{B}^{W})={{\bf{g}}}+\frac{{{{\bf{z}}}}_{B}}{m}T,\,{{\bf{g}}}({{{\bf{R}}}}_{B}^{W})=\frac{{{{\bf{z}}}}_{B}}{m},\,{{{\bf{g}}}}^{\perp }({{{\bf{R}}}}_{B}^{W})=\frac{[{{{\bf{x}}}}_{B}\,{{{\bf{y}}}}_{B}]}{m}.$$

The state predictor of the ${{{\mathcal{L}}}}_{1}$ adaptive controller is defined as:

$$\dot{\widehat{{\bf{z}}}}={{\bf{f}}}+{{\bf{g}}}({f}_{{{{\mathcal{L}}}}_{1}}+{\widehat{\sigma }}_{m})+{{{\bf{g}}}}^{\perp }{\widehat{\sigma }}_{um}+{{{\bf{A}}}}_{s}\widetilde{{{\bf{z}}}},$$

(15)

where $\widetilde{{{\bf{z}}}}=\widehat{{{\bf{z}}}}-{{\bf{z}}}$ is the prediction error, and for simplicity, we assume $\widehat{{{\bf{z}}}}(0)={{\bf{z}}}(0)$. A_s is a Hurwitz matrix, which makes the prediction error $\parallel \widetilde{{{\bf{z}}}}\parallel$ converge exponentially to 0. Defining ${{\boldsymbol{\Phi }}}\triangleq {{{\bf{A}}}}_{s}^{-1}(\exp ({{{\bf{A}}}}_{s}{T}_{s})-{{\bf{I}}})$, we use a piecewise-constant adaptation law instead of a projection-operator-based law because the former is numerically robust. For t ∈ [iT_s, (i + 1)T_s], the piecewise-constant adaptation law is:

$$\left[\begin{array}{c}{\widehat{\sigma }}_{m}(i{T}_{s})\\ {\widehat{\sigma }}_{um}(i{T}_{s})\end{array}\right]=-\left[\begin{array}{cc}{{{\bf{1}}}}_{1\times 1} & {{{\bf{0}}}}_{1\times 2}\\ {{{\bf{0}}}}_{2\times 1} & {{{\bf{1}}}}_{2\times 2}\end{array}\right]{{\bf{G}}}{(i{T}_{s})}^{-1}{{{\boldsymbol{\Phi }}}}^{-1}\mu (i{T}_{s}),$$

(16)

where ${{\bf{G}}}(i{T}_{s})=[{{\bf{g}}}({{{\bf{R}}}}_{B}^{W})\,{{{\bf{g}}}}^{\perp }({{{\bf{R}}}}_{B}^{W})]$ and $\mu (i{T}_{s})=\exp ({{{\bf{A}}}}_{s}{T}_{s})\widetilde{{{\bf{z}}}}(i{T}_{s})$, for $i\in {\mathbb{N}}$.

The ${{{\mathcal{L}}}}_{1}$ adaptive controller compensates only for the matched component of the uncertainties, and only within the bandwidth of the strictly proper, stable low-pass filter C(s):

$${f}_{{{{\mathcal{L}}}}_{1}}=-{{\bf{C}}}(s){\widehat{{{\boldsymbol{\sigma }}}}}_{m}(s).$$

(17)

Here we use a first-order low-pass filter as an example, with the transfer function C(s) = ω_c/(s + ω_c). Moreover, the unmatched component of the uncertainties does not need to be compensated directly; its effect can be indirectly canceled out by the baseline control law. Proofs regarding stability and bounds on states and controls can be found in ref. ⁵⁸.
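The interplay of the state predictor (15), the piecewise-constant adaptation law (16), and the low-pass filter (17) can be illustrated with a scalar sketch. It restricts the model to the vertical axis in level hover, so that G reduces to the scalar 1/m and only the matched disturbance appears, and it integrates with forward Euler. All numerical values (mass, A_s, ω_c, T_s, disturbance) are hypothetical, and the baseline thrust is simply fixed at mg.

```python
# Scalar sketch of the L1 loop (Eqs. 13, 15, 16, 17) for vertical hover.
# Assumptions: level attitude (z_B = z_W), constant matched disturbance,
# forward-Euler integration, hypothetical parameter values.
import math


def simulate_l1(sigma_true=-2.0, mass=1.0, a_s=-10.0,
                omega_c=5.0, ts=0.002, steps=1500):
    g = 9.81
    thrust = mass * g                 # baseline thrust balancing gravity
    gain = 1.0 / mass                 # scalar counterpart of g(R) = z_B / m
    phi = (math.exp(a_s * ts) - 1.0) / a_s    # Phi = A_s^-1 (exp(A_s Ts) - I)
    alpha = 1.0 - math.exp(-omega_c * ts)     # discrete first-order LPF gain
    z = z_hat = 0.0                   # true and predicted vertical velocity
    sigma_hat = f_l1 = 0.0
    for _ in range(steps):
        z_tilde = z_hat - z           # prediction error
        mu = math.exp(a_s * ts) * z_tilde
        sigma_hat = -(1.0 / gain) * (1.0 / phi) * mu      # Eq. (16)
        f_l1 += alpha * (-sigma_hat - f_l1)               # Eq. (17), C(s)
        z_dot = -g + gain * (thrust + f_l1 + sigma_true)  # Eq. (13)
        z_hat_dot = (-g + gain * (thrust + f_l1 + sigma_hat)
                     + a_s * z_tilde)                     # Eq. (15)
        z += ts * z_dot
        z_hat += ts * z_hat_dot
    return sigma_hat, f_l1


sigma_est, f_comp = simulate_l1()
```

In this toy setting the estimate settles close to the injected disturbance and the filtered compensation approaches its negative; the small residual offset comes from the Euler discretization rather than from the adaptation law itself.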

INDI torque feedback control

The dynamics equation (2) neglects unmodeled torque terms, which can adversely affect the overall control performance. To address this issue, we adopt INDI, a control strategy based on motor speed and IMU feedback. This approach uses instantaneous sensor measurements to represent system dynamics and is capable of responding quickly to input commands in real time.

INDI is also robust against model uncertainties and external disturbances, and its effectiveness and robustness have been confirmed in previous studies^57,64. Drawing on previous work⁵⁷, we design an INDI torque compensation control method for HI-ARM.

We re-model the rotational dynamics equation (2), incorporating external torque disturbances τ_ext, and its expression is given by:

$${\dot{\omega }}_{B}={{{\bf{J}}}}^{-1}(\tau+{\tau }_{ext}-{\omega }_{B}\times {{\bf{J}}}{\omega }_{B}).$$

(18)

Using angular velocity, angular acceleration, and control torque, we can calculate the external torque disturbance, as expressed below:

$${\tau }_{ext}={{\bf{J}}}{\dot{\omega }}_{B,f}-{\tau }_{f}+{\omega }_{B,f}\times {{\bf{J}}}{\omega }_{B,f},$$

(19)

where τ_f is the control torque in the body coordinate system, which is obtained from the measured motor speeds after low-pass filtering. The terms ω_B,f and ${\dot{\omega }}_{B,f}$ represent the measured body angular velocity and angular acceleration, respectively. The external torque τ_ext is assumed to vary slowly relative to the LPF dynamics. Substituting equation (19) into equation (18), we get:

$${\mathop{\omega }\limits^{\cdot }}_{B}=\, {{\bf{J}}}^{-1}(\tau+{\tau }_{ext}-{\omega }_{B}\times {\bf{J}}{\omega }_{B})\\=\, {{\bf{J}}}^{-1}(\tau+({\bf{J}}{\mathop{\omega }\limits^{\cdot }}_{B,f}-{\tau }_{f}+{\omega }_{B,f}\times {\bf{J}}{\omega }_{B,f})-{\omega }_{B}\times {\bf{J}}{\omega }_{B})\\=\, {\mathop{\omega }\limits^{\cdot }}_{B,f}+{{\bf{J}}}^{-1}(\tau -{\tau }_{f}).$$

(20)

In equation (20), we assume that the difference between the gyroscopic torque term and its filtered counterpart is sufficiently small to be negligible, as it changes relatively slowly compared to angular acceleration and control torque.

Using the four motor speed commands, the desired total thrust and angular acceleration can be inferred. By inverting equation (20), we obtain the incremental expression of the desired control torque command τ_INDI as follows:

$${\tau }_{{{\rm{INDI}}}}={\tau }_{f}+{{\bf{J}}}({\dot{\omega }}_{B,d}-{\dot{\omega }}_{B,f}).$$

(21)
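Equations (19) and (21) amount to a few vector operations, sketched below under the assumption of a diagonal inertia tensor and already-filtered measurements. The first term in the parentheses of Eq. (21) is read here as the desired angular acceleration, consistent with inverting Eq. (20); function and argument names are hypothetical.

```python
# Sketch of the external-torque estimate (Eq. 19) and the incremental
# torque command (Eq. 21). Assumption: diagonal inertia tensor J.


def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def external_torque(inertia_diag, omega_f, omega_dot_f, tau_f):
    """Eq. (19): tau_ext = J w_dot_f - tau_f + w_f x J w_f."""
    j_omega = tuple(j * w for j, w in zip(inertia_diag, omega_f))
    gyro = cross(omega_f, j_omega)
    return tuple(j * wd - tf + gy
                 for j, wd, tf, gy in zip(inertia_diag, omega_dot_f,
                                          tau_f, gyro))


def indi_torque(inertia_diag, tau_f, omega_dot_des, omega_dot_f):
    """Eq. (21): tau_INDI = tau_f + J (w_dot_des - w_dot_f)."""
    return tuple(tf + j * (wd - wf)
                 for tf, j, wd, wf in zip(tau_f, inertia_diag,
                                          omega_dot_des, omega_dot_f))
```

When the measured angular acceleration already matches the desired one, `indi_torque` returns the filtered torque unchanged, which reflects the incremental nature of the command.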

Multi-level adaptive flight control

External forces and torques estimated using the aforementioned ${{{\mathcal{L}}}}_{1}$ adaptive position control and INDI torque feedback algorithms are incorporated into a geometric tracking baseline through feedback compensation. The final form of the multi-level adaptive flight control module can then be represented as follows:

$${T}_{d}=\, \parallel -{{\bf{K}}}_{p}{{\bf{e}}}_{p}-{{\bf{K}}}_{v}{{\bf{e}}}_{v}-m{\bf{g}}+m{\ddot{{\bf{p}}}}_{W}\parallel -{f}_{{\mathcal{L}}1},\\ {\tau }_{d}=\, -{{\bf{K}}}_{R}{{\bf{e}}}_{R}-{{\bf{K}}}_{\omega }{{\bf{e}}}_{\omega }+{\omega }_{B}\times {\bf{J}}{\omega }_{B}-{\bf{J}}({\widehat{\omega }}_{B}{{\bf{R}}}^{\top }{{\bf{R}}}_{d}{\omega }_{B,d}-{{\bf{R}}}^{\top }{{\bf{R}}}_{d}{\mathop{\omega }\limits^{\cdot }}_{B,d})-{\tau }_{{\rm{INDI}}}.$$

(22)

Servo motor control

The nylon rope towed by the servo motor is lightweight, and the tension experienced during deformation is much greater than the friction, allowing us to assume uniform tension in the rope for simplification. In the actual grasping scenario, HI-ARM’s deformation is controlled by the servo motor to accurately grasp a target object. With an initial reference angle of the servo motor set to θ_r and a reference angular velocity Ω_r, we design a proportional controller based on the angle position error. The servo motor’s rotational angular velocity is adjusted as follows:

$${\Omega }_{d}={K}_{\theta }({\theta }_{r}-\widehat{\theta })+{\Omega }_{r},$$

(23)

where $\widehat{\theta }$ is the estimated angle of the servo motor, and K_θ is the control gain. Since the initial reference angle may not be perfectly accurate due to varying object sizes and shapes, torque sensing becomes crucial. The servo motor includes torque feedback and overload protection mechanisms to ensure precise grasping control. If the set angle is reached or the motor torque exceeds the blocking torque, the grasping action is considered complete, indicating that the robot has successfully grasped the object.
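The proportional law of Eq. (23) and the completion rule described above can be sketched as follows. The angle tolerance used to decide that the set angle has been reached is an assumption introduced here, as are the function names; the servo interface itself is not modeled.

```python
# Sketch of the servo rate command (Eq. 23) and the grasp-completion check.
# The angle tolerance is an assumed parameter, not a value from the paper.


def servo_rate_command(theta_ref, theta_est, omega_ref, k_theta):
    """Eq. (23): Omega_d = K_theta (theta_r - theta_hat) + Omega_r."""
    return k_theta * (theta_ref - theta_est) + omega_ref


def grasp_complete(theta_est, theta_ref, torque, blocking_torque,
                   angle_tol=1e-2):
    """True once the set angle is reached or the torque exceeds the limit."""
    angle_reached = abs(theta_ref - theta_est) <= angle_tol
    torque_exceeded = torque > blocking_torque
    return angle_reached or torque_exceeded
```

The torque branch lets the grasp terminate early on objects larger than the initial reference angle anticipates, which is the case the text highlights.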

Trajectory planning

Quadrotors possess differential flatness, meaning that their system states and inputs can be represented by a set of system outputs such as [x, y, z, ψ]^⊤⁵⁴. This characteristic provides a foundation for drones such as HI-ARM to achieve fast and efficient trajectory planning. Inspired by our previous work⁵⁵, we employ a trajectory representation called MINCO, which is based on the differential flatness of quadrotors, to plan the flight trajectory in space and time. The advantage of MINCO is that it allows users to independently adjust the spatial and temporal attributes of the path, with operations of linear complexity, which makes spatial-temporal deformation more convenient.

The MINCO piecewise trajectory involves two main parameters: first, the duration of each segment, represented by ${{\bf{T}}}\in {{\mathbb{R}}}^{M}$; second, the waypoints connecting each segment, represented by ${{\bf{q}}}\in {{\mathbb{R}}}^{3\times (M-1)}$, where M is the number of segments. Then, the three-dimensional spatial point p(t) on the MINCO trajectory at a time t is determined through the operation ${{\mathcal{M}}}$:

$$p(t)={{{\mathcal{M}}}}_{{{\bf{q}}},{{\bf{T}}}}(t).$$

(24)

For the s-integrator chain dynamics (in this work s=3), the MINCO trajectory is a ${{{\mathcal{C}}}}^{s-1}$ polynomial spline of degree 2s-1 by default, with constant boundaries and minimum control effort given {q, T}. Since we are using a jerk-based control system model, smoothness is maximized by minimizing control effort. For the trajectory defined over the time domain t ∈ [t₀, t_M], the control effort optimization is given by:

$${\min }_{p(t)}{\int }_{{t}_{0}}^{{t}_{M}}\parallel {p}^{(s)}(t){\parallel }^{2}dt.$$

(25)

MINCO demonstrates its advantage of linear complexity when converting user-defined parameters {q, T} into polynomial coefficients c and time profiles T_p. This conversion can be achieved through a non-singular banded matrix ${{\bf{M}}}\in {{\mathbb{R}}}^{2Ms\times 2Ms}$ and ${{\bf{b}}}\in {{\mathbb{R}}}^{2Ms\times 3}$, which are valid for any T ≻ 0, as follows:

$${{\bf{M}}}({{\bf{T}}}){{\bf{c}}}={{\bf{b}}}({{\bf{q}}}),{{{\bf{T}}}}_{p}={{\bf{T}}},$$

(26)

at the same time, the restoration of the trajectory through banded PLU Factorization also has linear complexity, and the gradients of the polynomial coefficients can also be propagated to MINCO parameters in linear time. More details on the representation of MINCO trajectories can be found in the literature⁵⁵.

Based on the above advantages of MINCO trajectories, we design a trajectory planning method for HI-ARM. To achieve smooth motion and efficient flight for HI-ARM, we define two metrics for smoothness and time, and minimize their weighted sum. Decision variables are the MINCO parameters q and T.

Within the time t ∈ [t₀, t_M], the MINCO trajectory optimization form is as follows:

$$\begin{array}{l}\mathop{\min }\limits_{{\bf{q}},{\bf{T}}}{\int }_{{t}_{0}}^{{t}_{M}}J({\bf{q}},{\bf{T}},t)\,dt,\\ \,{\rm{s.t.}}\,{\mathcal{H}}({\bf{q}},{\bf{T}},t)=0,\,{\mathcal{G}}({\bf{q}},{\bf{T}},t)\le 0,\end{array}$$

(27)

where ${{\mathcal{H}}}$ and ${{\mathcal{G}}}$ represent equality and inequality constraints, respectively, including continuity constraints for start and end states, dynamic feasibility constraints, time constraints, etc. According to the literature⁶⁵, the above constrained optimization problem can be further transformed into an unconstrained problem for more efficient solution. The transformed trajectory optimization problem is as follows:

$$\mathop{\min }\limits_{{\bf{q}},{\bf{T}}}\mathop{\sum }\limits_{x}{\lambda }_{x}{J}_{x},$$

(28)

where J_x are various penalty terms, and λ_x are relative weights. The subscript x = {s, t, d} represents smoothness (s), total time (t), and dynamic feasibility (d), etc. Their corresponding detailed penalty functions are as follows:

Maximizing Smoothness: According to equation (25), the smoothness penalty J_s is defined as the integral of the square of the s-order derivative, i.e.,

$${J}_{s}={\int }_{{t}_{0}}^{{t}_{M}}\parallel {p}^{(s)}(t){\parallel }^{2}\,dt.$$

(29)

This integral can be analytically calculated because the MINCO trajectory can be represented as a piecewise polynomial according to equation (26).
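For a single polynomial piece on one axis, the closed form of Eq. (29) follows directly from the coefficients, as the minimal sketch below shows. It assumes the piece is written as p(t) = Σ c_k t^k on [0, T]; per-axis and per-piece costs would simply be summed. The coefficient ordering and function names are illustrative and do not reflect the MINCO implementation.

```python
# Sketch: analytic integral of the squared s-th derivative of one
# polynomial piece p(t) = sum_k c_k t^k over [0, T] (cf. Eq. 29).
import math


def derivative_coeffs(coeffs, order):
    """Coefficients (ascending powers) of the order-th derivative."""
    return [c * math.factorial(k) / math.factorial(k - order)
            for k, c in enumerate(coeffs) if k >= order]


def smoothness_cost(coeffs, duration, s=3):
    """Integral of (p^(s)(t))^2 over [0, duration], evaluated in closed form."""
    d = derivative_coeffs(coeffs, s)
    # The product of t^i and t^j integrates to T^(i+j+1) / (i+j+1).
    return sum(di * dj * duration ** (i + j + 1) / (i + j + 1)
               for i, di in enumerate(d)
               for j, dj in enumerate(d))
```

For example, p(t) = t³ has constant jerk 6, so the cost over a duration T is 36T.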

Minimizing Total Time: In most cases, a shorter flight time is desirable, so we also penalize the total flight time, with the total time penalty J_t defined as

$${J}_{t}=\,{{\rm{sum}}}\,({{\bf{T}}}).$$

(30)

Dynamic Feasibility: For differentially flat multicopters, dynamic feasibility is ensured by limiting the magnitude of trajectory derivatives. In our work, we add penalties to limit the amplitude of velocity, acceleration, and jerk if these derivatives exceed physical thresholds, i.e.,

$${J}_{d,v}=\mathop{\sum }\limits_{i=0}^{\kappa }\max {\{(\mathop{p}\limits^{\cdot }{({t}_{i})}^{2}-{v}_{m}^{2}),0\}}^{3},$$

(31a)

$${J}_{d,a}=\mathop{\sum }\limits_{i=0}^{\kappa }\max {\{(\ddot{p}{({t}_{i})}^{2}-{a}_{m}^{2}),0\}}^{3},$$

(31b)

$${J}_{d,j}=\mathop{\sum }\limits_{i=0}^{\kappa }\max {\{({p}^{(3)}{({t}_{i})}^{2}-{j}_{m}^{2}),0\}}^{3},$$

(31c)

$${J}_{d}={J}_{d,v}+{J}_{d,a}+{J}_{d,j},$$

(31d)

where v_m, a_m, and j_m are the maximum permissible magnitudes of velocity, acceleration, and jerk, respectively; t_i = t₀ + (t_M − t₀)i/κ indicates a finite number of sampled timestamps, where κ + 1 equals the sample number. We sum J_d,v, J_d,a, and J_d,j directly because they are of similar magnitude.
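The sampled penalties of Eqs. (31a) to (31d) can be evaluated as in the following sketch, which assumes the velocity, acceleration, and jerk vectors have already been sampled from the trajectory at the timestamps t_i. The helper names and limits are hypothetical.

```python
# Sketch of the dynamic-feasibility penalty (Eqs. 31a-31d) on sampled
# derivatives. Sampling the trajectory itself is assumed to happen elsewhere.


def sample_times(t0, t_end, kappa):
    """t_i = t0 + (t_M - t0) * i / kappa for i = 0..kappa (kappa + 1 samples)."""
    return [t0 + (t_end - t0) * i / kappa for i in range(kappa + 1)]


def excess_penalty(samples, limit):
    """Sum over samples of max(||d(t_i)||^2 - limit^2, 0)^3."""
    total = 0.0
    for vec in samples:
        squared_norm = sum(c * c for c in vec)
        total += max(squared_norm - limit * limit, 0.0) ** 3
    return total


def feasibility_cost(vel, acc, jerk, v_max, a_max, j_max):
    """Eq. (31d): J_d = J_{d,v} + J_{d,a} + J_{d,j}."""
    return (excess_penalty(vel, v_max)
            + excess_penalty(acc, a_max)
            + excess_penalty(jerk, j_max))
```

Samples within the limits contribute exactly zero, and the cubic exponent keeps the penalty smooth at the threshold, which suits a gradient-based solver such as L-BFGS.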

Finally, we solve the trajectory optimization problem (28) using an open-source L-BFGS solver. The trajectory planner starts from the global target position given by the user and iteratively optimizes to generate a local trajectory that meets the task requirements. Specific details can be found in the literature⁵⁵.

Data availability

The data that support the findings of this study are publicly available on Zenodo at https://doi.org/10.5281/zenodo.17340496.

Code availability

The code of the autonomous manipulation mode used in this study is publicly available at https://github.com/Wyz000/HIARM (https://doi.org/10.5281/zenodo.17310380).

References

Ruben, J. & Feduccia, A. The origin and evolution of birds. Bioscience 47, 145–186 (1997).
Dial, K. P. Wing-assisted incline running and the evolution of flight. Science 299, 402–404 (2003).
Cano, R., Pérez, C., Pruano, F., Ollero, A. & Heredia, G. Mechanical design of a 6-dof aerial manipulator for assembling bar structures using UAVs. In 2nd RED-UAS 2013 workshop on research, education and development of unmanned aerial systems, 218 (2013).
Kondak, K. et al. Aerial manipulation robot composed of an autonomous helicopter and a 7 degrees of freedom industrial manipulator. In 2014 IEEE international conference on robotics and automation (ICRA), 2107–2112 (IEEE, 2014).
Jimenez-Cano, A. E., Martin, J., Heredia, G., Ollero, A. & Cano, R. Control of an aerial robot with multi-link arm for assembly tasks. In 2013 IEEE International Conference on Robotics and Automation, 4916–4921 (IEEE, 2013).
Zufferey, R. et al. How ornithopters can perch autonomously on a branch. Nat. Commun. 13, 7713 (2022).
Zhao, M., Okada, K. & Inaba, M. Versatile articulated aerial robot dragon: Aerial manipulation and grasping by vectorable thrust control. Int. J. Robot. Res. 42, 214–248 (2023).
Roderick, W. R., Cutkosky, M. R. & Lentink, D. Bird-inspired dynamic grasping and perching in arboreal environments. Sci. Robot. 6, eabj7562 (2021).
Ubellacker, S., Ray, A., Bern, J. M., Strader, J. & Carlone, L. High-speed aerial grasping using a soft drone with onboard perception. npj Robot. 2, 5 (2024).
Stewart, W., Guarino, L., Piskarev, Y. & Floreano, D. Passive perching with energy storage for winged aerial robots. Adv. Intell. Syst. 5, 2100150 (2023).
Aucone, E. et al. Drone-assisted collection of environmental DNA from tree branches for biodiversity monitoring. Sci. Robot. 8, eadd5762 (2023).
Kannan, S. S. & Min, B.-C. Autonomous drone delivery to your door and yard. In 2022 International Conference on Unmanned Aircraft Systems (ICUAS), 452–461 (2022).
Afifi, A. et al. Physical human-aerial robot interaction and collaboration: Exploratory results and lessons learned. In 2023 International Conference on Unmanned Aircraft Systems (ICUAS), 956–962 (2023).
Alejandro, S., Antonio, G., Carlos, A. & Anibal, O. Through-window home aerial delivery system with in-flight parcel load and handover: Design and validation in indoor scenario. Int. J. Soc. Robot. 16, 2109–2132 (2024).
Ollero, A., Tognon, M., Suarez, A., Lee, D. & Franchi, A. Past, present, and future of aerial robotic manipulators. IEEE Trans. Robot. 38, 626–645 (2022).
Heredia, G. et al. Control of a multirotor outdoor aerial manipulator. In 2014 IEEE/RSJ international conference on intelligent robots and systems, 3417–3422 (IEEE, 2014).
Bellicoso, C. D., Buonocore, L. R., Lippiello, V. & Siciliano, B. Design, modeling and control of a 5-dof light-weight robot arm for aerial manipulation. In 2015 23rd Mediterranean Conference on Control and Automation (MED), 853–858 (IEEE, 2015).
Kim, S., Choi, S. & Kim, H. J. Aerial manipulation using a quadrotor with a two dof robotic arm. In 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 4990–4995 (IEEE, 2013).
Thomas, J., Polin, J., Sreenath, K. & Kumar, V. Avian-inspired grasping for quadrotor micro uavs. In International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, 55935, V06AT07A014 (American Society of Mechanical Engineers, 2013).
Wang, M. et al. Millimeter-level pick and peg-in-hole task achieved by aerial manipulator. IEEE Transactions on Robotics (2023).
Suarez, A. et al. Lightweight and human-size dual arm aerial manipulator. In 2017 international conference on unmanned aircraft systems (ICUAS), 1778–1784 (IEEE, 2017).
Orsag, M., Korpela, C., Bogdan, S. & Oh, P. Valve turning using a dual-arm aerial manipulator. In 2014 international conference on unmanned aircraft systems (ICUAS), 836–841 (IEEE, 2014).
Danko, T. W., Chaney, K. P. & Oh, P. Y. A parallel manipulator for mobile manipulating uavs. In 2015 IEEE international conference on technologies for practical robot applications (TePRA), 1–6 (IEEE, 2015).
Zhang, K. et al. Aerial additive manufacturing with multiple autonomous robots. Nature 609, 709–717 (2022).
Cao, H., Shen, J., Liu, C., Zhu, B. & Zhao, S. Motion planning for aerial pick-and-place based on geometric feasibility constraints. IEEE Transactions on Automation Science and Engineering. 22, 2577–2594 (2024).
Backus, S. B., Odhner, L. U. & Dollar, A. M. Design of hands for aerial manipulation: Actuator number and routing for grasping and perching. In 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 34–40 (IEEE, 2014).
Pounds, P. E., Bersak, D. R. & Dollar, A. M. Grasping from the air: Hovering capture and load stability. In 2011 IEEE international conference on robotics and automation, 2491–2498 (IEEE, 2011).
Mellinger, D., Lindsey, Q., Shomin, M. & Kumar, V. Design, modeling, estimation and control for aerial grasping and manipulation. In 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2668–2673 (IEEE, 2011).
Popek, K. M. et al. Autonomous grasping robotic aerial system for perching (agrasp). In 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 1–9 (IEEE, 2018).
Ryll, M. et al. 6d interaction control with aerial robots: The flying end-effector paradigm. Int. J. Robot. Res. 38, 1045–1062 (2019).
Franchi, A., Carli, R., Bicego, D. & Ryll, M. Full-pose tracking control for aerial robotic systems with laterally bounded input force. IEEE Trans. Robot. 34, 534–541 (2018).
Bodie, K. et al. Active interaction force control for contact-based inspection with a fully actuated aerial vehicle. IEEE Trans. Robot. 37, 709–722 (2020).
Park, S. et al. Odar: Aerial manipulation platform enabling omnidirectional wrench generation. IEEE/ASME Trans. Mechatron. 23, 1907–1918 (2018).
Tsukagoshi, H., Watanabe, M., Hamada, T., Ashlih, D. & Iizuka, R. Aerial manipulator with perching and door-opening capability. In 2015 IEEE international conference on robotics and automation (ICRA), 4663–4668 (IEEE, 2015).
Darivianakis, G., Alexis, K., Burri, M. & Siegwart, R. Hybrid predictive control for aerial robotic physical interaction towards inspection operations. In 2014 IEEE international conference on robotics and automation (ICRA), 53–58 (IEEE, 2014).
Bodie, K. et al. An omnidirectional aerial manipulation platform for contact-based inspection. arXiv preprint arXiv:1905.03502 (2019).
Peng, R., Wang, Z. & Lu, P. Aecom: An aerial continuum manipulator with imu-based kinematic modeling and tendon-slacking prevention. IEEE Transactions on Systems, Man, and Cybernetics: Systems (2023).
Jiang, P. et al. A novel scaffold-reinforced actuator with tunable attitude ability for grasping. IEEE Trans. Robot. 39, 1164–1177 (2022).
Hingston, L., Mace, J., Buzzatto, J. & Liarokapis, M. Reconfigurable, adaptive, lightweight grasping mechanisms for aerial robotic platforms. In 2020 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), 169–175 (IEEE, 2020).
Xu, M., Huang, S., He, R., Yu, D. & Wang, H. Aerial shooting manipulator for distant grasping. IEEE Robot. Autom. Lett. 8, 1991–1998 (2023).
Peng, R., Wang, Y., Lu, M. & Lu, P. A dexterous and compliant aerial continuum manipulator for cluttered and constrained environments. Nat. Commun. 16, 889 (2025).
Bauer, E. et al. An open-source soft robotic platform for autonomous aerial manipulation in the wild. In Conference on Robot Learning (2024).
Nguyen, P. H., Patnaik, K., Mishra, S., Polygerinos, P. & Zhang, W. A soft-bodied aerial robot for collision resilience and contact-reactive perching. Soft Robot. 10, 838–851 (2023).
Zhao, M. et al. Design, modeling, and control of an aerial robot dragon: A dual-rotor-embedded multilink robot with the ability of multi-degree-of-freedom aerial transformation. IEEE Robot. Autom. Lett. 3, 1176–1183 (2018).
Shi, F., Zhao, M., Murooka, M., Okada, K. & Inaba, M. Aerial regrasping: Pivoting with transformable multilink aerial robot. In 2020 IEEE International Conference on Robotics and Automation (ICRA), 200–207 (IEEE, 2020).
Zhao, M. et al. Versatile multilinked aerial robot with tilted propellers: Design, modeling, control, and state estimation for autonomous flight and manipulation. J. Field Robot. 38, 933–966 (2021).
Bucki, N., Tang, J. & Mueller, M. W. Design and control of a midair-reconfigurable quadcopter using unactuated hinges. IEEE Trans. Robot. 39, 539–557 (2022).
Zhao, N., Luo, Y., Deng, H., Shen, Y. & Xu, H. The deformable quad-rotor enabled and wasp-pedal-carrying inspired aerial gripper. In 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 1–9 (IEEE, 2018).
Falanga, D., Kleber, K., Mintchev, S., Floreano, D. & Scaramuzza, D. The foldable drone: a morphing quadrotor that can squeeze and fly. IEEE Robot. Autom. Lett. 4, 209–216 (2018).
Wu, Y. et al. Ring-rotor: A novel retractable ring-shaped quadrotor with aerial grasping and transportation capability. IEEE Robot. Autom. Lett. 8, 2126–2133 (2023).
Xu, M. et al. Biomimetic morphing quadrotor inspired by eagle claw for dynamic grasping. IEEE Transactions on Robotics 40, 2513–2528 (2024).
Susan, S. et al. Gray’s anatomy e-book. In The anatomical basis of clinical practice. 5, 18–56 (2015).
Doyle, J. R. Anatomy of the finger flexor tendon sheath and pulley system. J. Hand Surg. 13, 473–484 (1988).
Mellinger, D. & Kumar, V. Minimum snap trajectory generation and control for quadrotors. In 2011 IEEE international conference on robotics and automation, 2520–2525 (IEEE, 2011).
Wang, Z., Zhou, X., Xu, C. & Gao, F. Geometrically constrained trajectory optimization for multicopters. IEEE Trans. Robot. 38, 3259–3278 (2022).
Wu, Z. et al. L1 adaptive augmentation for geometric tracking control of quadrotors. In 2022 International Conference on Robotics and Automation (ICRA), 1329–1336 (IEEE, 2022).
Tal, E. & Karaman, S. Accurate tracking of aggressive quadrotor trajectories using incremental nonlinear dynamic inversion and differential flatness. IEEE Trans. Control Syst. Technol. 29, 1203–1218 (2020).
Wang, X. & Hovakimyan, N. L1 adaptive controller for nonlinear time-varying reference systems. Syst. Control Lett. 61, 455–463 (2012).
Liu, M. et al. Visual whole-body control for legged loco-manipulation. arXiv preprint arXiv:2403.16967 (2024).
Wu, T., Chen, Y., Chen, T., Zhao, G. & Gao, F. Whole-body control through narrow gaps from pixels to action. In 2025 IEEE International Conference on Robotics and Automation (ICRA), 11317–11324 (2025).
Zhou, X. et al. Swarm of micro flying robots in the wild. Sci. Robot. 7, eabm5954 (2022).
Lee, T., Leok, M. & McClamroch, N. H. Geometric tracking control of a quadrotor uav on SE(3). In 49th IEEE conference on decision and control (CDC), 5420–5425 (IEEE, 2010).
Pravitra, J., Ackerman, K. A., Cao, C., Hovakimyan, N. & Theodorou, E. A. L1-adaptive mppi architecture for robust and agile control of multirotors. In 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 7661–7666 (IEEE, 2020).
Sun, S., Romero, A., Foehn, P., Kaufmann, E. & Scaramuzza, D. A comparative study of nonlinear mpc and differential-flatness-based control for quadrotor agile flight. IEEE Trans. Robot. 38, 3357–3373 (2022).
Teo, K. L., Rehbock, V. & Jennings, L. S. A new computational algorithm for functional inequality constrained optimization problems. Automatica 29, 789–792 (1993).

Acknowledgements

We thank M. Wang, T. Wu, Y. Zhong, X. Zhou, T. Zhang, and Y. Gao for their valuable suggestions on the manuscript, and R. Jin for photography and video recording. We sincerely thank J.W. and Y.Z. for their help with the experiments, and J. Zhang for help with the artwork. This work was supported by the National Key R&D Program of China under grant no. 2023YFB4706600 and the National Natural Science Foundation of China under grant no. 62322314.

Author information

These authors contributed equally: Yuze Wu, Fan Yang.

Authors and Affiliations

Institute of Cyber-Systems and Control, College of Control Science and Engineering, Zhejiang University, Hangzhou, China
Yuze Wu, Rui Jin, Yuhang Zhong, Junjie Wang & Fei Gao
Huzhou Institute, Zhejiang University, Huzhou, China
Yuze Wu, Fan Yang, Rui Jin, Yuhang Zhong, Junjie Wang, Xuankang Wu & Fei Gao
Differential Robotics, Hangzhou, China
Yuze Wu & Fei Gao

Contributions

Y.W. contributed to the hardware and software design, experiments, and manuscript writing. F.Y. contributed to the hardware design, controller design, and experiments. R.J. contributed to artwork and experiments. Y.Z. contributed to experiments and offered suggestions on the manuscript writing. J.W. and X.W. contributed to experiments. F.G. directed the research, provided the primary idea and funding, offered key suggestions on software and hardware debugging, and revised the manuscript.

Corresponding author

Correspondence to Fei Gao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Moju Zhao, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Movie S1

Supplementary Movie S2

Supplementary Movie S3

Supplementary Movie S4

Transparent Peer Review file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Wu, Y., Yang, F., Jin, R. et al. Hand-like autonomous flying robot for airborne grasping and interaction. Nat Commun 17, 2200 (2026). https://doi.org/10.1038/s41467-026-68967-3

Received: 21 March 2025
Accepted: 09 January 2026
Published: 30 January 2026
Version of record: 04 March 2026
DOI: https://doi.org/10.1038/s41467-026-68967-3