Developed an algorithm that allows transferring skills between robots
RoVi-Aug is changing the approach to robot learning.
Researchers at the University of California, Berkeley, have developed a new computational framework called RoVi-Aug that augments robotic data and facilitates skill transfer between different robots. The framework uses generative models to augment image data and create synthesized visual demonstrations of tasks with different camera views for different robots.
According to the development team, led by researchers Lawrence Chen and Chenfeng Xu, the goal of their work was to overcome the limitations of existing algorithms that cannot reliably transfer skills between robots with different bodies and characteristics. They noted that many existing robot training datasets are unbalanced and contain inaccuracies that can lead to overtraining certain types of robots.
“The success of current machine learning systems, particularly generative models, demonstrates impressive generalizability and motivates robotics researchers to seek how to achieve similar generalizability in robotics,” Chen and Xu said.
RoVi-Aug consists of two separate components: the robotic augmentation module (Ro-Aug) and the viewpoint augmentation module (Vi-Aug). The first component synthesizes demonstration data involving different robotic systems, while the second produces demonstrations from different angles.
“Ro-Aug has two key features: a fine-tuned SAM model for robot segmentation and a fine-tuned ControlNet for replacing the original robot with another robot. Meanwhile, Vi-Aug uses ZeroNVS, a state-of-the-art novel view synthesis model, to create new scene perspectives, making the model adaptable to different camera viewpoints,” explained Chen and Xu.
The researchers used their framework to create an augmented robot dataset and tested its effectiveness for policy learning and skill transfer between different robots. Their results showed that RoVi-Aug enables policy updates that generalize well between different robots and camera setups.
“The key innovation is the application of generative models, such as image generation and new view synthesis, to the task of cross-embodied robot learning,” explained Chen and Xu.
This work can contribute to robot development, helping researchers easily expand the skill sets of the systems. In the future, it can be used by other teams to transfer skills between different robots or develop better universal robot policies.
According to the authors of the paper, RoVi-Aug can be a cost-effective alternative for simply compiling robust datasets for training. They also noted that their approach can be extended to apply to other robot datasets and that they plan to further improve RoVi-Aug, including video generation instead of image generation.
“We also plan to apply RoVi-Aug to existing datasets, such as the Open-X Embodiment (OXE) dataset, and are excited about the potential for improving the performance of universal robot policies trained on this data. Expanding the capabilities of RoVi-Aug could significantly increase the flexibility and robustness of these policies for a wider range of robots and tasks,” the researchers concluded.
Arch Town Labs
Research & Innovation
9
1
0
Comments
Recommended
Arch Town Labs
105 subscribers
7 months ago
Subscribe
U.S. X-37B spaceplane prepares for a new series of experiments
The X-37B will use aerobraking to change its orbit.
The U.S. Space Force has announced a series of maneuvers to change orbit and jettison components using its X-37B spaceplane. This reusable aircraft, in orbit since December 28, 2023, will use an aerobraking method to change its trajectory around the Earth. This method involves making multiple passes through the planet's upper atmosphere using at
Show more
Arch Town Labs
Research & Innovation
14
1
0
Arch Town Labs
105 subscribers
7 months ago
Subscribe
New integer addition algorithm can reduce AI power consumption by 95%
The new technique uses integer addition instead of floating point multiplication.
A team of engineers at BitEnergy AI, an AI inference technology company, has reported a method to reduce the power consumption of AI-based applications by 95%. The team published a paper describing their technology on the arXiv preprint server.
As AI applications have become mainstream, their use has skyrocketed, l