Robotics – Inteliiai

Skip to content Skip to sidebar Skip to footer

Researchers from Stanford and Cornell Introduce APRICOT: A Novel AI Approach that Merges LLM-based Bayesian Active Preference Learning with Constraint-Aware Task Planning

Robotics2 days ago2Views 0Likes 0Comments

In the rapidly evolving field of household robotics, a significant challenge has emerged in executing personalized organizational tasks, such as arranging groceries in a refrigerator. These tasks require robots to balance user preferences with physical constraints while avoiding collisions and maintaining stability. While Large Language Models (LLMs) enable natural language communication of user preferences, this…

Google DeepMind Researchers Propose RT-Affordance: A Hierarchical Method that Uses Affordances as an Intermediate Representation for Policies

RoboticsNovember 12, 20244Views 0Likes 0Comments

In recent years, there has been significant development in the field of large pre-trained models for learning robot policies. The term “policy representation” here refers to the different ways of interfacing with the decision-making mechanisms of robots, which can potentially facilitate generalization to new tasks and environments. Vision-language-action (VLA) models are pre-trained with large-scale robot…

Latent Action Pretraining for General Action models (LAPA): An Unsupervised Method for Pretraining Vision-Language-Action (VLA) Models without Ground-Truth Robot Action Labels

RoboticsOctober 23, 202413Views 0Likes 0Comments

Vision-Language-Action Models (VLA) for robotics are trained by combining large language models with vision encoders and then fine-tuning them on various robot datasets; this allows generalization to new instructions, unseen objects, and distribution shifts. However, various real-world robot datasets mostly require human control, which makes scaling difficult. On the other hand, Internet video data offers…

Google DeepMind’s SIMA Project Enhances Agent Performance in Dynamic 3D Environments Across Various Platforms

RoboticsOctober 13, 202416Views 0Likes 0Comments

The exploration of artificial intelligence within dynamic 3D environments has emerged as a critical area of research, aiming to bridge the gap between static AI applications and their real-world usability. Researchers at Google DeepMind have pioneered this realm, developing sophisticated agents capable of interpreting and acting on complex instructions within various simulated settings. This new…

This AI Paper Proposes a Pipeline for Improving Imitation Learning Performance with a Small Human Demonstration Budget

RoboticsOctober 8, 202416Views 0Likes 0Comments

The practical application of robotic technology in automatic assembly processes holds immense value. However, traditional robotic systems have struggled to adapt to the demands of production environments characterized by high-mix, low-volume manufacturing. Robotic learning presents a potential solution to this challenge by enabling robots to acquire assembly skills through demonstration rather than scripted trajectories, thus…

Researchers at Stanford Propose TRANSIC: A Human-in-the-Loop Method to Handle the Sim-to-Real Transfer of Policies for Contact-Rich Manipulation Tasks

RoboticsSeptember 30, 202416Views 0Likes 0Comments

Learning in simulation and applying the learned policy to the real world is a potential approach to enable generalist robots, and solve complex decision-making tasks. However, the challenge to this approach is to address simulation-to-reality (sim-to-real) gaps. Also, a huge amount of data is needed while learning to solve these tasks, and the load of…

OpenVLA: A 7B-Parameter Open-Source VLA Setting New State-of-the-Art for Robot Manipulation Policies

RoboticsSeptember 25, 202422Views 0Likes 0Comments

A major weakness of current robotic manipulation policies is their inability to generalize beyond their training data. While these policies, trained for specific skills or language instructions, can adapt to new conditions like different object positions or lighting, they often fail when faced with scene distractors or new objects, and need help to follow unseen…

A Simple Open-loop Model-Free Baseline for Reinforcement Learning Locomotion Tasks without Using Complex Models or Computational Resources

RoboticsSeptember 20, 202426Views 0Likes 0Comments

The field of deep reinforcement learning (DRL) is expanding the capabilities of robotic control. However, there has been a growing trend of increasing algorithm complexity. As a result, the latest algorithms need many implementation details to perform well on different levels, causing issues with reproducibility. Moreover, even state-of-the-art DRL models have simple problems, like the…

Hyperion: A Novel, Modular, Distributed, High-Performance Optimization Framework Targeting both Discrete and Continuous-Time SLAM Applications

RoboticsSeptember 15, 202422Views 0Likes 0Comments

In robotics, understanding the position and movement of a sensor suite within its environment is crucial. Traditional methods, called Simultaneous Localization and Mapping (SLAM), often face challenges with unsynchronized sensor data and require complex computations. These methods must estimate the position at discrete time intervals, making it difficult to handle data from various sensors that…

Google DeepMind Researchers Present Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs

RoboticsSeptember 10, 202421Views 0Likes 0Comments

Technological advancements in sensors, AI, and processing power have propelled robot navigation to new heights in the last several decades. To take robotics to the next level and make them a regular part of our lives, many studies suggest transferring the natural language space of ObjNav and VLN to the multimodal space so the robot…