-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue with Mobile Aloha Inference in MuJoCo: Robot Wandering Without Performing Actions #24
Comments
Here's the video for the current result. The command is |
I noticed a discrepancy between the simulation environment and the joint range reported in your paper. For joint 3 of the arm, the paper specifies a range of To address this, I tried flipping the joint position in Please let me know if you need more information. |
At this point, I’m thinking that fine-tuning might be required to run inference in simulation environments. OpenVLA also seems to require fine-tuning to achieve reasonable performance on Libero. On the other hand, @chjchjchjchjchj's pull request gives me hope: while the WidowX arm didn’t complete the task, it at least approached the spool (the task instruction is to place the spool on the towel). |
Yes, fine-tuning is needed. By the way, do not use SimplerEnv currently. The origin of their coordinate system is different from that of the real-world data. We are trying to build simulation inference. Stay tuned! |
Thank you for your prompt reply! I’ll look forward to the good news. Regarding the need for fine-tuning, do you think it’s primarily required on the perception side, the control side, or both? |
Both. We usually call it the embodiment gap. |
Philosophically speaking, is it correct to understand that a more diverse set of robots in the training set leads to a smaller embodiment gap in practice (i.e., less fine-tuning needed) when adapting to new robots? |
Yes, I think it is. However, at present, the embodiment diversity of pre-training datasets is far from enough. |
If the fine-tuning is solely for closing the embodiment gap, is it correct to assume that a diverse set of objects or tasks is not strictly necessary (although, of course, it would be beneficial to include them)? Can I assume the model has already learned aspects beyond the embodiment gap (e.g., visual reasoning, task planning, etc.) during pre-training? |
Yes, you are right. |
We encountered the same issue in a real-world scenario and are unsure where the problem lies. We hope to receive some help.
WeChat_20241124224257.mp4 |
@zzl410 Have you fine-tuned the model? It seems quite abnormal... |
Thank you for your attention.We did not perform fine-tuning on the models, using only two base models: Pour water from the bottle into the mug. In all cases, a similar issue occurred where the robotic arm moved upwards during the grasping motion.
|
Interestingly, even in a completely dark experimental environment, the robotic arm still exhibits the same issue. We verified the reception of camera image data, confirming that it is complete and accurate. |
You should fine-tune first. Since the pre-trained checkpoint does not see your embodiment before. |
Nice! when will the simulation inference version be released? |
thank you ,it works. |
Hi folks,
First off, I want to say amazing work—I'm really impressed by this project!
I've been trying to perform inference on the Mobile Aloha robot in MuJoCo, but I'm encountering an issue: the robot seems to wander aimlessly and doesn't perform any meaningful actions. Do you have any suggestions for resolving this?
Here’s my setup:
agilex_inference.py
.The only modifications I’ve made to RDT's code are adjustments to align the gripper action range with Agilex's setup (0 to 0.0475). Specifically:
_format_joint_to_state()
, I changed:_unformat_action_to_joint()
, I changed:Could this modification be causing the issue? Or is there another step I might have missed in setting up Mobile Aloha in MuJoCo for inference?
Thanks again for the great work—I'm eager to get this working and achieve good results in MuJoCo!
The text was updated successfully, but these errors were encountered: