Discussion about this post

User's avatar
Jie Wang's avatar

In my opinion, VLAs research is extremely empirical compared to many other directions.

Simulation like Libero is no more statistically meaningful benchmark as we can overfit to ~99% easily right now. Urgent things are: 1) Create new benchmark in sim 2) Show the real world experiments actually improve in behaviors 3) Do more ablation study on data recipe / improvement.

Without above, we can hardly tell if it is the new incremental components work, or just the base model is strong enough to solve the problem.

Expand full comment
3 more comments...

No posts

Ready for more?