6 Comments
User's avatar
Chris Matthieu's avatar

Great post!

Do you consider the MolmoAct project and Action Reasoning Model (https://allenai.org/blog/molmoact) just another VLA or do you think they are doing something different/novel? Thanks.

Expand full comment
Chris Paxton's avatar

Great question. We actually just recorded a RoboPapers episode on MolmoAct last night! And i do think they're making some important innovations on the basic formula, especially pioneering what "reasoning" would look like in the context of robotics. I want to do a follow up post (maybe next week?) on some innovations and changes to the core formula, which so far seems pretty fixed across all the major players!

Expand full comment
Chris Matthieu's avatar

🙏🏼

Expand full comment
Advait Patel's avatar

> This particular problem would go away if everyone had just kept contributing to Open-X Embodiment like they were supposed to. But data is expensive, it’s the new coding, and in a very real way it’s your “moat”: it’s unreasonable to expect private companies to share large amounts of data freely.

I’m hoping that over time, LeRobot catches on in the research and hobbyist community and eventually all we’ll have to do is go through and aggregate it. It already seems to have some sway - IK SmolVLA used LeRobot data and iirc GR00T trained on SO100 data.

Expand full comment
Chris Paxton's avatar

Yeah I hope so too. I think LeRobot is a great project, though it too has its shortcomings.

Expand full comment
Albert Hu's avatar

Hi, Chris, great post. Very nice overview. 👍

Expand full comment