Generalization

Researchers at Carnegie Mellon University have shared a pre-print paper on generalized robot training within a small “practical data budget.” The team developed a system that breaks movement tasks into 12 “skills” (e.g., pick, place, slide, wipe) that can be combined to create new and complex trajectories within at least somewhat novel scenarios, called MT-ACT: Multi-Task Action Chunking Transformer. The authors write:

Trained merely on 7500 trajectories, we are demonstrating a universal RoboAgent that can exhibit a diverse set of 12 non-trivial manipulation skills (beyond picking/pushing, including articulated object manipulation and object re-orientation) across 38 tasks and can generalize them to 100s of diverse unseen scenarios (involving unseen objects, unseen tasks, and to completely unseen kitchens). RoboAgent can also evolve its capabilities with new experiences.

Continue reading “RoboAgent Gets Its MT-ACT Together” →

Hackaday

1 Articles

RoboAgent Gets Its MT-ACT Together

Search

Never miss a hack

If you missed it

CCA Ethernet Cables: Not Up To Scratch, But Are They Dangerous?

With Affordable Storage Options Dwindling, Where To Store Our Data?

Ask Hackaday: How Much Compute Is Enough?

WheatForce: Learning From CPU Architecture Mistakes

Improving FDM Filament Drying With A Spot Of Vacuum

Our Columns

2026 Hackaday Europe: First Round Of Speakers Announced!

In Space (Probably) Everyone Can Hear You.. Well, You Know

Re-Learning How To Run

Hackaday Podcast Episode 364: Clocks, Cameras, And Free Will

This Week In Security: The Supply Chain Has Problems

Search

Never miss a hack

Subscribe

If you missed it

Our Columns