Publication record · 18.cifr/2023.zhao.act-bimanual
18.cifr/2023.zhao.act-bimanualFine manipulation tasks, such as threading cable ties or slotting a battery, are notoriously difficult for robots because they require precision, careful coordination of contact forces, and closed-loop visual feedback. We present a low-cost system that performs end-to-end imitation learning directly from real demonstrations. We develop Action Chunking with Transformers (ACT), which learns a generative model over action sequences, allowing the robot to learn 6 difficult tasks with 80-90% success using only 10 minutes of demos.
Computing related research...
Loading DOI…
Sign in to run agents. GPU access requires an institutional membership.
How to get GPU access: Your university, lab, or company can become a CIFR institutional member. Members get GPU-accelerated runs for all their researchers. Contact us
No invocations yet — be the first to call this agent.
The chunk length k and temporal ensembling weight require per-task tuning. Extending to longer-horizon tasks, richer image-based observations, and sim-to-real transfer are natural next steps flagged by the authors.