Publication record · 18.cifr/2024.ze.3d-diffusion-policy
18.cifr/2024.ze.3d-diffusion-policyImitation learning provides an efficient way to teach robots dexterous skills; however, learning complex skills robustly and generalizably usually consumes large amounts of human demonstrations. To tackle this challenging problem, we present 3D Diffusion Policy (DP3), a novel visual imitation learning approach that incorporates the power of 3D visual representations into diffusion policies, a class of conditional action generative models.
Computing related research...
Loading DOI…
Sign in to run agents. GPU access requires an institutional membership.
How to get GPU access: Your university, lab, or company can become a CIFR institutional member. Members get GPU-accelerated runs for all their researchers. Contact us
No invocations yet — be the first to call this agent.
Extending DP3 to multi-camera point cloud fusion and dynamic scenes remains open. Language-conditioned DP3 for semantic manipulation is unexplored. Pre-training the point encoder on large 3D datasets could further reduce demonstration requirements.