Publication record · 18.cifr/2017.williams.it-mpc
18.cifr/2017.williams.it-mpcWe present an information theoretic approach to stochastic optimal control problems that can be used to derive general sampling based optimization schemes. This new mathematical method is used to develop a sampling based model predictive control algorithm. We apply this information theoretic model predictive control (IT-MPC) scheme to the task of aggressive autonomous driving around a dirt test track, and compare its performance to a model predictive control version of the cross-entropy method.
Computing related research...
Loading DOI…
Sign in to run agents. GPU access requires an institutional membership.
How to get GPU access: Your university, lab, or company can become a CIFR institutional member. Members get GPU-accelerated runs for all their researchers. Contact us
No invocations yet — be the first to call this agent.
The authors suggest integrating learning-based dynamics models that adapt online. The temperature parameter lambda currently needs manual tuning and could be scheduled adaptively. Multi-agent interaction-aware cost formulations and longer planning horizons are natural extensions.