Reinforcement Learning: Publications
-
[1]
Inference Strategies for Solving Semi−Markov Decision Processes
Matthew Hoffman and Nando de Freitas
Chapter 5. Pages 82–96. Hershey: IGI Global. 2012.
Details about Inference Strategies for Solving Semi−Markov Decision Processes | BibTeX data for Inference Strategies for Solving Semi−Markov Decision Processes | Download (pdf) of Inference Strategies for Solving Semi−Markov Decision Processes
-
[2]
An Expectation Maximization Algorithm for Continuous Markov Decision Processes with Arbitrary Reward
Matthew Hoffman‚ Nando de Freitas‚ Arnaud Doucet and Jan Peters
In Journal of Machine Learning Research − Proceedings Track for Artificial Intelligence and Statistics (AISTATS). Vol. 5. Pages 232–239. 2009.
Details about An Expectation Maximization Algorithm for Continuous Markov Decision Processes with Arbitrary Reward | BibTeX data for An Expectation Maximization Algorithm for Continuous Markov Decision Processes with Arbitrary Reward | Link to An Expectation Maximization Algorithm for Continuous Markov Decision Processes with Arbitrary Reward
-
[3]
A Bayesian exploration−exploitation approach for optimal online sensing and planning with a visually guided mobile robot
Ruben Martinez−Cantin‚ Nando Freitas‚ Eric Brochu‚ José Castellanos and Arnaud Doucet
In Autonomous Robots. Vol. 27. No. 2. Pages 93–103. 2009.
Details about A Bayesian exploration−exploitation approach for optimal online sensing and planning with a visually guided mobile robot | BibTeX data for A Bayesian exploration−exploitation approach for optimal online sensing and planning with a visually guided mobile robot | DOI (10.1007/s10514-009-9130-2) | Link to A Bayesian exploration−exploitation approach for optimal online sensing and planning with a visually guided mobile robot
-
[4]
New inference strategies for solving Markov Decision Processes using reversible jump MCMC
Matthias Hoffman‚ Hendrik Kueck‚ Nando de Freitas and Arnaud Doucet
In Uncertainty in Artificial Intelligence (UAI). Pages 223–231. Corvallis‚ Oregon. 2009.
Details about New inference strategies for solving Markov Decision Processes using reversible jump MCMC | BibTeX data for New inference strategies for solving Markov Decision Processes using reversible jump MCMC | Link to New inference strategies for solving Markov Decision Processes using reversible jump MCMC
-
[5]
Inference and Learning for Active Sensing‚ Experimental Design and Control
Hendrik Kueck‚ Matt Hoffman‚ Arnaud Doucet and Nando Freitas
In Helder Araujo‚ Ana Maria Mendonca‚ Armando J. Pinho and Maria Ines Torres, editors, Pattern Recognition and Image Analysis. Vol. 5524 of Lecture Notes in Computer Science. Pages 1–10. Springer Berlin Heidelberg. 2009.
Details about Inference and Learning for Active Sensing‚ Experimental Design and Control | BibTeX data for Inference and Learning for Active Sensing‚ Experimental Design and Control | DOI (10.1007/978-3-642-02172-5_1) | Link to Inference and Learning for Active Sensing‚ Experimental Design and Control
-
[6]
Target−directed attention: Sequential decision−making for gaze planning
J. Vogel and N. de Freitas
In IEEE International Conference on Robotics and Automation (ICRA). Pages 2372–2379. 2008.
Details about Target−directed attention: Sequential decision−making for gaze planning | BibTeX data for Target−directed attention: Sequential decision−making for gaze planning | DOI (10.1109/ROBOT.2008.4543568)
-
[7]
Bayesian Policy Learning with Trans−Dimensional MCMC
Matthew Hoffman‚ Arnaud Doucet‚ Nando de Freitas and Ajay Jasra
In J.C. Platt‚ D. Koller‚ Y. Singer and S. Roweis, editors, Advances in Neural Information Processing Systems 20. Pages 665–672. MIT Press, Cambridge‚ MA. 2007.
Details about Bayesian Policy Learning with Trans−Dimensional MCMC | BibTeX data for Bayesian Policy Learning with Trans−Dimensional MCMC | Link to Bayesian Policy Learning with Trans−Dimensional MCMC
-
[8]
Estimation and control of industrial processes with particle filters
R. Morales−Menendez‚ N. de Freitas and D. Poole
In American Control Conference. Vol. 1. Pages 579–584. 2003.
Details about Estimation and control of industrial processes with particle filters | BibTeX data for Estimation and control of industrial processes with particle filters | DOI (10.1109/ACC.2003.1239081) | Download (pdf) of Estimation and control of industrial processes with particle filters