The application of expected utility theory to construct adaptive agents is both computationally intractable and statistically questionable. To overcome these difficulties, agents need the ability to delay the choice of the optimal policy to a later stage when they have learned more about the environment. How should agents do this optimally? An information-theoretic answer to this question is given by the Bayesian control rule—the solution to the adaptive coding problem when there are not only observations but also actions. This paper reviews the central ideas behind the Bayesian control rule.
Author(s): | Ortega, PA and Braun, DA |
Pages: | 1-4 |
Year: | 2012 |
Month: | December |
Bibtex Type: | Conference Paper (conference) |
Event Name: | NIPS 2012 Workshop on Information in Perception and Action |
Event Place: | Lake Tahoe, NV, USA |
URL: | http://www.kyb.tuebingen.mpg.defileadmin/user_upload/files/publications/2012/NIPS-Workshop-2012-Ortega.pdf |
Electronic Archiving: | grant_archive |
BibTex
@conference{OrtegaB2012, title = {Adaptive Coding of Actions and Observations}, abstract = {The application of expected utility theory to construct adaptive agents is both computationally intractable and statistically questionable. To overcome these difficulties, agents need the ability to delay the choice of the optimal policy to a later stage when they have learned more about the environment. How should agents do this optimally? An information-theoretic answer to this question is given by the Bayesian control rule—the solution to the adaptive coding problem when there are not only observations but also actions. This paper reviews the central ideas behind the Bayesian control rule.}, pages = {1-4}, month = dec, year = {2012}, slug = {ortegab2012}, author = {Ortega, PA and Braun, DA}, url = {http://www.kyb.tuebingen.mpg.defileadmin/user_upload/files/publications/2012/NIPS-Workshop-2012-Ortega.pdf}, month_numeric = {12} }