The logic of adaptive behavior : knowledge representation and algorithms for the Markov decision process framework in first-order domains
Otterlo van, Martijn (2008) The logic of adaptive behavior : knowledge representation and algorithms for the Markov decision process framework in first-order domains. thesis.
|Abstract:||Learning and reasoning in large, structured, probabilistic worlds is at the heart of artificial intelligence. Markov decision processes have become the de facto standard in modeling and solving sequential decision making problems under uncertainty. Many efficient reinforcement learning and dynamic programming techniques exist that can solve such problems.
Until recently, the representational state-of-the-art in this field was based on propositional representations.
However, it is hard to imagine a truly general, intelligent system that does not conceive of the world in terms of objects and their properties and relations to other objects. To this end, this book studies lifting Markov decision processes, reinforcement learning and dynamic programming to the first-order (or, relational) setting. Based on an extensive analysis of propositional representations and techniques, a methodological translation is constructed from the propositional to the relational setting. Furthermore, this book provides a thorough and complete description of the state-of-the-art, it surveys vital, related historical developments and it contains extensive descriptions of several new model-free and model-based solution techniques.
|Link to this item:||http://purl.utwente.nl/publications/58987|
|Export this item as:||BibTeX|
Daily downloads in the past month
Monthly downloads in the past 12 months
Repository Staff Only: item control page
Metis ID: 251036