dc.identifier.uri: http://hdl.handle.net/1951/59730
dc.identifier.uri: http://hdl.handle.net/11401/71296
dc.description.sponsorship: This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.
dc.format: Monograph
dc.format.medium: Electronic Resource
dc.language.iso: en_US
dc.publisher: The Graduate School, Stony Brook University: Stony Brook, NY.
dc.type: Dissertation
dcterms.abstract: This dissertation focuses on training autonomous agents to plan and act under uncertainty, specifically for cases where the underlying state spaces are continuous. Partially Observable Markov Decision Processes (POMDPs) are a class of models for training agents to seek high rewards or low costs while navigating a state space without knowing their true location. Information about an agent's location is gathered in the form of possibly nonlinear and noisy measurements that are functions of the true location. An exactly solved POMDP allows an agent to optimally balance seeking rewards against seeking information about its position in the state space. Solving POMDPs exactly is computationally intractable for continuous state domains, motivating the need for efficient approximate solutions. The algorithm considered in this thesis is the Parametric POMDP (PPOMDP) method. PPOMDP represents an agent's knowledge as a parameterised probability distribution and can infer the impact of future actions and observations. The contribution of this thesis is a set of enhancements to the PPOMDP algorithm that significantly improve training and plan-execution times. Several aspects of the original algorithm are generalized, and the impact on training time, execution time, and performance is measured on a variety of classic robot navigation models from the literature. In addition, a mathematically principled, threefold adaptive sampling scheme is implemented, which automatically varies the amount of sampling according to the complexity of the posterior distributions. Finally, a forward search algorithm is proposed to improve execution performance for sparse belief sets by searching several plies deeper than previous implementations allow.
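
As a rough intuition for the parametric belief representation the abstract describes, the Python sketch below shows a Gaussian belief over a 1-D continuous state, parameterised by its mean and variance, propagated through one action (predict) and one noisy measurement (update). This is a minimal illustration under an assumed linear-Gaussian model, not the dissertation's PPOMDP implementation; all names and parameter values are hypothetical.

import math

# Minimal illustrative sketch (not the dissertation's code): a parametric
# (Gaussian) belief over a 1-D continuous state, updated by an action
# (predict) and a noisy position measurement (correct), Bayes-filter style.

class GaussianBelief:
    """Belief over the agent's true position, parameterised by (mean, var)."""

    def __init__(self, mean: float, var: float):
        self.mean = mean
        self.var = var

    def predict(self, action: float, motion_var: float) -> None:
        # Acting shifts the mean; motion noise inflates the variance.
        self.mean += action
        self.var += motion_var

    def update(self, measurement: float, meas_var: float) -> None:
        # Linear-Gaussian (Kalman) correction: the posterior stays Gaussian,
        # so only the two parameters need updating.
        k = self.var / (self.var + meas_var)  # Kalman gain
        self.mean += k * (measurement - self.mean)
        self.var *= (1.0 - k)

    def entropy(self) -> float:
        # Differential entropy: a proxy for how uncertain the agent is,
        # relevant when trading reward-seeking against information-seeking.
        return 0.5 * math.log(2.0 * math.pi * math.e * self.var)

if __name__ == "__main__":
    b = GaussianBelief(mean=0.0, var=4.0)
    b.predict(action=1.0, motion_var=0.5)    # act: move right by 1
    b.update(measurement=1.3, meas_var=1.0)  # sense: noisy position reading
    print(f"mean={b.mean:.3f}, var={b.var:.3f}, H={b.entropy():.3f}")

Because the posterior remains in the same parametric family, a planner can evaluate candidate action-observation sequences by updating only these few parameters rather than a full distribution, which is the kind of efficiency a parametric belief approach aims for.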
dcterms.available: 2013-05-22T17:34:56Z
dcterms.available: 2015-04-24T14:46:55Z
dcterms.contributor: Xing, Haipeng
dcterms.contributor: Zhu, Wei
dcterms.contributor: Hu, Jiaqiao
dcterms.contributor: Zhang, Minghua
dcterms.creator: Knapik, Timothy Ryan
dcterms.dateAccepted: 2013-05-22T17:34:56Z
dcterms.dateAccepted: 2015-04-24T14:46:55Z
dcterms.dateSubmitted: 2013-05-22T17:34:56Z
dcterms.dateSubmitted: 2015-04-24T14:46:55Z
dcterms.description: Department of Applied Mathematics and Statistics
dcterms.extent: 132 pg.
dcterms.format: application/pdf
dcterms.format: Monograph
dcterms.identifier: Knapik_grad.sunysb_0771E_11143
dcterms.identifier: http://hdl.handle.net/1951/59730
dcterms.identifier: http://hdl.handle.net/11401/71296
dcterms.issued: 2012-12-01
dcterms.language: en_US
dcterms.provenance: Made available in DSpace on 2013-05-22T17:34:56Z (GMT). No. of bitstreams: 1. Knapik_grad.sunysb_0771E_11143.pdf: 1433740 bytes, checksum: 4ae62e04bfae35ab2298f5030910c931 (MD5). Previous issue date: 1
dcterms.provenance: Made available in DSpace on 2015-04-24T14:46:55Z (GMT). No. of bitstreams: 3. Knapik_grad.sunysb_0771E_11143.pdf.jpg: 1894 bytes, checksum: a6009c46e6ec8251b348085684cba80d (MD5). Knapik_grad.sunysb_0771E_11143.pdf.txt: 183114 bytes, checksum: 142d64b0fdc9646abd11ab7db6cd9f81 (MD5). Knapik_grad.sunysb_0771E_11143.pdf: 1433740 bytes, checksum: 4ae62e04bfae35ab2298f5030910c931 (MD5). Previous issue date: 1
dcterms.publisher: The Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subject: Mathematics
dcterms.subject: Continuous State Space, Partially Observable Markov Decision Processes
dcterms.title: Approximating Partially Observable Markov Decision Processes with Parametric Belief Distributions for Continuous State Spaces
dcterms.type: Dissertation

