
dc.identifier.uri: http://hdl.handle.net/11401/76062
dc.description.sponsorship: This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree. [en_US]
dc.format: Monograph
dc.format.medium: Electronic Resource [en_US]
dc.language.iso: en_US
dc.publisher: The Graduate School, Stony Brook University: Stony Brook, NY.
dc.type: Dissertation
dcterms.abstract: The focus of this work is on practical applications of stochastic multi-armed bandits (MABs) in two distinct settings. First, we develop and present REGA, a novel adaptive sampling-based algorithm for the control of finite-horizon Markov decision processes (MDPs) with very large state spaces and small action spaces. We apply a variant of the epsilon-greedy multi-armed bandit algorithm to each stage of the MDP in a recursive manner, thereby computing an estimate of the "reward-to-go" value at each stage. We provide a finite-time analysis of REGA; in particular, we bound the probability that the approximation error exceeds a given threshold, where the bound is expressed in terms of the number of samples collected at each stage of the MDP. We empirically compare REGA against other sampling-based algorithms and find that our algorithm is competitive. We also discuss measures to mitigate the curse of dimensionality that arises from the backward-induction nature of REGA when the MDP horizon is large. Second, we introduce e-Discovery, a topic of great significance to the legal industry that concerns sifting through large volumes of data to identify the "needle in the haystack" documents relevant to a lawsuit or investigation. Surprisingly, the topic has not been explicitly investigated in academia. Viewing the problem from a scheduling perspective, we highlight its main properties and challenges and outline a formal model for it. We examine an approach based on related work from the field of scheduling theory and provide simulation results that demonstrate the performance of our approach on a very large data set. We also provide a list-scheduling approach that incorporates a side multi-armed bandit in lieu of standard heuristics. To this end, we propose the first MAB algorithm that accounts for both sleeping bandits and bandits with history. The empirical results are encouraging. Surveys of multi-armed bandits and of scheduling theory are included. Several open problems, both new and known, are proposed or documented.
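The recursive epsilon-greedy idea the abstract describes can be sketched briefly. The Python code below is a minimal illustration of the general technique only (an epsilon-greedy bandit run at each MDP stage, with each arm pull recursively estimating the reward-to-go of the sampled next state); the names sample_next and reward, and all parameter values, are hypothetical assumptions for illustration and do not reflect the dissertation's actual interface or analysis.

import random

def estimate_reward_to_go(state, stage, horizon, actions,
                          sample_next, reward,
                          num_samples=64, epsilon=0.1):
    """Recursive epsilon-greedy reward-to-go estimator (illustrative sketch).

    sample_next(state, action) -> next_state   # hypothetical simulator
    reward(state, action, next_state) -> float # hypothetical reward model
    """
    if stage == horizon:
        return 0.0  # no reward beyond the final stage

    totals = {a: 0.0 for a in actions}  # cumulative sampled returns per arm
    counts = {a: 0 for a in actions}    # number of pulls per arm

    for _ in range(num_samples):
        pulled = [a for a in actions if counts[a] > 0]
        # Epsilon-greedy: explore with probability epsilon (or when no arm
        # has been pulled yet), otherwise exploit the best arm so far.
        if not pulled or random.random() < epsilon:
            a = random.choice(actions)
        else:
            a = max(pulled, key=lambda x: totals[x] / counts[x])

        next_state = sample_next(state, a)  # one stochastic transition
        # One-step reward plus a recursive estimate of the next stage's value.
        q = reward(state, a, next_state) + estimate_reward_to_go(
            next_state, stage + 1, horizon, actions,
            sample_next, reward, num_samples, epsilon)
        totals[a] += q
        counts[a] += 1

    # The stage's value estimate: mean return of the empirically best arm.
    best = max((a for a in actions if counts[a] > 0),
               key=lambda x: totals[x] / counts[x])
    return totals[best] / counts[best]

Note that the per-call branching makes this naive sketch exponentially expensive in the horizon, which is precisely the curse-of-dimensionality concern the abstract raises for large-horizon MDPs.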
dcterms.available: 2017-09-18T23:49:57Z
dcterms.contributor: Arkin, Esther [en_US]
dcterms.contributor: Hu, Jiaqiao [en_US]
dcterms.contributor: Deng, Yuefan [en_US]
dcterms.contributor: Ortiz, Luis [en_US]
dcterms.creator: Muqattash, Isa Mithqal
dcterms.dateAccepted: 2017-09-18T23:49:57Z
dcterms.dateSubmitted: 2017-09-18T23:49:57Z
dcterms.description: Department of Applied Mathematics and Statistics. [en_US]
dcterms.extent: 118 pg. [en_US]
dcterms.format: Monograph
dcterms.format: Application/PDF [en_US]
dcterms.identifier: http://hdl.handle.net/11401/76062
dcterms.issued: 2014-12-01
dcterms.language: en_US
dcterms.provenance: Made available in DSpace on 2017-09-18T23:49:57Z (GMT). No. of bitstreams: 1 Muqattash_grad.sunysb_0771E_12112.pdf: 1705588 bytes, checksum: 06cdc910ffc781300ce04fb3012c18bd (MD5) Previous issue date: 1 [en]
dcterms.publisher: The Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subject: Electronic Discovery, Markov Decision Process (MDP), Multi-Armed Bandit (MAB), Optimization Under Uncertainties, Sampling, Stochastic Scheduling
dcterms.subject: Applied mathematics
dcterms.title: Multi-Armed Bandits with Applications to Markov Decision Processes and Scheduling Problems
dcterms.type: Dissertation

