dc.identifier.uri	http://hdl.handle.net/11401/77257
dc.description.sponsorship	This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.	en_US
dc.format	Monograph
dc.format.medium	Electronic Resource	en_US
dc.language.iso	en_US
dc.publisher	The Graduate School, Stony Brook University: Stony Brook, NY.
dc.type	Dissertation
dcterms.abstract	The human visual perception is a complex system that excels in tasks such as object recognition and localization, face detection, object segmentation, action classification, and more. While we perform these everyday tasks with ease, little is known about the underlying process of the human visual system. In contrast, computer vision is a field of research that strives for better performance in the aforementioned tasks, which rely on carefully designed statistical machine learning based theories. As the field of computer vision and machine learning has matured over the past decade, there is an abundance of methods and theories that can be utilized for modeling human visual perception, which may shed more light on the underlying processes that make biological visual perception possible. This dissertation discusses novel computational models with the emphasis in visual clutter perception using proto-objects, and categorical search with category consistent features, two important problems in understanding human visual perception. Visual clutter is a global perception defined as being "crowded disorderly", affects aspects of our lives ranging from object detection to aesthetics, yet relatively little effort has been made to model this ubiquitous percept. Our approach models clutter as the number of proto-objects segmented from an image, with proto-objects defined as groupings of superpixels that are similar in low-level features. The proto-object model outperforms all other existing models and even a behavioral object segmentation ground truth, which indicates that the number of proto-objects in an image affects clutter perception more than the number of objects or the complexity of features. In a scope of perception that is more local within a visual field, object category recognition requires one to identify the correct category of a given object from an image. To better understand the human object recognition process, we introduce a generative model of category representation called the category-consistent features (CCFs) from images of category exemplars. The CCF model extracts category representative information from SIFT Bag-of-words (BoW) models and is able to predict human behavior in the context of a categorical search task. Finally, we introduce a ventral-stream inspired deep convolutional neural network (VsNet) and a convolutional version of the HMAX model (Deep-HMAX), and analyze these models with the baseline AlexNet under the representational similarity analysis framework (RSA). The results show that the two biologically-inspired models achieve higher object classification accuracies, and the layer-wise representations are more similar between the more biologically-informed models than that of the baseline model.
dcterms.available	2017-09-20T16:52:18Z
dcterms.contributor	Nguyen, Minh Hoai	en_US
dcterms.contributor	Samaras, Dimitris	en_US
dcterms.contributor	Konkle, Talia.	en_US
dcterms.contributor	Zelinsky, Gregory J	en_US
dcterms.creator	Yu, Chen-Ping
dcterms.dateAccepted	2017-09-20T16:52:18Z
dcterms.dateSubmitted	2017-09-20T16:52:18Z
dcterms.description	Department of Computer Science	en_US
dcterms.extent	141 pg.	en_US
dcterms.format	Monograph
dcterms.format	Application/PDF	en_US
dcterms.identifier	http://hdl.handle.net/11401/77257
dcterms.issued	2016-12-01
dcterms.language	en_US
dcterms.provenance	Made available in DSpace on 2017-09-20T16:52:18Z (GMT). No. of bitstreams: 1 Yu_grad.sunysb_0771E_13012.pdf: 17707277 bytes, checksum: 8bae622ff02f5ee828e35807974b60fc (MD5) Previous issue date: 1	en
dcterms.publisher	The Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subject	clustering, computational model, computer vision, deep learning, machine learning, proto-object
dcterms.subject	Computer science -- Cognitive psychology
dcterms.title	Computational models of visual features: from proto-objects to object categories
dcterms.type	Dissertation

Files in this item

Name:: Yu_grad.sunysb_0771E_13012.pdf
Size:: 16.88Mb
Format:: application/pdf

View/Open

This item appears in the following Collection(s)

Stony Brook Theses and Dissertations Collection [4009]

Show simple item record

Computational models of visual features: from proto-objects to object categories

Files in this item

This item appears in the following Collection(s)