dc.identifier.uri | http://hdl.handle.net/11401/77273 | |
dc.description.sponsorship | This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree. | en_US |
dc.format | Monograph | |
dc.format.medium | Electronic Resource | en_US |
dc.language.iso | en_US | |
dc.publisher | The Graduate School, Stony Brook University: Stony Brook, NY. | |
dc.type | Dissertation | |
dcterms.abstract | Truly understanding natural language requires grounding language to perceptions and actions in the physical and social world. This goes beyond studying the textual modality alone. Today's web offers not only a sheer volume of data but also increasingly multi-modal data, intertwining text with videos, images, audio, and ontologies that are perceptions or abstractions of people's everyday lives. Hence the web provides rich and ever-growing resources for studying grounded language. This thesis presents a series of investigations of language woven into various types of online data, ranging from ontologies and images to time series. We contribute data distillation approaches and large-scale datasets connecting language to vision, a collection of models and algorithms, and multiple novel applications in hierarchical product classification, image description, and photo album summarization. | |
dcterms.available | 2017-09-20T16:52:19Z | |
dcterms.contributor | Warren, David | en_US |
dcterms.contributor | Warren, David S | en_US |
dcterms.contributor | Fodor, Paul | en_US |
dcterms.contributor | Ramakrishnan, I.V. | en_US |
dcterms.contributor | Choi, Yejin | en_US |
dcterms.contributor | Hajishirzi, Hannaneh | en_US |
dcterms.creator | Chen, Jianfu | |
dcterms.dateAccepted | 2017-09-20T16:52:19Z | |
dcterms.dateSubmitted | 2017-09-20T16:52:19Z | |
dcterms.description | Department of Computer Science. | en_US |
dcterms.extent | 93 pg. | en_US |
dcterms.format | Monograph | |
dcterms.format | Application/PDF | en_US |
dcterms.identifier | http://hdl.handle.net/11401/77273 | |
dcterms.issued | 2015-12-01 | |
dcterms.language | en_US | |
dcterms.provenance | Made available in DSpace on 2017-09-20T16:52:19Z (GMT). No. of bitstreams: 1
Chen_grad.sunysb_0771E_12566.pdf: 11778509 bytes, checksum: 0eed936a3cbf34c61d31b5daa841435a (MD5)
Previous issue date: 1 | en |
dcterms.publisher | The Graduate School, Stony Brook University: Stony Brook, NY. | |
dcterms.subject | Computer science | |
dcterms.subject | big data, computer vision, language grounding, Natural Language Processing, web | |
dcterms.title | Language Grounding in Massive Online Data | |
dcterms.type | Dissertation | |