dc.identifier.uri | http://hdl.handle.net/11401/78141 | |
dc.description.sponsorship | This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree | en_US |
dc.format | Monograph | |
dc.format.medium | Electronic Resource | en_US |
dc.language.iso | en_US | |
dc.type | Dissertation | |
dcterms.abstract | The structure of real-world data (in the form of feature matrix) includes crucial information relevant to the performance of machine learning and data mining algorithms. The structure could be local manifold structure, global structure or discriminative information based on the requirements of learning or mining tasks. To model this intrinsic structure, an effective graph representation like k-nearest neighbor graph is necessary. Considering the increasing data size in this digital era, efficient sparse graph representations without parameter tuning are very demanding. In this thesis, we build novel sparse and nonparametric graph representation algorithms for unsupervised learning. The theory foundation of our research works is the similarity graph of Sparse Subspace Clustering. Our research works focus on: (1) alleviate the negative impacts of losing subspace structure assumption about the data: remove non-local edges and generate consistent edge connections, (2) solve the scalability issue for large size data: apply greedy algorithm with ranked dictionaries, (3) applications in unsupervised learning: redundant feature removal for high dimensional data. Moreover, this thesis includes graph structure analysis which connects to the quality of graph following Dense Subgraph theory: (1) data label estimation through dense subgraphs for semi-supervised learning, (2) graph robustness which can differentiate the topology and scale of subgraphs. | |
dcterms.available | 2018-03-22T22:39:05Z | |
dcterms.contributor | Qin, Hong | en_US |
dcterms.contributor | Qin, Hong. | en_US |
dcterms.contributor | Wang, Fusheng | en_US |
dcterms.contributor | Samaras, Dimitris | en_US |
dcterms.contributor | Orabona, Francesco | en_US |
dcterms.contributor | Harrison, Robert J. | en_US |
dcterms.creator | Han, Shuchu | |
dcterms.dateAccepted | 2018-03-22T22:39:05Z | |
dcterms.dateSubmitted | 2018-03-22T22:39:05Z | |
dcterms.description | Department of Computer Science. | en_US |
dcterms.extent | 170 pg. | en_US |
dcterms.format | Application/PDF | en_US |
dcterms.format | Monograph | |
dcterms.identifier | http://hdl.handle.net/11401/78141 | |
dcterms.issued | 2017-08-01 | |
dcterms.language | en_US | |
dcterms.provenance | Made available in DSpace on 2018-03-22T22:39:05Z (GMT). No. of bitstreams: 1
Han_grad.sunysb_0771E_13328.pdf: 6486537 bytes, checksum: 6cb651f521415edc647aec36004a7ae4 (MD5)
Previous issue date: 2017-08-01 | en |
dcterms.subject | Computer science | |
dcterms.subject | data mining | |
dcterms.subject | graph theory | |
dcterms.subject | machine learning | |
dcterms.subject | sparse graph | |
dcterms.title | Sparse Graph Representation and Its Applications | |
dcterms.type | Dissertation | |