Show simple item record

dc.identifier.urihttp://hdl.handle.net/11401/78141
dc.description.sponsorshipThis work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degreeen_US
dc.formatMonograph
dc.format.mediumElectronic Resourceen_US
dc.language.isoen_US
dc.typeDissertation
dcterms.abstractThe structure of real-world data (in the form of feature matrix) includes crucial information relevant to the performance of machine learning and data mining algorithms. The structure could be local manifold structure, global structure or discriminative information based on the requirements of learning or mining tasks. To model this intrinsic structure, an effective graph representation like k-nearest neighbor graph is necessary. Considering the increasing data size in this digital era, efficient sparse graph representations without parameter tuning are very demanding. In this thesis, we build novel sparse and nonparametric graph representation algorithms for unsupervised learning. The theory foundation of our research works is the similarity graph of Sparse Subspace Clustering. Our research works focus on: (1) alleviate the negative impacts of losing subspace structure assumption about the data: remove non-local edges and generate consistent edge connections, (2) solve the scalability issue for large size data: apply greedy algorithm with ranked dictionaries, (3) applications in unsupervised learning: redundant feature removal for high dimensional data. Moreover, this thesis includes graph structure analysis which connects to the quality of graph following Dense Subgraph theory: (1) data label estimation through dense subgraphs for semi-supervised learning, (2) graph robustness which can differentiate the topology and scale of subgraphs.
dcterms.available2018-03-22T22:39:05Z
dcterms.contributorQin, Hongen_US
dcterms.contributorQin, Hong.en_US
dcterms.contributorWang, Fushengen_US
dcterms.contributorSamaras, Dimitrisen_US
dcterms.contributorOrabona, Francescoen_US
dcterms.contributorHarrison, Robert J.en_US
dcterms.creatorHan, Shuchu
dcterms.dateAccepted2018-03-22T22:39:05Z
dcterms.dateSubmitted2018-03-22T22:39:05Z
dcterms.descriptionDepartment of Computer Science.en_US
dcterms.extent170 pg.en_US
dcterms.formatApplication/PDFen_US
dcterms.formatMonograph
dcterms.identifierhttp://hdl.handle.net/11401/78141
dcterms.issued2017-08-01
dcterms.languageen_US
dcterms.provenanceMade available in DSpace on 2018-03-22T22:39:05Z (GMT). No. of bitstreams: 1 Han_grad.sunysb_0771E_13328.pdf: 6486537 bytes, checksum: 6cb651f521415edc647aec36004a7ae4 (MD5) Previous issue date: 2017-08-01en
dcterms.subjectComputer science
dcterms.subjectdata mining
dcterms.subjectgraph theory
dcterms.subjectmachine learning
dcterms.subjectsparse graph
dcterms.titleSparse Graph Representation and Its Applications
dcterms.typeDissertation


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record