Show simple item record

dc.identifier.urihttp://hdl.handle.net/11401/77355
dc.description.sponsorshipThis work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.en_US
dc.formatMonograph
dc.format.mediumElectronic Resourceen_US
dc.language.isoen_US
dc.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dc.typeDissertation
dcterms.abstractThe imbalanced class problem in classification is highly relevant in many realistic scenarios such as the detection of a rare condition. One solution is to design specific algorithms incorporating the unbalanced classes in the training process of a classifier. In this dissertation, we propose a novel multi-class classification tree based on the area under the ROC curve (AUC) to resolve the imbalanced classification problem. This tree classifier aims to maximize the sum of AUC for all one versus all classifiers at the node attribute selection stage while balancing the performance of sensitivity and specificity of all one versus all classification at the node threshold selection stage. The ROC tree is extended to ROC random forest with suitable modifications. Furthermore, the volume under surface (VUS), the extension of AUC for multi-class classification, is discussed in this dissertation as well and used to measure the performance of classifiers. The simulation results show that this multi-class ROC tree/forest method is superior to the classic CART/random forest on severely imbalanced multi-class classification problems, while the ROC random forest performs equally well as the SMOTE random forest on imbalanced binary classification problems. The application on Boston housing data shows that the ROC random forest can also be used for model ensemble and it performs better than all the base models and other ensemble methods in this application.
dcterms.available2017-09-20T16:52:33Z
dcterms.contributorZhu, Weien_US
dcterms.contributorKuan, Pei Fenen_US
dcterms.contributorWu, Songen_US
dcterms.contributorXiao, Keli.en_US
dcterms.creatorYan, Jiaju
dcterms.dateAccepted2017-09-20T16:52:33Z
dcterms.dateSubmitted2017-09-20T16:52:33Z
dcterms.descriptionDepartment of Applied Mathematics and Statisticsen_US
dcterms.extent100 pg.en_US
dcterms.formatMonograph
dcterms.formatApplication/PDFen_US
dcterms.identifierhttp://hdl.handle.net/11401/77355
dcterms.issued2017-05-01
dcterms.languageen_US
dcterms.provenanceMade available in DSpace on 2017-09-20T16:52:33Z (GMT). No. of bitstreams: 1 Yan_grad.sunysb_0771E_13287.pdf: 1218593 bytes, checksum: 33979f0ce95cd72b3427f233cb15d5c6 (MD5) Previous issue date: 1en
dcterms.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subjectAUC, Imbalanced Classification, Model Ensemble, Random Forest, ROC, Tree Based Method
dcterms.subjectStatistics
dcterms.titleMulti-Class ROC Random Forest for Imbalanced Classification
dcterms.typeDissertation


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record