relation: https://eprints.uet.vnu.edu.vn/eprints/id/eprint/2504/ title: A semi-supervised multi-label classification framework with feature reduction and enrichment creator: Pham, Thi Ngan creator: Nguyen, Van Quang creator: Tran, Van Hien creator: Nguyen, Tri Thanh creator: Ha, Quang Thuy subject: Information Technology (IT) description: Multi-label classification has drawn much attention thanks to its usefulness and omnipresence in real-world applications in which objects may be char-acterized by more than one label as in the traditional approach. Getting mul-ti-label examples is costly and time-consuming therefore semi-supervised learning approach should be considered to take advantages of both labeled and unlabeled data. In this work, we propose a semi-supervised multi-label classification algorithm exploiting the specific features of the prominent class label(s) chosen by a greedy approach as an extension of the LIFT algo-rithm, and unlabeled data consumption mechanism from the TESC algo-rithm. We also make a semi-supervised multi-label classification application framework for Vietnamese texts with several feature enrichment steps in-cluding a) a stage of enriching features by adding hidden topic features; b) a stage of dimensional reduction for subtracting irrelevant features. Experi-mental results on a dataset of hotel reviews (for tourism) indicate that a rea-sonable amount of unlabeled data helps to increase the F1 score. Interesting-ly, with a small amount of labeled data, our algorithm can reach a compara-tive performance to the case of using a larger amount of labeled data. date: 2017 type: Article type: PeerReviewed identifier: Pham, Thi Ngan and Nguyen, Van Quang and Tran, Van Hien and Nguyen, Tri Thanh and Ha, Quang Thuy (2017) A semi-supervised multi-label classification framework with feature reduction and enrichment. Journal of Information and Telecommunication .