%0 Journal Article %A Pham, Thi Ngan %A Nguyen, Van Quang %A Tran, Van Hien %A Nguyen, Tri Thanh %A Ha, Quang Thuy %D 2017 %F SisLab:2504 %J Journal of Information and Telecommunication %T A semi-supervised multi-label classification framework with feature reduction and enrichment %U https://eprints.uet.vnu.edu.vn/eprints/id/eprint/2504/ %X Multi-label classification has drawn much attention thanks to its usefulness and omnipresence in real-world applications in which objects may be char-acterized by more than one label as in the traditional approach. Getting mul-ti-label examples is costly and time-consuming therefore semi-supervised learning approach should be considered to take advantages of both labeled and unlabeled data. In this work, we propose a semi-supervised multi-label classification algorithm exploiting the specific features of the prominent class label(s) chosen by a greedy approach as an extension of the LIFT algo-rithm, and unlabeled data consumption mechanism from the TESC algo-rithm. We also make a semi-supervised multi-label classification application framework for Vietnamese texts with several feature enrichment steps in-cluding a) a stage of enriching features by adding hidden topic features; b) a stage of dimensional reduction for subtracting irrelevant features. Experi-mental results on a dataset of hotel reviews (for tourism) indicate that a rea-sonable amount of unlabeled data helps to increase the F1 score. Interesting-ly, with a small amount of labeled data, our algorithm can reach a compara-tive performance to the case of using a larger amount of labeled data.