eprintid: 3719 rev_number: 7 eprint_status: archive userid: 427 dir: disk0/00/00/37/19 datestamp: 2019-12-06 14:05:08 lastmod: 2019-12-06 14:05:08 status_changed: 2019-12-06 14:05:08 type: monograph metadata_visibility: show creators_name: Nguyen, Cam-Van Thi creators_id: vanntc@vnu.edu.vn corp_creators: Tran Mai-Vu title: A Large Scale Multi-label Text Classification Methodusing Z-Label LDA ispublished: submitted subjects: IT divisions: fac_fit abstract: Multi-label Learning (MLL) is a supervised learning model that hasattracted much attention of the research community in recent yearsbecause of its wide variety applicability. In this paper, we built amulti-label classification model using Latent Dirichlet Allocationwith Topic-in-set Knowlegde (z-Label LDA) on the Vietnamesedata domain. z-Label LDA is a variant of LDA which is intended toprovide additional supervised information as a hidden topic into theLDA called "z-label". We also have experimented on the dataset inthe field of Education collected from Vietnamese online newspapers.Parallel, we applied the hidden topic model LDA to generate aprior-knowledge dataset comprising topics and typical keywordsrepresenting each topic. The supervised information also makesthe topic assignment more consistent. With this approach, theeffectiveness of the model has been demonstrated experimentally,this paper has obtained initial positive results. date: 2019 publisher: UET-VNU contact_email: vanntc@vnu.edu.vn full_text_status: public monograph_type: technical_report pages: 6 citation: Nguyen, Cam-Van Thi (2019) A Large Scale Multi-label Text Classification Methodusing Z-Label LDA. Technical Report. UET-VNU. (Submitted) document_url: https://eprints.uet.vnu.edu.vn/eprints/id/eprint/3719/1/A_Large_Scale_Multi_label_Text_Classification_Method_using_Z_Label_LDA.pdf