VNU-UET Repository

A Positive-Unlabeled Learning Model for Extending a Vietnamese Petroleum Dictionary Based on Vietnamese Wikipedia Data

Vu, Ngoc Trinh and Nguyen, Quoc Dat and Nguyen, Tien Dat and Nguyen, Manh Cuong and Vu, Van Vuong and Ha, Quang Thuy (2018) A Positive-Unlabeled Learning Model for Extending a Vietnamese Petroleum Dictionary Based on Vietnamese Wikipedia Data. In: 2018 10th Asian Conference on Intelligent Information and Database Systems (ACIIDS), 2018, Dong Hoi, Vietnam.

This is the latest version of this item.

Full text not available from this repository.

Abstract

This study presents a positive-unlabeled learning model for extending a Vietnamese petroleum dictionary using Vietnamese Wikipedia data. The model applies machine learning algorithms for positive and unlabeled data together with Google similarity distance and cosine similarity, used both separately and in combination. The integrated data sources are an English-Vietnamese oil and gas dictionary and the Vietnamese Wikipedia. The results show a novel approach to data integration that achieves higher accuracy by combining algorithms. The study also built the first Vietnamese oil and gas ontology in Vietnam. This ontology is a useful tool for staff in the oil and gas industry for training, research, and daily search.
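The two measures named in the abstract can be illustrated with a minimal Python sketch. It uses the standard definitions of normalized Google distance (computed from page hit counts) and cosine similarity (computed from term vectors). The abstract does not say how the paper combines them, so `combined_score`, its `weight` parameter, and the `exp(-ngd)` conversion from distance to similarity are assumptions made for illustration only, not the authors' method.

```python
import math


def normalized_google_distance(fx, fy, fxy, n):
    """Normalized Google distance from hit counts.

    fx, fy: number of pages containing each term on its own.
    fxy:    number of pages containing both terms.
    n:      total number of indexed pages (assumed larger than fx and fy).
    """
    if fx <= 0 or fy <= 0 or fxy <= 0:
        # Terms that never co-occur are treated as infinitely distant.
        return float("inf")
    log_fx, log_fy, log_fxy = math.log(fx), math.log(fy), math.log(fxy)
    return (max(log_fx, log_fy) - log_fxy) / (math.log(n) - min(log_fx, log_fy))


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def combined_score(ngd, cos, weight=0.5):
    """Hypothetical combination: weighted average of two similarities.

    The distance is mapped to a similarity in [0, 1] with exp(-ngd);
    this mapping and the weighting are illustrative assumptions.
    """
    ngd_similarity = math.exp(-ngd)
    return weight * ngd_similarity + (1 - weight) * cos
```

A candidate Wikipedia term could then be scored against a known dictionary term by calling `combined_score(normalized_google_distance(...), cosine_similarity(...))` and keeping candidates above a chosen threshold; the threshold would need to be tuned on labeled examples.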

Item Type: Conference or Workshop Item (Paper)
Subjects: Information Technology (IT)
Divisions: Faculty of Information Technology (FIT)
Depositing User: Hà Quang Thụy
Date Deposited: 18 Apr 2018 03:37
Last Modified: 18 Apr 2018 03:37
URI: http://eprints.uet.vnu.edu.vn/eprints/id/eprint/2942
