Pham Van, Long and Vu Ngoc, Sang and Nguyen Van, Vinh (2018) Study of Information Extraction in Resume. Technical Report. Conference. (Unpublished)
Abstract
This paper presents a parsing application for resumes (CVs) received in multiple formats such as doc, docx, pdf, and txt. A resume information extraction system can retrieve and process these resumes automatically. Extracted information such as name, phone or mobile number, e-mail address, qualifications, experience, and skill sets can be stored as structured information in a database and then used in many different areas. Our system consists of four phases: Text Segmentation, Rule-based Named Entity Recognition (NER), Named Entity Detection using a Deep Neural Network, and Text Normalization. Our work is conducted on a medium-sized collection of CV files in Vietnamese. We achieved promising results, with an F1 score of over 81% for NER, and also compared our model with other systems.
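As a rough illustration of what a rule-based extraction phase can look like, the sketch below pulls e-mail addresses and phone numbers out of plain resume text with regular expressions. It is not the report's implementation: the abstract does not give the actual rules, so the patterns `EMAIL_RE` and `PHONE_RE` and the function `extract_contacts` are hypothetical names and assumptions chosen for this example.

```python
import re

# Hypothetical patterns: the report's real rules are not stated in the abstract.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
# Loose phone pattern: optional "+", then 9 to 15 digits that may be
# separated by single spaces, dots, or hyphens (never across line breaks).
PHONE_RE = re.compile(r"\+?\d(?:[ .-]?\d){8,14}")


def extract_contacts(text):
    """Return e-mail addresses and phone numbers found in plain resume text."""
    emails = EMAIL_RE.findall(text)
    # Normalize each phone number by stripping the separators.
    phones = [re.sub(r"[ .-]", "", match) for match in PHONE_RE.findall(text)]
    return {"emails": emails, "phones": phones}


if __name__ == "__main__":
    sample = "Nguyen Van A\nEmail: a.nguyen@example.com\nMobile: 0912 345 678\n"
    print(extract_contacts(sample))
```

Fields with a regular surface form, such as e-mail and phone, suit this kind of rule. Names, qualifications, and skills vary far more, which is presumably why the system pairs the rules with a neural model.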
| Item Type: | Technical Report |
|---|---|
| Subjects: | Information Technology (IT) |
| Divisions: | Faculty of Information Technology (FIT) |
| Depositing User: | Nguyễn V |
| Date Deposited: | 20 Dec 2018 01:44 |
| Last Modified: | 20 Dec 2018 01:44 |
| URI: | http://eprints.uet.vnu.edu.vn/eprints/id/eprint/3349 |