eprintid: 3349 rev_number: 11 eprint_status: archive userid: 345 dir: disk0/00/00/33/49 datestamp: 2018-12-20 01:44:39 lastmod: 2018-12-20 01:44:39 status_changed: 2018-12-20 01:44:39 type: monograph metadata_visibility: show creators_name: Pham Van, Long creators_name: Vu Ngoc, Sang creators_name: Nguyen Van, Vinh creators_id: vinhnv@vnu.edu.vn corp_creators: VNU University of Engineering and Technology corp_creators: VNU University of Engineering and Technology corp_creators: VNU University of Engineering and Technology title: Study of Information Extraction in Resume ispublished: unpub subjects: IT divisions: fac_fit abstract: This paper deals with the parsing application developed for the resumes (or CV) received in multiple formats like doc, docx, pdf, txt. These resumes can be automatically retrieved and processed by a resume information extraction system. Extracted information such as name, phone / mobile number, e-mail id, qualification, experience, skill sets etc., can be stored as a structured information in a database and then can be used in many different areas. Our system consists of 4 phases: Text Segmentation, Name Entity Recognition using Rule-based, Find Name Entities using Deep Neural Network and Text Normalization. Our work is conducted on a medium-sized collections of CV files in Vietnamese. We archived promising results with over 81% F1 for NER and also compared our model with other systems date: 2018 publisher: Conference full_text_status: public monograph_type: technical_report citation: Pham Van, Long and Vu Ngoc, Sang and Nguyen Van, Vinh (2018) Study of Information Extraction in Resume. Technical Report. Conference. (Unpublished) document_url: https://eprints.uet.vnu.edu.vn/eprints/id/eprint/3349/1/Study-of-Information-Extraction-in-CV_paper%20%281%29.pdf