Can, Duy Cat and Ho, Thi Nga and Chng, Eng Siong (2018) A Hybrid Deep Learning Architecture for Sentence Unit Detection. In: International Conference on Asian Language Processing (IALP 2018 ), 15-18 November, 2018, Bandung, Indonesia. (In Press)
PDF
- Published Version
Restricted to Registered users only Download (913kB) | Request a copy |
Abstract
Automatic speech recognition systems currently deliver an unpunctuated sequence of words which is hard to peruse for human and degrades the performance of the downstream natural language processing tasks. In this paper, we propose a hybrid approach for Sentence Unit Detection, in which the focus is on adding the full stop [.] to the unstructured text. Our model profits from the advantage of two dominant deep learning architectures: (i) the ability to learn the long dependencies in both directions of a bidirectional Long Short-Term Memory; (ii) the ability to capture the local context with Convolutional Neural Networks. We also empirically study the training objective of our networks using extra-loss and further investigate the impacts of each model component on the overall result. Experiments conducted on two large-scale datasets demonstrated that the proposed architecture outperforms previous separated methods by a substantial margin of 1.82-1.91% of F1.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Subjects: | Information Technology (IT) |
Divisions: | Faculty of Information Technology (FIT) |
Depositing User: | Duy-Cat Can |
Date Deposited: | 12 Dec 2018 06:35 |
Last Modified: | 12 Dec 2018 06:35 |
URI: | http://eprints.uet.vnu.edu.vn/eprints/id/eprint/3169 |
Actions (login required)
View Item |