eprintid: 2532 rev_number: 6 eprint_status: archive userid: 279 dir: disk0/00/00/25/32 datestamp: 2017-06-16 06:10:57 lastmod: 2017-06-16 06:10:57 status_changed: 2017-06-16 06:10:57 type: article metadata_visibility: show creators_name: Nguyen, Hai Chau creators_id: chaunh@vnu.edu.vn title: Enhancing Cholera Outbreaks Prediction Performance in Hanoi, Vietnam Using Solar Terms and Resampling Data ispublished: inpress subjects: IT divisions: fac_fit abstract: A solar term is an ancient Chinese concept to indicate a point of season change in lunisolar calendars. Solar terms are currently in use in China and nearby countries including Vietnam. In this paper we propose a new solution to increase performance of cholera outbreaks prediction in Hanoi, Vietnam. The new solution is a combination of solar terms, training data resampling and classification methods. Experimental results show that using solar terms in combination with ROSE resampling and random forests method delivers high area under the Receiver Operating Characteristic curve (AUC), balanced sensitivity and specificity. Without interaction effects the solar terms help increasing mean of AUC by 12.66%. The most important predictor in the solution is Sun’s ecliptical longitude corresponding to solar terms. Among the solar terms, "frost descent" and "start of summer" are the most important. date: 2017-09-27 date_type: published publisher: Springer official_url: http://cyprusconferences.org/iccci2017/ full_text_status: public publication: 9th International Conference on Computational Collective Intelligence refereed: TRUE citation: Nguyen, Hai Chau (2017) Enhancing Cholera Outbreaks Prediction Performance in Hanoi, Vietnam Using Solar Terms and Resampling Data. 9th International Conference on Computational Collective Intelligence . (In Press) document_url: https://eprints.uet.vnu.edu.vn/eprints/id/eprint/2532/1/paper_71.pdf