%A Duc Linh Nguyen %A Duc Chuc Man %A Quang Hung Bui %A Thi Nhat Thanh Nguyen %T Standardization procedure for automatic environmental data: A case study in Hanoi, Vietnam %X In Vietnam, environmental data collected from ground-based stations may contain abnormal or missing values due to several problems during operation, i.e. sensor's problems. This paper proposes a standardization procedure which try to detect unusual values and fill in missing data. Experiments were conducted for PM10 data. Two datasets measured in 01/2011 and 01/2012 at Nguyen Van Cu station in Hanoi, Vietnam is used for experiments. For the abnormal detection process, unusual data can be informed to the data analyzers at ground stations for judging. For the missing filling process, the first dataset is used as training dataset to construct regression models for predicting missing data, the second dataset is used as testing data. In the worst case, suppose 100% PM10 is missing, Root Mean Square Error (RMSE) and Mean Absolute Percentage Error (MAPE) are 51 µg/m3 and 45% respectively. Correlation coefficient (R) between original PM10 data and predicted PM10 data is 0.56. In addition, different scenarios taking account of percentage of missing data of the whole testing dataset are also considered. Experimental results showed that it is best to perform missing filling process on datasets that contain 10% to 30% of missing data. For this case, RMSE ranges from 15–25 µg/m3 and MAPE varies from 5 to 13%. %K Atmospheric measurements;Correlation;Data models;Filling;Monitoring;Pollution measurement;Training;PM10;abnormal detection;environmental data;missing filling %P 321-326 %D 2016 %C Hanoi, Vietnam %L SisLab2776