TY - GEN
T1 - Data Analytics on Real-Time Air Pollution Monitoring System Derived from a Wireless Sensor Network
AU - Fuertes, Walter
AU - Cadena, Alyssa
AU - Torres, Jenny
AU - Benítez, Diego
AU - Tapia, Freddy
AU - Toulkeridis, Theofilos
N1 - Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
PY - 2019
Y1 - 2019
N2 - Air pollution is a problem that causes adverse effects, which tends to interfere with human comfort, health or well-being, and that may cause serious environmental damage. In this regard, this study aims to analyze large data sets generated by real-time wireless sensor networks that determine different air pollutants. Business Intelligence and Data Mining techniques have been applied in order to support subsequent decision-making strategies. For normalization and modeling, we applied the CRISP-DM methodology using the Pentaho Data Integration. Then, the Sap Lumira has been applied in order to acquire models of tables and views. For the data analysis, R-Studio has been used. For validation, Clustering has been applied using the k-means algorithm by the Jambu method, where it has been proceeded to check the consistency of these, being later stored and debugged in PostgreSQL. Results demonstrate that the increase in air pollutants is directly related to the traffic hours, which may cause an increase of asthma or sick related syndrome in the population. This analysis may also serve as a source of information to authorities for improving public policies in such matter.
AB - Air pollution is a problem that causes adverse effects, which tends to interfere with human comfort, health or well-being, and that may cause serious environmental damage. In this regard, this study aims to analyze large data sets generated by real-time wireless sensor networks that determine different air pollutants. Business Intelligence and Data Mining techniques have been applied in order to support subsequent decision-making strategies. For normalization and modeling, we applied the CRISP-DM methodology using the Pentaho Data Integration. Then, the Sap Lumira has been applied in order to acquire models of tables and views. For the data analysis, R-Studio has been used. For validation, Clustering has been applied using the k-means algorithm by the Jambu method, where it has been proceeded to check the consistency of these, being later stored and debugged in PostgreSQL. Results demonstrate that the increase in air pollutants is directly related to the traffic hours, which may cause an increase of asthma or sick related syndrome in the population. This analysis may also serve as a source of information to authorities for improving public policies in such matter.
KW - Air pollution
KW - Business Intelligence
KW - Data Mining
KW - Data analytics
KW - Pattern recognition
KW - Wireless sensor network
UR - http://www.scopus.com/inward/record.url?scp=85061373107&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-11890-7_6
DO - 10.1007/978-3-030-11890-7_6
M3 - Contribución a la conferencia
AN - SCOPUS:85061373107
SN - 9783030118891
T3 - Advances in Intelligent Systems and Computing
SP - 57
EP - 67
BT - Information Technology and Systems - Proceedings of ICITS 2019
A2 - Paredes, Manolo
A2 - Ferrás, Carlos
A2 - Rocha, Álvaro
PB - Springer Verlag
T2 - International Conference on Information Technology and Systems, ICITS 2019
Y2 - 6 February 2019 through 8 February 2019
ER -