Improved k-NN Regression Model Using Random Forests for Air Pollution Prediction

Rajya Lakshmi, L.

Improved k-NN Regression Model Using Random Forests for Air Pollution Prediction

dc.contributor.author	Rajya Lakshmi, L.
dc.date.accessioned	2024-05-07T09:58:46Z
dc.date.available	2024-05-07T09:58:46Z
dc.date.issued	2023
dc.description.abstract	In this paper, we review various k-Nearest-Neighbor (k-NN) based models and their accuracies to develop a better model to predict concentrations of air pollutants. The proposed model splits the range of target variable values into a number of buckets first. Then, a hybrid k-NN model, which is a combination of weighted attribute k-NN and distance-weighted k-NN, and where the weights are assigned by calculating Information Gain, is used for each attribute, to calculate the target variable value of each test case. The proposed model decreases the root mean square error (RMSE) of predicted NO, NO 2 and NO x values by 28.29%, 29.44%, and 16.51% respectively, compared to the state-of the-art. Similarly, the mean absolute error (MAE) values for NO, NO 2 , and NO x are decreased by 18.26%, 33.67%, and 14.54%, compared to the state-of the-art. This model gives good results when the size of each bucket is nearly equal.	en_US
dc.identifier.uri	https://ieeexplore.ieee.org/document/10216028
dc.identifier.uri	http://dspace.bits-pilani.ac.in:8080/jspui/xmlui/handle/123456789/14746
dc.language.iso	en	en_US
dc.publisher	IEEE	en_US
dc.subject	Computer Science	en_US
dc.subject	k-Nearest-Neighbor (k-NN)	en_US
dc.subject	Random forests	en_US
dc.subject	Air Pollution Data Analysis	en_US
dc.title	Improved k-NN Regression Model Using Random Forests for Air Pollution Prediction	en_US
dc.type	Article	en_US

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Department of Computer Science and Information Systems